Vertex AI data labeling and datasets
Datasets play such a significant role in the machine learning process that the quality of datasets has a huge impact on the ML model performance. As we discussed in Chapter 4, Developing and Deploying ML Models, data preparation is the first and most important step in any machine learning process.
Vertex AI Data labeling is a Google Cloud service that lets end users work with human workers to review and label datasets uploaded by users. After the datasets are labeled, they can be used to train machine learning models. The human workers are employed by Google, and the users will need to provide the dataset, the labels, and instructions to the human workers for labeling.
End users can also upload labeled datasets directly. Vertex AI datasets are part of a Google Cloud service that provides users with the ability to upload data of varying types for the purpose of building, training, and validating machine learning models. Currently, Vertex AI...