Preparing ML data with Amazon SageMaker Data Wrangler
Amazon SageMaker has a lot of capabilities and features to assist data scientists and ML engineers with the different ML requirements. One of the capabilities of SageMaker focused on accelerating data preparation and data analysis is SageMaker Data Wrangler:
Figure 5.18 – The primary functionalities available in SageMaker Data Wrangler
In Figure 5.18, we can see what we can do with our data when using SageMaker Data Wrangler:
- First, we can import data from a variety of data sources such as Amazon S3, Amazon Athena, and Amazon Redshift.
- Next, we can create a data flow and transform the data using a variety of data formatting and data transformation options. We can also analyze and visualize the data using both inbuilt and custom options in just a few clicks.
- Finally, we can automate the data preparation workflows by exporting one or more of the transformations configured in the...