You're reading from Machine Learning with Amazon SageMaker Cookbook 80 proven recipes for data scientists and developers to perform machine learning experiments and deployments

Product type Paperback

Published in Oct 2021

Publisher Packt

ISBN-13 9781800567030

Length 762 pages

Edition 1st Edition

Languages

Python

Tools

AWS

Concepts

Machine Learning

Author (1):

Joshua Arvin Lat

View More author details

Table of Contents (11) Chapters

Preface

1. Chapter 1: Getting Started with Machine Learning Using Amazon SageMaker

2. Chapter 2: Building and Using Your Own Algorithm Container Image FREE CHAPTER

3. Chapter 3: Using Machine Learning and Deep Learning Frameworks with Amazon SageMaker

4. Chapter 4: Preparing, Processing, and Analyzing the Data

5. Chapter 5: Effectively Managing Machine Learning Experiments

6. Chapter 6: Automated Machine Learning in Amazon SageMaker

7. Chapter 7: Working with SageMaker Feature Store, SageMaker Clarify, and SageMaker Model Monitor

8. Chapter 8: Solving NLP, Image Classification, and Time-Series Forecasting Problems with Built-in Algorithms

9. Chapter 9: Managing Machine Learning Workflows and Deployments

10. Other Books You May Enjoy

Generating a synthetic dataset for analysis and transformation

In this recipe, we will generate a synthetic dataset that will be used in the next four recipes involving dimensionality reduction, cluster analysis, and conversion to protobuf recordIO format. We will generate one labeled version of the dataset and one unlabeled version of the dataset. This dataset will have two easily identifiable clusters, as shown in Figure 4.32. It will also have six columns for the labeled version of the dataset and five columns for the unlabeled version of the dataset.

Figure 4.32 – Synthetic dataset

After we have completed this recipe, we should have a synthetic dataset similar to what is shown in Figure 4.32. In the Performing dimensionality reduction with the built-in PCA algorithm recipe, we will use the PCA algorithm to perform dimensionality reduction with this synthetic dataset. In the Performing cluster analysis with the built-in KMeans algorithm recipe, we...

The rest of the chapter is locked

You're reading from Machine Learning with Amazon SageMaker Cookbook 80 proven recipes for data scientists and developers to perform machine learning experiments and deployments

Table of Contents (11) Chapters

Generating a synthetic dataset for analysis and transformation

Authors (1)

Personalised recommendations for you

You're reading from Machine Learning with Amazon SageMaker Cookbook 80 proven recipes for data scientists and developers to perform machine learning experiments and deployments

Table of Contents (11) Chapters

Generating a synthetic dataset for analysis and transformation

Unlock this book and the full library FREE for 7 days

Authors (1)

Personalised recommendations for you