You're reading from Amazon SageMaker Best Practices Proven tips and tricks to build successful machine learning solutions on Amazon SageMaker

Product type Paperback

Published in Sep 2021

Publisher Packt

ISBN-13 9781801070522

Length 348 pages

Edition 1st Edition

Languages

Python

Tools

Amazon SimpleDB

Concepts

Machine Learning

Authors (3):

Randy DeFauw

Shelbee Eigenbrode

Sireesha Muppala

View More author details

Table of Contents (20) Chapters

Preface

1. Section 1: Processing Data at Scale

2. Chapter 1: Amazon SageMaker Overview FREE CHAPTER

3. Chapter 2: Data Science Environments

4. Chapter 3: Data Labeling with Amazon SageMaker Ground Truth

5. Chapter 4: Data Preparation at Scale Using Amazon SageMaker Data Wrangler and Processing

6. Chapter 5: Centralized Feature Repository with Amazon SageMaker Feature Store

7. Section 2: Model Training Challenges

8. Chapter 6: Training and Tuning at Scale

9. Chapter 7: Profile Training Jobs with Amazon SageMaker Debugger

10. Section 3: Manage and Monitor Models

11. Chapter 8: Managing Models at Scale Using a Model Registry

12. Chapter 9: Updating Production Models Using Amazon SageMaker Endpoint Production Variants

13. Chapter 10: Optimizing Model Hosting and Inference Costs

14. Chapter 11: Monitoring Production Models with Amazon SageMaker Model Monitor and Clarify

15. Section 4: Automate and Operationalize Machine Learning

16. Chapter 12: Machine Learning Automated Workflows

17. Chapter 13:Well-Architected Machine Learning with Amazon SageMaker

18. Chapter 14: Managing SageMaker Features across Accounts

19. Other Books You May Enjoy

What this book covers

Chapter 1, Amazon SageMaker Overview, provides a high-level overview of the Amazon SageMaker capabilities that map to the various phases of the machine learning process. This sets a foundation for a best practice discussion of using SageMaker capabilities to handle data science challenges.

Chapter 2, Data Science Environments, provides a brief overview of technical requirements along with a discussion on setting up the necessary data science environments using Amazon SageMaker. This sets the foundation for building and automating ML solutions throughout the rest of the book.

Chapter 3, Data Labeling with Amazon SageMaker Ground Truth, kicks off with a review of challenges involved in labeling data at scale – costs, time, unique labeling needs, inaccuracies, and bias. Best practices to use Amazon SageMaker Ground Truth to address the challenges identified are discussed.

Chapter 4, Data Preparation at Scale Using Amazon SageMaker Data Wrangler and Processing, kicks off with a review of challenges involved in data preparation at scale – compute/memory resource constraints, long processing times, along with the challenges of the duplication of feature engineering efforts, bias detection, and understanding feature importance. A discussion on Amazon SageMaker capabilities to address these challenges along with best practices to apply follows.

Chapter 5, Centralized Feature Repository with Amazon SageMaker Feature Store, provides best practices for using a centralized repository for features built with Amazon SageMaker Feature Store. Techniques to ingest features and provide access to features to satisfy access time requirements are discussed.

Chapter 6, Training and Tuning at Scale, provides best practices for training and tuning machine learning models with large datasets using Amazon SageMaker. Techniques such as distributed training with data and model parallelism, automated model tuning, and grouping multiple training jobs to identify the best performing job are discussed.

Chapter 7, Profile Training Jobs with Amazon SageMaker Debugger, discusses best practices to debug, monitor, and profile training jobs to detect long-running non-converging jobs and eliminate resource bottlenecks. The monitoring and profiling capabilities offered by Amazon SageMaker Debugger help improve training time and reduce training costs.

Chapter 8, Managing Models at Scale Using a Model Registry, introduces SageMaker Model Registry as a centralized catalog of trained models. Models can be deployed from the registry and the metadata maintained in the registry is useful to understand the deployment history of an individual model. Model Registry is an important component of addressing the challenge of model deployment automation with CI/CD.

Chapter 9, Updating Production Models Using Amazon SageMaker Endpoint Production Variants, addresses the challenge of updating models in production with minimal disruption to the model consumers using Amazon SageMaker Endpoint production variants. The same production variants will be used to showcase advanced strategies such as canary deployments, A/B testing, blue/green deployments that balance cost with downtime, and ease of rollbacks.

Chapter 10, Optimizing Model Hosting and Inference Costs, introduces best practices to optimize hosting and inference costs on Amazon SageMaker. Multiple deployment strategies are discussed to meet the computation needs and response time requirements under varying inference traffic demands.

Chapter 11, Monitoring Production Models with Amazon SageMaker Model Monitor and Clarify, introduces best practices to monitor the quality of production models and receive proactive alerts on model quality degradation. You will learn how to monitor for data bias, model bias, bias drift, and feature attribution drift using Amazon SageMaker Model Monitor and SageMaker Clarify.

Chapter 12, Machine Learning Automated Workflows, brings together data processing, training, deployment, and model management into automated workflows that can be orchestrated and integrated into end-to-end solutions.

Chapter 13, Well-Architected Machine Learning with Amazon SageMaker, applies best practices provided by the AWS Well-Architected Framework to building ML solutions on Amazon SageMaker.

Chapter 14, Managing SageMaker Features across Accounts, discusses best practices for using Amazon SageMaker capabilities in a cross-account setup involving multiple AWS accounts, which allows you to better govern and manage machine learning activities across the machine learning development lifecycle.