Search icon CANCEL
Arrow left icon
Explore Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Conferences
Free Learning
Arrow right icon
Arrow up icon
GO TO TOP
Amazon SageMaker Best Practices

You're reading from   Amazon SageMaker Best Practices Proven tips and tricks to build successful machine learning solutions on Amazon SageMaker

Arrow left icon
Product type Paperback
Published in Sep 2021
Publisher Packt
ISBN-13 9781801070522
Length 348 pages
Edition 1st Edition
Languages
Arrow right icon
Authors (3):
Arrow left icon
Randy DeFauw Randy DeFauw
Author Profile Icon Randy DeFauw
Randy DeFauw
Shelbee Eigenbrode Shelbee Eigenbrode
Author Profile Icon Shelbee Eigenbrode
Shelbee Eigenbrode
Sireesha Muppala Sireesha Muppala
Author Profile Icon Sireesha Muppala
Sireesha Muppala
Arrow right icon
View More author details
Toc

Table of Contents (20) Chapters Close

Preface 1. Section 1: Processing Data at Scale
2. Chapter 1: Amazon SageMaker Overview FREE CHAPTER 3. Chapter 2: Data Science Environments 4. Chapter 3: Data Labeling with Amazon SageMaker Ground Truth 5. Chapter 4: Data Preparation at Scale Using Amazon SageMaker Data Wrangler and Processing 6. Chapter 5: Centralized Feature Repository with Amazon SageMaker Feature Store 7. Section 2: Model Training Challenges
8. Chapter 6: Training and Tuning at Scale 9. Chapter 7: Profile Training Jobs with Amazon SageMaker Debugger 10. Section 3: Manage and Monitor Models
11. Chapter 8: Managing Models at Scale Using a Model Registry 12. Chapter 9: Updating Production Models Using Amazon SageMaker Endpoint Production Variants 13. Chapter 10: Optimizing Model Hosting and Inference Costs 14. Chapter 11: Monitoring Production Models with Amazon SageMaker Model Monitor and Clarify 15. Section 4: Automate and Operationalize Machine Learning
16. Chapter 12: Machine Learning Automated Workflows 17. Chapter 13:Well-Architected Machine Learning with Amazon SageMaker 18. Chapter 14: Managing SageMaker Features across Accounts 19. Other Books You May Enjoy

What this book covers

Chapter 1, Amazon SageMaker Overview, provides a high-level overview of the Amazon SageMaker capabilities that map to the various phases of the machine learning process. This sets a foundation for a best practice discussion of using SageMaker capabilities to handle data science challenges.

Chapter 2, Data Science Environments, provides a brief overview of technical requirements along with a discussion on setting up the necessary data science environments using Amazon SageMaker. This sets the foundation for building and automating ML solutions throughout the rest of the book.

Chapter 3, Data Labeling with Amazon SageMaker Ground Truth, kicks off with a review of challenges involved in labeling data at scale – costs, time, unique labeling needs, inaccuracies, and bias. Best practices to use Amazon SageMaker Ground Truth to address the challenges identified are discussed.

Chapter 4, Data Preparation at Scale Using Amazon SageMaker Data Wrangler and Processing, kicks off with a review of challenges involved in data preparation at scale – compute/memory resource constraints, long processing times, along with the challenges of the duplication of feature engineering efforts, bias detection, and understanding feature importance. A discussion on Amazon SageMaker capabilities to address these challenges along with best practices to apply follows.

Chapter 5, Centralized Feature Repository with Amazon SageMaker Feature Store, provides best practices for using a centralized repository for features built with Amazon SageMaker Feature Store. Techniques to ingest features and provide access to features to satisfy access time requirements are discussed.

Chapter 6, Training and Tuning at Scale, provides best practices for training and tuning machine learning models with large datasets using Amazon SageMaker. Techniques such as distributed training with data and model parallelism, automated model tuning, and grouping multiple training jobs to identify the best performing job are discussed.

Chapter 7, Profile Training Jobs with Amazon SageMaker Debugger, discusses best practices to debug, monitor, and profile training jobs to detect long-running non-converging jobs and eliminate resource bottlenecks. The monitoring and profiling capabilities offered by Amazon SageMaker Debugger help improve training time and reduce training costs.

Chapter 8, Managing Models at Scale Using a Model Registry, introduces SageMaker Model Registry as a centralized catalog of trained models. Models can be deployed from the registry and the metadata maintained in the registry is useful to understand the deployment history of an individual model. Model Registry is an important component of addressing the challenge of model deployment automation with CI/CD.

Chapter 9, Updating Production Models Using Amazon SageMaker Endpoint Production Variants, addresses the challenge of updating models in production with minimal disruption to the model consumers using Amazon SageMaker Endpoint production variants. The same production variants will be used to showcase advanced strategies such as canary deployments, A/B testing, blue/green deployments that balance cost with downtime, and ease of rollbacks.

Chapter 10, Optimizing Model Hosting and Inference Costs, introduces best practices to optimize hosting and inference costs on Amazon SageMaker. Multiple deployment strategies are discussed to meet the computation needs and response time requirements under varying inference traffic demands.

Chapter 11, Monitoring Production Models with Amazon SageMaker Model Monitor and Clarify, introduces best practices to monitor the quality of production models and receive proactive alerts on model quality degradation. You will learn how to monitor for data bias, model bias, bias drift, and feature attribution drift using Amazon SageMaker Model Monitor and SageMaker Clarify.

Chapter 12, Machine Learning Automated Workflows, brings together data processing, training, deployment, and model management into automated workflows that can be orchestrated and integrated into end-to-end solutions.

Chapter 13, Well-Architected Machine Learning with Amazon SageMaker, applies best practices provided by the AWS Well-Architected Framework to building ML solutions on Amazon SageMaker.

Chapter 14, Managing SageMaker Features across Accounts, discusses best practices for using Amazon SageMaker capabilities in a cross-account setup involving multiple AWS accounts, which allows you to better govern and manage machine learning activities across the machine learning development lifecycle.

lock icon The rest of the chapter is locked
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at ₹800/month. Cancel anytime