Section 2: Model Training Challenges
This section tackles the challenge of training at scale including using large datasets while saving costs, monitoring training resources to identify bottlenecks, speeding up long training jobs, and tracking multiple models trained for a common goal.
This section comprises the following chapters:
- Chapter 6, Training and Tuning at Scale
- Chapter 7, Profile Training Jobs with Amazon SageMaker Debugger