Containers and Accelerators on the Cloud
In this chapter, you’ll learn how to containerize your scripts and optimize them for accelerators on the cloud. We’ll learn about a range of accelerators for foundation models, including trade-offs around cost and performance across the entire machine learning lifecycle. You’ll learn about key aspects of Amazon SageMaker and AWS to train models on accelerators, optimize performance, and troubleshoot common issues. if you’re already familiar with containers and accelerators on AWS, feel free to skip this chapter.
In this chapter, we’re going to cover the following main topics:
- What are accelerators and why do they matter for foundation models?
- Containerize your scripts for accelerators on AWS
- Using accelerators with Amazon SageMaker
- Infrastructure optimizations on AWS
- Troubleshooting accelerator performance