Hosting distributed models on SageMaker
In Chapter 5, we covered distribution fundamentals, where you learned how to think about splitting your model and datasets across multiple GPUs. The good news is that you can apply this same logic to hosting the model. In this case, you'll be more interested in model parallelism, placing layers and tensors across multiple GPU partitions. You won't actually need a data parallel framework, because we're not using backpropagation. We're only running a forward pass through the network and getting inference results. There's no gradient descent or weight updating involved.
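To make this concrete, here is a minimal sketch of model-parallel inference using the Hugging Face Transformers and Accelerate libraries rather than any SageMaker-specific API; the checkpoint name and prompt are placeholders for illustration. Passing device_map="auto" asks Accelerate to partition the model's layers across all visible GPUs, so no single device has to hold the full set of weights, and the forward pass flows through the partitions automatically.

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "EleutherAI/gpt-j-6b"  # example checkpoint; swap in your own

tokenizer = AutoTokenizer.from_pretrained(model_name)

# device_map="auto" spreads the model's layers across every visible GPU,
# so no single device needs to fit all of the weights.
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    device_map="auto",
    torch_dtype=torch.float16,  # half precision roughly halves the memory footprint
)

# Inference is only a forward pass -- no gradients, no optimizer state.
prompt = "Distributed hosting lets us serve"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
with torch.no_grad():
    outputs = model.generate(**inputs, max_new_tokens=32)

print(tokenizer.decode(outputs[0], skip_special_tokens=True))

The same layer-placement idea carries over to SageMaker hosting: the serving container partitions the model across the GPUs on the endpoint instance and only ever runs forward passes.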
When would you use distributed model hosting? To integrate extremely large models into your applications! Generally, this is scoped to large language models; it's rare to see vision models stretch beyond a single GPU. Remember, in Chapter 4, Containers and Accelerators on the Cloud, we learned about different sizes of GPU memory. This is just as...