Working with Hugging Face models
The Hugging Face Transformers library brings together multiple pre-trained transformer models for common NLP tasks such as text classification, text generation, and information extraction. These models include BERT, RoBERTa, GPT, GPT-2, and other state-of-the-art transformer architectures. What is the advantage of using pre-trained models? With pre-trained models and transfer learning, we can build accurate models in a shorter amount of time, because we start from a good set of weights learned by models that were trained to solve a similar set of problems.
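As a quick illustration (not part of the recipe itself), the Transformers library lets us download pre-trained weights with a single `from_pretrained()` call; here is a minimal sketch that loads DistilBERT for a two-class text classification task. The model name and the number of labels are assumptions for this example:

```python
# Minimal sketch: loading pre-trained DistilBERT weights with the
# Hugging Face Transformers library (model name and num_labels are
# illustrative assumptions for a binary text classification task).
from transformers import AutoTokenizer, AutoModelForSequenceClassification

tokenizer = AutoTokenizer.from_pretrained("distilbert-base-uncased")
model = AutoModelForSequenceClassification.from_pretrained(
    "distilbert-base-uncased",
    num_labels=2,
)

# The pre-trained weights give us a strong starting point; fine-tuning
# then adapts the classification head (and optionally the encoder) to
# our own labeled dataset.
```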
In this recipe, we will start with the pre-trained DistilBERT model and use the HuggingFace estimator class from the SageMaker Python SDK, along with a custom Python script file, to fine-tune it. We will use the synthetic text data from the Generating a synthetic dataset for text classification problems recipe of Chapter 8, Solving NLP, Image Classification...
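To give a sense of what this looks like before we walk through the steps, here is a minimal sketch of configuring and launching a fine-tuning job with the HuggingFace estimator. The script name, framework versions, instance type, hyperparameters, and S3 path are illustrative placeholders, not the recipe's exact values:

```python
# Minimal sketch of fine-tuning DistilBERT with the HuggingFace estimator
# from the SageMaker Python SDK. Script name, versions, instance type,
# hyperparameters, and S3 paths below are placeholder assumptions.
import sagemaker
from sagemaker.huggingface import HuggingFace

role = sagemaker.get_execution_role()  # IAM role used by the training job

estimator = HuggingFace(
    entry_point="train.py",            # custom training script (placeholder name)
    source_dir="scripts",              # directory containing the script (assumed)
    instance_type="ml.p3.2xlarge",     # GPU instance for fine-tuning (illustrative)
    instance_count=1,
    role=role,
    transformers_version="4.6",        # framework versions are examples only
    pytorch_version="1.7",
    py_version="py36",
    hyperparameters={
        "model_name": "distilbert-base-uncased",
        "epochs": 3,
    },
)

# Launch the training job, pointing at the synthetic dataset uploaded to S3
estimator.fit({"train": "s3://<bucket>/path/to/train/"})
```

The estimator handles provisioning the training instance, pulling a container with Transformers and PyTorch pre-installed, and running our custom script against the data channels we pass to `fit()`.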