Fine-tuning a BERT model for single-sentence binary classification
In this section, we will discuss how to fine-tune a pre-trained BERT model for sentiment analysis by using the popular IMDb sentiment dataset. Working with a GPU will speed up our learning process, but if you do not have such resources, you can work with a CPU as well for fine-tuning. Let’s get started.
To detect and store the current device, we can execute the following lines of code:
from torch import cuda
device = 'cuda' if cuda.is_available() else 'cpu'
We will use the DistilBertForSequenceClassification class here, which wraps the pre-trained DistilBERT model with a special sequence classification head on top. We can utilize this classification head to train the classification model, where the number of classes is 2 by default:
from transformers import (
    DistilBertTokenizerFast,
    DistilBertForSequenceClassification)
model_path = 'distilbert...