You're reading from Mastering Transformers The Journey from BERT to Large Language Models and Stable Diffusion

Product type Paperback

Published in Jun 2024

Publisher Packt

ISBN-13 9781837633784

Length 462 pages

Edition 2nd Edition

Languages

Python

Tools

BERT

Concepts

GPT/LLMs

Authors (2):

Savaş Yıldırım

Meysam Asgari- Chenaghlu

View More author details

Table of Contents (25) Chapters

Preface

1. Part 1: Recent Developments in the Field, Installations, and Hello World Applications

2. Chapter 1: From Bag-of-Words to the Transformers FREE CHAPTER

3. Chapter 2: A Hands-On Introduction to the Subject

4. Part 2: Transformer Models: From Autoencoders to Autoregressive Models

5. Chapter 3: Autoencoding Language Models

6. Chapter 4: From Generative Models to Large Language Models

7. Chapter 5: Fine-Tuning Language Models for Text Classification

8. Chapter 6: Fine-Tuning Language Models for Token Classification

9. Chapter 7: Text Representation

10. Chapter 8: Boosting Model Performance

11. Chapter 9: Parameter Efficient Fine-Tuning

12. Part 3: Advanced Topics

13. Chapter 10: Large Language Models

14. Chapter 11: Explainable AI (XAI) in NLP

15. Chapter 12: Working with Efficient Transformers

16. Chapter 13: Cross-Lingual and Multilingual Language Modeling

17. Chapter 14: Serving Transformer Models

18. Chapter 15: Model Tracking and Monitoring

19. Part 4: Transformers beyond NLP

20. Chapter 16: Vision Transformers

21. Chapter 17: Multimodal Generative Transformers

22. Chapter 18: Revisiting Transformers Architecture for Time Series

23. Index

Why subscribe?

24. Other Books You May Enjoy

Introduction to text classification

Text classification (also known as text categorization) is a way of mapping a document (sentence, Twitter/X post, book chapter, email content, and so on) to a category out of a predefined list (classes). In the case of two classes that have positive and negative labels, we use binary classification – more specifically, sentiment analysis. For more than two classes, we call it multi-class classification, where the classes are mutually exclusive, or multi-label classification, where the classes are not mutually exclusive, which means a document can receive more than one label. For instance, the content of a news article may be related to sports and politics at the same time. Beyond this classification, we may want to score the documents in a range of [-1,1] or rank them in a range of [1-5]. We can solve this kind of problem with a regression model, where the type of the output is numeric, not categorical.

Luckily, the transformer architecture...