Matching datasets and tokenizers
Downloading benchmark datasets to train transformers has many advantages: the data has been prepared, every research lab uses the same references, and the performance of a transformer model can be compared to that of another model trained on the same data.
However, benchmarks alone are not enough to improve the performance of transformers. Furthermore, implementing a transformer model in production requires careful planning and well-defined best practices.
In this section, we will define some best practices to avoid critical stumbling blocks.
Then we will go through a few examples in Python, using cosine similarity to measure the limits of tokenization and encoding of datasets.
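To get a first intuition of what such a measurement looks like, here is a minimal sketch of a cosine similarity check between two sentences. It assumes scikit-learn is installed and uses a simple TF-IDF vectorizer; the vectorizer, function name, and example sentences are illustrative and may differ from the examples later in this section.

# A minimal sketch, assuming scikit-learn is installed; the vectorizer
# and example sentences are illustrative, not this section's own code.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

def similarity(text1, text2):
    # Build TF-IDF vectors for the two texts and compare them.
    vectors = TfidfVectorizer().fit_transform([text1, text2])
    return cosine_similarity(vectors[0], vectors[1])[0][0]

# Identical meaning, different surface forms: the score falls well
# below 1.0 because the two texts share only some of their tokens.
print(similarity("The cat sat on the mat.", "A cat was sitting on a mat."))

A value close to 1 indicates near-identical token vectors; a low value signals that the two texts, however close in meaning, do not share enough tokens. This is precisely the kind of gap between datasets and tokenizers that we will explore.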
Let's start with best practices.
Best practices
Raffel et al. (2019) defined a standard text-to-text T5 transformer model. They also went further: they began dispelling the myth that raw data can be used without preprocessing it first. Preprocessing data reduces training time. Common...