Understanding BERT
Looking at the transformer’s encoder/decoder architecture discussed in the Introducing transformers section of Chapter 7, Summarizing Wikipedia Articles, we can observe a clear separation of tasks. The encoder is responsible for extracting features from an input sentence, such as syntax, grammar, and context, while the decoder maps this representation to a target sequence – for example, translating it into another language. This separation makes the two components self-contained; therefore, they can be used independently.
This section introduces a state-of-the-art transformer-based technique for generating language representation models, named Bidirectional Encoder Representations from Transformers (BERT). BERT incorporates a stack of transformer encoders to understand language better.
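To make the idea of an encoder-only model more concrete, the following sketch shows how a pretrained BERT encoder stack can be used on its own to produce contextual representations for a sentence. It is not the code from this chapter; it assumes the Hugging Face transformers library, PyTorch, and the bert-base-uncased checkpoint purely for illustration.

```python
# A minimal sketch (assumed libraries: transformers, torch) of using BERT's
# encoder stack on its own to extract contextual token representations.
import torch
from transformers import BertTokenizer, BertModel

# Load the WordPiece tokenizer and the 12-layer encoder stack of BERT base.
tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertModel.from_pretrained("bert-base-uncased")

# Encode a sample sentence; the tokenizer adds the special [CLS] and [SEP] tokens.
inputs = tokenizer("Transformers separate encoding from decoding.",
                   return_tensors="pt")

# Run the encoders in inference mode (no gradient computation needed).
with torch.no_grad():
    outputs = model(**inputs)

# One 768-dimensional contextual vector per input token: [1, seq_len, 768].
print(outputs.last_hidden_state.shape)
```

Because no decoder is involved, the output is not a target sequence but a set of feature vectors, which downstream tasks such as classification or summarization can build on.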
Similar to word embeddings, the method belongs to the self-supervised learning family because it does not require human-annotated labels. Therefore, BERT can...