You're reading from Natural Language Processing with TensorFlow The definitive NLP book to implement the most sought-after machine learning models and tasks

Product type Paperback

Published in Jul 2022

Publisher Packt

ISBN-13 9781838641351

Length 514 pages

Edition 2nd Edition

Author (1):

Thushan Ganegedara

Table of Contents (15 chapters)

Preface

1. Introduction to Natural Language Processing

2. Understanding TensorFlow 2

3. Word2vec – Learning Word Embeddings

4. Advanced Word Vector Algorithms

5. Sentence Classification with Convolutional Neural Networks

6. Recurrent Neural Networks

7. Understanding Long Short-Term Memory Networks

8. Applications of LSTM – Generating Text

9. Sequence-to-Sequence Learning – Neural Machine Translation

10. Transformers

11. Image Captioning with Transformers

12. Other Books You May Enjoy

13. Index

Appendix A: Mathematical Foundations and Advanced TensorFlow

Transformer architecture

A Transformer is a type of Seq2Seq model (discussed in the previous chapter). Transformer models can work with both image and text data. A Transformer takes in a sequence of inputs and maps it to a sequence of outputs.

The Transformer model was originally proposed in the paper Attention Is All You Need by Vaswani et al. (https://arxiv.org/pdf/1706.03762.pdf). Like other Seq2Seq models, the Transformer consists of an encoder and a decoder (Figure 10.1):

Figure 10.1: The encoder-decoder architecture

Let’s understand how the Transformer model works using the machine translation task we studied previously. The encoder takes in a sequence of source-language tokens and produces a sequence of interim outputs. The decoder then takes in a sequence of target-language tokens and, at each time step, predicts the next token. Feeding the ground-truth target tokens to the decoder in this way is known as the teacher forcing technique. Both the encoder and the decoder use attention mechanisms to improve performance. For...
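To make the teacher forcing setup described above more concrete, here is a minimal sketch in plain Python of how decoder inputs and labels could be built from a target sentence. The special tokens `<s>` and `</s>`, the helper name `teacher_forcing_pairs`, and the example sentences are assumptions made for illustration; they do not come from the book, and no TensorFlow model is involved.

```python
# Hypothetical start/end markers; the excerpt does not name the special tokens.
START, END = "<s>", "</s>"


def teacher_forcing_pairs(target_tokens):
    """Build (decoder_inputs, decoder_labels) for one target sentence.

    The decoder input at step t is the ground-truth token at position t,
    and the label at step t is the ground-truth token at position t + 1.
    """
    full = [START] + list(target_tokens) + [END]
    decoder_inputs = full[:-1]   # everything except the final token
    decoder_labels = full[1:]    # the same sequence shifted left by one
    return decoder_inputs, decoder_labels


# Illustrative sentence pair (assumed, not taken from the book).
source_tokens = ["I", "like", "tea"]      # what the encoder would consume
target_tokens = ["Ich", "mag", "Tee"]     # what the decoder learns to produce

decoder_inputs, decoder_labels = teacher_forcing_pairs(target_tokens)

for step, (seen, expected) in enumerate(zip(decoder_inputs, decoder_labels)):
    print(f"step {step}: decoder input {seen!r} -> label {expected!r}")
```

At every step the decoder sees the true previous token rather than its own earlier prediction, which is what distinguishes teacher forcing from feeding the model's outputs back in at training time.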
