You're reading from Natural Language Processing with TensorFlow The definitive NLP book to implement the most sought-after machine learning models and tasks

Product type Paperback

Published in Jul 2022

Publisher Packt

ISBN-13 9781838641351

Length 514 pages

Edition 2nd Edition

Languages

Processing

Tools

Processing

Concepts

Machine Learning

Author (1):

Thushan Ganegedara

View More author details

Table of Contents (15) Chapters

Preface

1. Introduction to Natural Language Processing FREE CHAPTER

2. Understanding TensorFlow 2

3. Word2vec – Learning Word Embeddings

4. Advanced Word Vector Algorithms

5. Sentence Classification with Convolutional Neural Networks

6. Recurrent Neural Networks

7. Understanding Long Short-Term Memory Networks

8. Applications of LSTM – Generating Text

9. Sequence-to-Sequence Learning – Neural Machine Translation

10. Transformers

11. Image Captioning with Transformers

12. Other Books You May Enjoy

13. Index

Appendix A: Mathematical Foundations and Advanced TensorFlow

How LSTMs solve the vanishing gradient problem

As we discussed earlier, even though RNNs are theoretically sound, in practice they suffer from a serious drawback. That is, when Backpropagation Through Time (BPTT) is used, the gradient diminishes quickly, which allows us to propagate the information of only a few time steps. Consequently, we can only store the information of very few time steps, thus possessing only short-term memory. This in turn limits the usefulness of RNNs in real-world sequential tasks.

Often, useful and interesting sequential tasks (such as stock market predictions or language modeling) require the ability to learn and store long-term dependencies. Think of the following example for predicting the next word:

John is a talented student. He is an A-grade student and plays rugby and cricket. All the other students envy ______.

For us, this is a very easy task. The answer would be John. However, for an RNN, this is a difficult task. We are trying to predict...

The rest of the chapter is locked

You're reading from Natural Language Processing with TensorFlow The definitive NLP book to implement the most sought-after machine learning models and tasks

Table of Contents (15) Chapters

How LSTMs solve the vanishing gradient problem

Unlock this book and the full library FREE for 7 days

Authors (1)