Getting to know the data
Let’s first understand the data we will be working with, both directly and indirectly. We will rely on two datasets:
- The ILSVRC ImageNet dataset (http://image-net.org/download)
- The MS-COCO dataset (http://cocodataset.org/#download)
We will not use the first dataset directly, but it is essential for caption learning: it contains images and their respective class labels (for example, cat, dog, and car). Rather than downloading this dataset and training a model on it from scratch, we will use a Vision Transformer that has already been trained on it. Next, we will use the MS-COCO dataset, which contains images and their respective captions. We will learn directly from this dataset by mapping each image to a fixed-size feature vector with the Vision Transformer and then mapping that vector to the corresponding caption with a text-based Transformer (we will discuss this process in detail later).
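To make the first half of that pipeline concrete, here is a minimal sketch of extracting a fixed-size feature vector from an image with a pretrained Vision Transformer. It assumes the Hugging Face transformers and torch packages, a local file named example.jpg, and the google/vit-base-patch16-224-in21k checkpoint; these names, and the choice of the [CLS] embedding as the feature vector, are illustrative assumptions rather than the exact setup we build later.

```python
import torch
from PIL import Image
from transformers import ViTImageProcessor, ViTModel

# Load a ViT pretrained on ImageNet (checkpoint name is an assumption)
processor = ViTImageProcessor.from_pretrained("google/vit-base-patch16-224-in21k")
model = ViTModel.from_pretrained("google/vit-base-patch16-224-in21k")

# "example.jpg" is a placeholder for any image you want to encode
image = Image.open("example.jpg").convert("RGB")
inputs = processor(images=image, return_tensors="pt")

# Forward pass without gradients; we only want features, not training
with torch.no_grad():
    outputs = model(**inputs)

# One common choice of fixed-size vector: the [CLS] token embedding.
# For this checkpoint it is a 768-dimensional vector per image; a
# text-based Transformer decoder would then consume such features
# to generate the caption (covered later in the chapter).
features = outputs.last_hidden_state[:, 0, :]
print(features.shape)  # torch.Size([1, 768])
```

The key point is that the image encoder is frozen, pretrained knowledge: we never touch ImageNet itself, only a model already trained on it, and all of our actual training happens on the MS-COCO image-caption pairs.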
ILSVRC ImageNet dataset
ImageNet...