You're reading from Advanced Natural Language Processing with TensorFlow 2 Build effective real-world NLP applications using NER, RNNs, seq2seq models, Transformers, and more

Product type Paperback

Published in Feb 2021

Publisher Packt

ISBN-13 9781800200937

Length 380 pages

Edition 1st Edition

Languages

Processing

Tools

Processing

Concepts

Mobile Application Development

Authors (2):

Tony Mullen

Ashish Bansal

View More author details

Table of Contents (13) Chapters

Preface

1. Essentials of NLP

2. Understanding Sentiment in Natural Language with BiLSTMs FREE CHAPTER

3. Named Entity Recognition (NER) with BiLSTMs, CRFs, and Viterbi Decoding

4. Transfer Learning with BERT

5. Generating Text with RNNs and GPT-2

6. Text Summarization with Seq2seq Attention and Transformer Networks

7. Multi-Modal Networks and Image Captioning with ResNets and Transformer Networks

8. Weakly Supervised Learning for Classification with Snorkel

9. Building Conversational AI Applications with Deep Learning

10. Installation and Setup Instructions for Code

11. Other Books You May Enjoy

12. Index

Essentials of NLP

Language has been a part of human evolution. The development of language allowed better communication between people and tribes. The evolution of written language, initially as cave paintings and later as characters, allowed information to be distilled, stored, and passed on from generation to generation. Some would even say that the hockey stick curve of advancement is because of the ever-accumulating cache of stored information. As this stored information trove becomes larger and larger, the need for computational methods to process and distill the data becomes more acute. In the past decade, a lot of advances were made in the areas of image and speech recognition. Advances in Natural Language Processing (NLP) are more recent, though computational methods for NLP have been an area of research for decades. Processing textual data requires many different building blocks upon which advanced models can be built. Some of these building blocks themselves can be quite challenging and advanced. This chapter and the next focus on these building blocks and the problems that can be solved with them through simple models.

In this chapter, we will focus on the basics of pre-processing text and build a simple spam detector. Specifically, we will learn about the following:

The typical text processing workflow
Data collection and labeling
Text normalization, including case normalization, text tokenization, stemming, and lemmatization
- Modeling datasets that have been text normalized
- Vectorizing text
- Modeling datasets with vectorized text

Let's start by getting to grips with the text processing workflow most NLP models use.

The rest of the chapter is locked

You're reading from Advanced Natural Language Processing with TensorFlow 2 Build effective real-world NLP applications using NER, RNNs, seq2seq models, Transformers, and more

Table of Contents (13) Chapters

Essentials of NLP

Unlock this book and the full library FREE for 7 days

Authors (2)

Other recommended products