Natural Language Processing with Python Quick Start Guide: Going from a Python developer to an effective Natural Language Processing Engineer

Tidying your Text

Data cleaning is one of the most important and time-consuming tasks when it comes to natural language processing (NLP):

"There's the joke that 80 percent of data science is cleaning the data and 20 percent is complaining about cleaning the data."

– Kaggle founder and CEO Anthony Goldbloom in a Verge Interview

In this chapter, we will discuss some of the most common text pre-processing techniques. This work is universal, tedious, and unavoidable. Most people working in data science or NLP recognize it as an underrated source of value. Some of these tasks do little in isolation but have a powerful effect when used in the right combination and order. This chapter introduces several new terms and tools, since the field draws on two traditions: traditional NLP and machine learning. We'...
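The point about combination and order can be illustrated with a minimal sketch. It is not taken from the book: the function names `tokenize`, `tidy`, and `tidy_wrong_order` and the tiny `STOP_WORDS` set are hypothetical, chosen only for illustration, and the code uses just the Python standard library rather than spaCy or NLTK.

```python
import re

# Hypothetical, deliberately tiny stop-word list (an assumption for this sketch).
STOP_WORDS = {"the", "is", "a", "an", "and", "of", "to", "it"}


def tokenize(text):
    """Split text into tokens made of letters, digits, and apostrophes."""
    return re.findall(r"[A-Za-z0-9']+", text)


def tidy(text):
    """Lowercase first, then tokenize, then drop stop words."""
    tokens = [t.strip("'") for t in tokenize(text.lower())]
    return [t for t in tokens if t and t not in STOP_WORDS]


def tidy_wrong_order(text):
    """Same steps, but stop words are dropped before lowercasing."""
    tokens = [t.strip("'") for t in tokenize(text)]
    tokens = [t for t in tokens if t and t not in STOP_WORDS]
    return [t.lower() for t in tokens]


sample = "The data is messy, and the cleaning is tedious."
print(tidy(sample))              # ['data', 'messy', 'cleaning', 'tedious']
print(tidy_wrong_order(sample))  # ['the', 'data', 'messy', 'cleaning', 'tedious']
```

In the second variant the capitalized "The" slips past the case-sensitive stop-word filter, so the same three steps give a different result when applied in a different order.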

Key benefits

A no-math, code-driven programmer’s guide to text processing and NLP

Get state-of-the-art results with modern tooling across linguistics, text vectors, and machine learning

Fundamentals of NLP methods from spaCy, gensim, scikit-learn and PyTorch

Description

NLP in Python is among the most sought-after skills for data scientists. With code and relevant case studies, this book shows how you can use industry-grade tools to implement NLP programs capable of learning from relevant data. We explore many modern methods, ranging from spaCy to word vectors, that have reinvented NLP. The book takes you from the basics of NLP to building text processing applications. We start with an introduction to the basic vocabulary along with a workflow for building NLP applications. We use industry-grade NLP tools for cleaning and pre-processing text, generating questions and answers automatically using linguistics, embedding text, classifying text, and building a chatbot. With each project, you will learn a new NLP concept. You will learn about entity recognition, part-of-speech tagging, and dependency parsing for question answering. We use text embedding both for clustering documents and for making chatbots, and then build classifiers using scikit-learn. We conclude by deploying these models as REST APIs with Flask. By the end, you will be confident building NLP applications and will know exactly what to look for when approaching new challenges.

What you will learn

Understand how classical linguistics and English grammar can be used to automatically generate questions and answers from a free-text corpus

Work with text embedding models that give dense numeric representations of English words, subwords, and characters, and use them to explore document clustering

Apply deep learning to NLP through a code-driven introduction to PyTorch

Use an NLP project management framework to estimate timelines and organize your project into stages

Hack and build a simple chatbot application in 30 minutes

Deploy an NLP or machine learning application as a RESTful API using Flask

Feature	spaCy	NLTK	CoreNLP
Native Python support/API	Y	Y	Y
Multi-language support	Y	Y	Y
Tokenization	Y	Y	Y
Part-of-speech tagging	Y	Y	Y
Sentence segmentation	Y	Y	Y
Dependency parsing	Y	N	Y
Entity recognition	Y	Y	Y
Integrated word vectors	Y	N	N
Sentiment analysis	Y	Y	Y
Coreference resolution	N	N	Y

Tidying your Text

Bread and butter – most common tasks

Tokenization

Intuitive – split by...

Stemming and lemmatization

spaCy compared with NLTK and CoreNLP

Correcting spelling

Cleaning a corpus with FlashText

Summary
