Subscription

Explore Products

Best Sellers

New Releases

Books

Videos

Audiobooks

Learning Hub

Conferences

Free Learning

You're reading from Natural Language Processing with Python Quick Start Guide Going from a Python developer to an effective Natural Language Processing Engineer

Product type Paperback

Published in Nov 2018

Publisher Packt

ISBN-13 9781789130386

Length 182 pages

Edition 1st Edition

Languages

Processing

Tools

Processing

Concepts

Mobile Application Development

Author (1):

Nirant Kasliwal

View More author details

Table of Contents (10) Chapters

Preface

1. Getting Started with Text Classification FREE CHAPTER

2. Tidying your Text

3. Leveraging Linguistics

4. Text Representations - Words to Numbers

5. Modern Methods for Classification

6. Deep Learning for NLP

7. Building your Own Chatbot

8. Web Deployments

9. Other Books You May Enjoy

Leave a review - let other readers know what you think

Bread and butter – most common tasks

There are several well-known text cleaning ideas. They have all made their way into the most popular tools today such as NLTK, Stanford CoreNLP, and spaCy. I like spaCy for two main reasons:

It's an industry-grade NLP, unlike NLTK, which is mainly meant for teaching.
It has good speed-to-performance trade-off. spaCy is written in Cython, which gives it C-like performance with Python code.

spaCy is actively maintained and developed, and incorporates the best methods available for most challenges.

By the end of this section, you will be able to do the following:

Understand tokenization and do it manually yourself using spaCy
Understand why stop word removal and case standardization works, with spaCy examples
Differentiate between stemming and lemmatization, with spaCy lemmatization examples

...

The rest of the chapter is locked

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at €18.99/month. Cancel anytime

Authors (1)

Kasliwal

Nirant Kasliwal maintains an awesome list of NLP natural language processing resources. GitHub's machine learning collection features this as the go-to guide. Nobel Laureate Dr. Paul Romer found his programming notes on Jupyter Notebooks helpful. Nirant won the first ever NLP Google Kaggle Kernel Award. At Soroco, image segmentation and intent categorization are the challenges he works with. His state-of-the-art language modeling results are available as Hindi2vec.

See other products by Kasliwal