In this chapter, we expanded on the ideas introduced in Chapter 4, Transforming Text into Data Structures. Instead of relying on the syntactic aspects of a document, we focused on capturing the semantics of words in a sentence. Properties such as the co-occurrence of words help in understanding the context of a word, and we leveraged this to build vector representations of text using the Word2vec algorithm. We explored the pretrained Word2vec model developed by Google and looked at a few of the relationships it can capture. We followed this up by learning about the architecture of a Word2vec model, and then trained a few Word2vec models from scratch. We also discussed the limitations of and biases in the Word2vec model, along with some of its applications. Finally, we looked at how the Word Mover's Distance (WMD) algorithm uses word vectors to measure the distance between documents.
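As a compact recap of the chapter's three practical threads, here is a minimal sketch, assuming gensim 4.x with its downloader API and the POT package (needed by gensim's WMD solver). It loads Google's pretrained vectors, trains a small Word2vec model from scratch, and computes WMD between two short documents; the toy corpus and example queries are illustrative, not taken from the chapter.

```python
import gensim.downloader as api
from gensim.models import Word2Vec

# 1. Load Google's pretrained Word2vec vectors (a large one-time download).
pretrained = api.load("word2vec-google-news-300")

# The classic analogy the chapter explored: king - man + woman ~ queen.
print(pretrained.most_similar(positive=["king", "woman"],
                              negative=["man"], topn=1))

# 2. Train a tiny Word2vec model from scratch on a toy, pre-tokenized corpus
#    (sg=1 selects the skip-gram architecture; sg=0 would be CBOW).
corpus = [
    ["the", "cat", "sat", "on", "the", "mat"],
    ["the", "dog", "sat", "on", "the", "rug"],
]
model = Word2Vec(sentences=corpus, vector_size=50, window=3,
                 min_count=1, sg=1)
print(model.wv["cat"].shape)  # a 50-dimensional vector for "cat"

# 3. Word Mover's Distance between two tokenized documents; out-of-vocabulary
#    tokens are ignored, and a smaller distance means more similar documents.
d = pretrained.wmdistance(["obama", "speaks", "media", "illinois"],
                          ["president", "greets", "press", "chicago"])
print(f"WMD: {d:.4f}")
```

Note that WMD treats each document as a weighted cloud of word vectors and finds the minimum cumulative distance the words of one document must "travel" to match the other, which is why it can relate the two sentences above despite them sharing no words.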
In the next chapter, we will take this idea further and build vectors for documents, sentences...