Subscription

Explore Products

Best Sellers

New Releases

Books

Videos

Audiobooks

Learning Hub

Conferences

Free Learning

You're reading from Python Machine Learning Cookbook Over 100 recipes to progress from smart data analytics to deep learning using real-world datasets

Product type Paperback

Published in Mar 2019

Publisher Packt

ISBN-13 9781789808452

Length 642 pages

Edition 2nd Edition

Languages

Python

Tools

Pandas

Concepts

Deep Learning

Authors (2):

Giuseppe Ciaburro

Prateek Joshi

View More author details

Table of Contents (18) Chapters

Preface

1. The Realm of Supervised Learning FREE CHAPTER

2. Constructing a Classifier

3. Predictive Modeling

4. Clustering with Unsupervised Learning

5. Visualizing Data

6. Building Recommendation Engines

7. Analyzing Text Data

8. Speech Recognition

9. Dissecting Time Series and Sequential Data

10. Analyzing Image Content

11. Biometric Face Recognition

12. Reinforcement Learning Techniques

13. Deep Neural Networks

14. Unsupervised Representation Learning

15. Automated Machine Learning and Transfer Learning

16. Unlocking Production Issues

17. Other Books You May Enjoy

Leave a review - let other readers know what you think

Dividing text using chunking

Chunking refers to dividing the input text into pieces, which are based on any random condition. This is different from tokenization in the sense that there are no constraints, and the chunks do not need to be meaningful at all. This is used very frequently during text analysis. While dealing with large text documents, it's better to do it in chunks.

How to do it...

Let's look at how to divide text by using chunking:

Create a new Python file and import the following packages (the full code is in the chunking.py file that's already been provided to you):

import numpy as np 
nltk.download('brown')
from nltk.corpus import brown

Let's define a function to split the...

The rest of the chapter is locked

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at €18.99/month. Cancel anytime

Authors (2)

Giuseppe Ciaburro

Giuseppe Ciaburro holds a PhD and two master's degrees. He works at the Built Environment Control Laboratory - Università degli Studi della Campania "Luigi Vanvitelli". He has over 25 years of work experience in programming, first in the field of combustion and then in acoustics and noise control. His core programming knowledge is in MATLAB, Python and R. As an expert in AI applications to acoustics and noise control problems, Giuseppe has wide experience in researching and teaching. He has several publications to his credit: monographs, scientific journals, and thematic conferences. He was recently included in the world's top 2% scientists list by Stanford University (2022).

See other products by Giuseppe Ciaburro

Joshi

Vijay Joshi is a full stack web developer having more than a decade of experience in working with PHP and JavaScript.

See other products by Joshi