You're reading from Python Feature Engineering Cookbook Over 70 recipes for creating, engineering, and transforming features to build machine learning models

Product type Paperback

Published in Oct 2022

Publisher Packt

ISBN-13 9781804611302

Length 386 pages

Edition 2nd Edition

Languages

Python

Tools

Combine

Concepts

Machine Learning

Author (1):

Soledad Galli

View More author details

Table of Contents (14) Chapters

Preface

1. Chapter 1: Imputing Missing Data

2. Chapter 2: Encoding Categorical Variables FREE CHAPTER

3. Chapter 3: Transforming Numerical Variables

4. Chapter 4: Performing Variable Discretization

5. Chapter 5: Working with Outliers

6. Chapter 6: Extracting Features from Date and Time Variables

7. Chapter 7: Performing Feature Scaling

8. Chapter 8: Creating New Features

9. Chapter 9: Extracting Features from Relational Data with Featuretools

10. Chapter 10: Creating Features from a Time Series with tsfresh

11. Chapter 11: Extracting Features from Text Variables

12. Index

Why subscribe?

13. Other Books You May Enjoy

Extracting features from text

In Chapter 11, Extracting Features from Text Variables, we will discuss various features that we can extract from pieces of text utilizing pandas and scikit-learn. We can also extract multiple features from text automatically by utilizing Featuretools.

Featuretools supports the creation of two basic features from text as part of its default functionality, which are the number of characters and the number of words in a piece of text. In addition, there is an accompanying Python library, Natural Language Processing (NLP) primitives, which contains a lot more functionality to create more advanced features for use with text. Among these functions, we find primitives for counting the number of stop words and punctuation and calculating the amount of whitespace and uppercase, as well as more complex functions for deriving the diversity score or the polarity score.

Note

For a complete list of the functions available, visit https://featuretools.alteryx...

The rest of the chapter is locked

You're reading from Python Feature Engineering Cookbook Over 70 recipes for creating, engineering, and transforming features to build machine learning models

Table of Contents (14) Chapters

Extracting features from text

Authors (1)

Personalised recommendations for you

You're reading from Python Feature Engineering Cookbook Over 70 recipes for creating, engineering, and transforming features to build machine learning models

Table of Contents (14) Chapters

Extracting features from text

Unlock this book and the full library FREE for 7 days

Authors (1)

Personalised recommendations for you