Python Feature Engineering Cookbook: A complete guide to crafting powerful features for your machine learning models

Product type Paperback

Published in Aug 2024

Publisher Packt

ISBN-13 9781835883587

Length 396 pages

Edition 3rd Edition

Languages

Python

Tools

Combine

Concepts

Data Science

Author (1):

Soledad Galli

Table of Contents (14) Chapters

Preface

1. Chapter 1: Imputing Missing Data

2. Chapter 2: Encoding Categorical Variables

3. Chapter 3: Transforming Numerical Variables

4. Chapter 4: Performing Variable Discretization

5. Chapter 5: Working with Outliers

6. Chapter 6: Extracting Features from Date and Time Variables

7. Chapter 7: Performing Feature Scaling

8. Chapter 8: Creating New Features

9. Chapter 9: Extracting Features from Relational Data with Featuretools

10. Chapter 10: Creating Features from a Time Series with tsfresh

11. Chapter 11: Extracting Features from Text Variables

12. Index

13. Other Books You May Enjoy

Scaling to vector unit length

Scaling to the vector unit length involves scaling individual observations (not features) to have a unit norm. Each sample (that is, each row of the data) is rescaled independently of other samples so that its norm equals one. Each row constitutes a feature vector containing the values of every variable for that row. Hence, with this scaling method, we rescale the feature vector.

The norm of a vector is a measure of its magnitude or length in a given space, and it can be determined by using the Manhattan (l1) or the Euclidean (l2) distance. The Manhattan distance is given by the sum of the absolute values of the vector's components:

$$l1(X) = |x_1| + |x_2| + \dots + |x_n|$$

The Euclidean distance is given by the square root of the sum of the squared components of the vector:

$$l2(X) = \sqrt{x_1^2 + x_2^2 + \dots + x_n^2}$$
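As a quick illustration, the two norms described above can be computed in a few lines of plain Python. This is a minimal sketch rather than code from the book: the helper names `l1_norm` and `l2_norm` and the example vector are assumptions chosen for illustration.

```python
import math

def l1_norm(vector):
    """Manhattan (l1) norm: sum of the absolute values of the components."""
    return sum(abs(x) for x in vector)

def l2_norm(vector):
    """Euclidean (l2) norm: square root of the sum of the squared components."""
    return math.sqrt(sum(x * x for x in vector))

# Hypothetical observation (one row of a dataset) with two variables.
row = [3.0, -4.0]

print(l1_norm(row))  # 7.0
print(l2_norm(row))  # 5.0
```

Note that the two norms generally differ for the same vector, so the choice of norm changes the values obtained after scaling.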

Here, $x_1$, $x_2$, and $x_n$ are the values of variables 1, 2, and n for each observation. Scaling to unit norm consists of dividing each value in the feature vector by the vector's l1 or l2 norm, so that after the scaling, the norm of the feature...
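The row-wise division described above can be sketched with NumPy as follows. This is an illustrative example under stated assumptions, not the book's recipe: the function name `scale_rows_to_unit_norm`, the toy array `X`, and the decision to leave all-zero rows unchanged are choices made here for the sketch.

```python
import numpy as np

def scale_rows_to_unit_norm(X, norm="l2"):
    """Divide each row (observation) by its own l1 or l2 norm."""
    X = np.asarray(X, dtype=float)
    if norm == "l1":
        # Sum of absolute values along each row.
        norms = np.abs(X).sum(axis=1, keepdims=True)
    elif norm == "l2":
        # Square root of the sum of squares along each row.
        norms = np.sqrt((X ** 2).sum(axis=1, keepdims=True))
    else:
        raise ValueError("norm must be 'l1' or 'l2'")
    # Assumption: rows made only of zeros are left as they are,
    # which avoids a division by zero.
    norms[norms == 0] = 1.0
    return X / norms

# Hypothetical data: three observations, two variables.
X = np.array([[3.0, 4.0], [1.0, 1.0], [0.0, 0.0]])

X_l1 = scale_rows_to_unit_norm(X, norm="l1")
X_l2 = scale_rows_to_unit_norm(X, norm="l2")

print(X_l2[0])  # [0.6 0.8]
```

Each row is rescaled using only its own values, so adding or removing other observations does not change the result for a given row. In scikit-learn, the `Normalizer` transformer in `sklearn.preprocessing` performs this kind of per-sample scaling through its `norm` parameter.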
