In many classification problems, the target dataset is made up of categorical labels that cannot immediately be processed by every algorithm. An encoding is needed, and scikit-learn offers at least two valid options. Let's consider a very small dataset made of 10 samples with 2 numeric features each and one categorical label:
import numpy as np
X = np.random.uniform(0.0, 1.0, size=(10, 2))
Y = np.random.choice(('Male', 'Female'), size=(10))
print(X[0])
[0.8236887  0.11975305]
print(Y[0])
Female
The first option is to use the LabelEncoder class, which adopts a dictionary-oriented approach: each category label is mapped to a progressive integer, which is its index in an instance array called classes_:
from sklearn.preprocessing import LabelEncoder
le = LabelEncoder()
yt = le.fit_transform(Y)
print(yt)
[0 0 0 1 0 1 1 0 0 1]
print(le.classes_)
['Female' 'Male']
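Because the mapping is stored in classes_, the transformation can also be reversed with inverse_transform. A minimal sketch (using a small fixed label array rather than the random one above, so the output is deterministic):

```python
import numpy as np
from sklearn.preprocessing import LabelEncoder

# Small fixed label array for a reproducible example
Y = np.array(['Male', 'Female', 'Female', 'Male'])

le = LabelEncoder()
yt = le.fit_transform(Y)            # classes_ is sorted: Female -> 0, Male -> 1
print(yt)                           # [1 0 0 1]

decoded = le.inverse_transform(yt)  # map integer codes back to the labels
print(decoded)                      # ['Male' 'Female' 'Female' 'Male']
```

Note that the integer codes follow the alphabetical order of the unique labels, not the order in which they appear in Y.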