You're reading from Machine Learning with scikit-learn Quick Start Guide: Classification, regression, and clustering techniques in Python

Product type: Paperback

Published in: Oct 2018

Publisher: Packt

ISBN-13: 9781789343700

Length: 172 pages

Edition: 1st Edition

Languages: Python

Tools: scikit-learn

Concepts: Machine Learning

Author (1):

Kevin Jolly

Table of Contents (10) Chapters

Preface

1. Introducing Machine Learning with scikit-learn

2. Predicting Categories with K-Nearest Neighbors

3. Predicting Categories with Logistic Regression

4. Predicting Categories with Naive Bayes and SVMs

5. Predicting Numeric Outcomes with Linear Regression

6. Classification and Regression with Trees

7. Clustering Data with Unsupervised Machine Learning

8. Performance Evaluation Methods

9. Other Books You May Enjoy

Summary

This chapter covered how to prepare a dataset for machine learning with scikit-learn. You learned about the constraints that scikit-learn imposes on its input data and how to create a dataset that satisfies them.

You also learned how the k-NN algorithm works behind the scenes and implemented a version of it using scikit-learn to predict whether a transaction was fraudulent. You then learned how to tune the algorithm's hyperparameters using scikit-learn's GridSearchCV, which runs a cross-validated search over a grid of candidate values. Finally, you learned how to standardize and scale your data in order to improve the performance of your model.
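As a quick recap of those three steps, here is a minimal sketch that chains scaling, k-NN, and a grid search together. It is not the chapter's own code: the data is a synthetic, imbalanced stand-in generated with `make_classification` rather than the book's fraud dataset, and the candidate values for `n_neighbors`, the 70/30 split, and the use of a `Pipeline` are assumptions made for illustration.

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic, imbalanced stand-in for a fraud dataset (assumption: not the book's data).
X, y = make_classification(
    n_samples=1000,
    n_features=8,
    n_informative=5,
    weights=[0.9, 0.1],
    random_state=42,
)

# Hold out a test set; stratify so both splits keep the same class balance.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=42
)

# Scaling lives inside the pipeline so it is refit on each cross-validation
# training fold and never sees the validation fold.
pipeline = Pipeline([
    ("scaler", StandardScaler()),
    ("knn", KNeighborsClassifier()),
])

# Candidate values for the number of neighbors (illustrative choice).
param_grid = {"knn__n_neighbors": [1, 3, 5, 7, 9]}

# 5-fold cross-validated grid search over the pipeline.
search = GridSearchCV(pipeline, param_grid, cv=5, scoring="accuracy")
search.fit(X_train, y_train)

best_k = search.best_params_["knn__n_neighbors"]
test_accuracy = search.score(X_test, y_test)
print(f"best n_neighbors: {best_k}")
print(f"held-out accuracy: {test_accuracy:.3f}")
```

Because k-NN relies on distances between points, features on larger scales would otherwise dominate the neighbor calculation, which is why the scaler is placed ahead of the classifier. On a dataset this imbalanced, accuracy alone can be misleading, so a different `scoring` value may be more informative in practice.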

In the next chapter, you will learn how to classify fraudulent transactions again, this time with a new algorithm: logistic regression!

Authors (1)

Kevin Jolly

Kevin Jolly is a formally educated data scientist with a master's degree in data science from the prestigious King's College London. Kevin works as a statistical analyst with a digital healthcare start-up, Connido Limited, in London, where he is primarily involved in leading the data science projects that the company undertakes. He has built machine learning pipelines for small and big data, with a focus on scaling such pipelines into production for the products that the company has built. Kevin is also the author of a book titled Hands-On Data Visualization with Bokeh, published by Packt. He is the editor-in-chief of Linear, a weekly online publication on data science software and products.