Explore Products

Best Sellers

New Releases

Books

Videos

Audiobooks

Learning Hub

Free Learning

You're reading from Mastering Machine Learning with scikit-learn Apply effective learning algorithms to real-world problems using scikit-learn

Product type Paperback

Published in Jul 2017

Publisher

ISBN-13 9781788299879

Length 254 pages

Edition 2nd Edition

Languages

Python

Tools

Scikit-learn

Concepts

Machine Learning

Author (1):

Gavin Hackeling

View More author details

Table of Contents (15) Chapters

Preface

1. The Fundamentals of Machine Learning

2. Simple Linear Regression FREE CHAPTER

3. Classification and Regression with k-Nearest Neighbors

4. Feature Extraction

5. From Simple Linear Regression to Multiple Linear Regression

6. From Linear Regression to Logistic Regression

7. Naive Bayes

8. Nonlinear Classification and Regression with Decision Trees

9. From Decision Trees to Random Forests and Other Ensemble Methods

10. The Perceptron

11. From the Perceptron to Support Vector Machines

12. From the Perceptron to Artificial Neural Networks

13. K-means

14. Dimensionality Reduction with Principal Component Analysis

Evaluating clusters

We defined machine learning as the design and study of systems that learn from experience to improve their performance of a task as measured by some metric. K-means is an unsupervised learning algorithm; there are no labels or ground truth to compare with the clusters. However, we can still evaluate the performance of the algorithm using intrinsic measures. We have already discussed measuring the distortions of clusters. In this section, we will discuss another performance measure for clustering called silhouette coefficient. The silhouette coefficient is a measure of compactness and separation of clusters. It increases as the quality of clusters increases; it is large for compact clusters that are far from each other and small for large, overlapping clusters. The silhouette coefficient is calculated per instance; for a set of instances, it is calculated as...

The rest of the chapter is locked

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at $19.99/month. Cancel anytime

Authors (1)

Gavin Hackeling

Gavin Hackeling develops machine learning services for large-scale documents and image classification at an advertising network in New York. He received his Master's degree from New York University's Interactive Telecommunications Program, and his Bachelor's degree from the University of North Carolina.

See other products by Gavin Hackeling