You're reading from Mastering Machine Learning Algorithms Expert techniques for implementing popular machine learning algorithms, fine-tuning your models, and understanding how they work

Product type Paperback

Published in Jan 2020

Publisher Packt

ISBN-13 9781838820299

Length 798 pages

Edition 2nd Edition

Languages

Python

Tools

Keras

Concepts

Machine Learning

Authors (2):

Giuseppe Bonaccorso

View More author details

Table of Contents (28) Chapters

Preface

1. Machine Learning Model Fundamentals

2. Loss Functions and Regularization FREE CHAPTER

3. Introduction to Semi-Supervised Learning

4. Advanced Semi-Supervised Classification

5. Graph-Based Semi-Supervised Learning

6. Clustering and Unsupervised Models

7. Advanced Clustering and Unsupervised Models

8. Clustering and Unsupervised Models for Marketing

9. Generalized Linear Models and Regression

10. Introduction to Time-Series Analysis

11. Bayesian Networks and Hidden Markov Models

12. The EM Algorithm

13. Component Analysis and Dimensionality Reduction

14. Hebbian Learning

15. Fundamentals of Ensemble Learning

16. Advanced Boosting Algorithms

17. Modeling Neural Networks

18. Optimizing Neural Networks

19. Deep Convolutional Networks

20. Recurrent Neural Networks

21. Autoencoders

22. Introduction to Generative Adversarial Networks

23. Deep Belief Networks

24. Introduction to Reinforcement Learning

25. Advanced Policy Estimation Algorithms

26. Other Books You May Enjoy

27. Index

Example of label propagation

We can implement the algorithm in Python, using a test bidimensional dataset:

from sklearn.datasets import make_classification
nb_samples = 100
nb_unlabeled = 75
X, Y = make_classification(n_samples=nb_samples, n_features=2, n_informative=2, n_redundant=0, random_state=1000)
Y[Y==0] = -1
Y[nb_samples - nb_unlabeled:nb_samples] = 0

As in the other examples, we set y = 0 for all unlabeled samples (75 out of 100). The corresponding plot is shown in the following graph:

Partially labeled dataset

The dots marked with a cross are unlabeled. At this point, we can define the affinity matrix. In this case, we compute it using both methods:

from sklearn.neighbors import kneighbors_graph
nb_neighbors = 2
W_knn_sparse = kneighbors_graph(X, n_neighbors=nb_neighbors, mode='connectivity', include_self=True)
W_knn = W_knn_sparse.toarray()

The KNN matrix is obtained using the scikit-learn function kneighbors_graph...