All Products

Best Sellers

New Releases

Books

Videos

Audiobooks

Learning Hub

Newsletters

Free Learning

You're reading from Ensemble Machine Learning Cookbook

Product type Book

Published in Jan 2019

Publisher Packt

ISBN-13 9781789136609

Pages 336 pages

Edition 1st Edition

Languages

Python

Concepts

Machine Learning

Authors (2):

Dipayan Sarkar

Vijayalakshmi Natarajan

View More author details

Table of Contents (14) Chapters

Preface

1. Get Closer to Your Data

2. Getting Started with Ensemble Machine Learning

3. Resampling Methods

4. Statistical and Machine Learning Algorithms

5. Bag the Models with Bagging

6. When in Doubt, Use Random Forests

7. Boosting Model Performance with Boosting

8. Blend It with Stacking

9. Homogeneous Ensembles Using Keras

10. Heterogeneous Ensemble Classifiers Using H2O

11. Heterogeneous Ensemble for Text Classification Using NLP

12. Homogenous Ensemble for Multiclass Classification Using Keras

13. Other Books You May Enjoy

Leave a review - let other readers know what you think

k-fold and leave-one-out cross-validation

Machine learning models often face the problem of generalization when they're applied to unseen data to make predictions. To avoid this problem, the model isn't trained using the complete dataset. Instead, the dataset is split into training and testing subsets. The model is trained on the training data and evaluated on the testing set, which it doesn't see during the training process. This is the fundamental idea behind cross-validation.

The simplest kind of cross-validation is the holdout method, which we saw in the previous recipe, Introduction to sampling. In the holdout method, when we split our data into training and testing subsets, there's a possibility that the testing set isn't that similar to the training set because of the high dimensionality of the data. This can lead to instability in the outcome...

The rest of the chapter is locked

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at $15.99/month. Cancel anytime}

Authors (2)

Dipayan Sarkar

Dipayan Sarkar holds a Masters in Economics and comes with 17+ years of experience. Dipayan has won international challenges in predictive modeling and takes a keen interest in the mathematics behind machine learning techniques. Before opting to become an independent consultant and a mentor in the data science and machine learning space with various organizations and educational institutions, he had served in the capacity of a senior data scientist with Fortune 500 companies in the US and Europe. He is currently associated with Great Lakes Institute of Management as a visiting faculty (Analytics) and BML Munjal University as an adjunct faculty (Analytics and Machine Learning). He has co-authored a book on "Ensemble Machine Learning with Python" with PACKT Publishing.

See other products by Dipayan Sarkar

Vijayalakshmi Natarajan

Vijayalakshmi Natarajan holds an ME in Computer Science, comes with 4 years of industry experience. She is a data science enthusiast and is a passionate trainer in the field of data science & data visualization. She takes keen interests in deep diving into Machine Learning techniques. Her specialization includes machine learning techniques in the field of image processing.

See other products by Vijayalakshmi Natarajan