Subscription

Explore Products

Best Sellers

New Releases

Books

Videos

Audiobooks

Learning Hub

Conferences

Free Learning

You're reading from Ensemble Machine Learning Cookbook Over 35 practical recipes to explore ensemble machine learning techniques using Python

Product type Paperback

Published in Jan 2019

Publisher Packt

ISBN-13 9781789136609

Length 336 pages

Edition 1st Edition

Languages

Python

Tools

Scikit-learn

Concepts

Machine Learning

Authors (2):

Vijayalakshmi Natarajan

Dipayan Sarkar

View More author details

Table of Contents (14) Chapters

Preface

1. Get Closer to Your Data

2. Getting Started with Ensemble Machine Learning FREE CHAPTER

3. Resampling Methods

4. Statistical and Machine Learning Algorithms

5. Bag the Models with Bagging

6. When in Doubt, Use Random Forests

7. Boosting Model Performance with Boosting

8. Blend It with Stacking

9. Homogeneous Ensembles Using Keras

10. Heterogeneous Ensemble Classifiers Using H2O

11. Heterogeneous Ensemble for Text Classification Using NLP

12. Homogenous Ensemble for Multiclass Classification Using Keras

13. Other Books You May Enjoy

Leave a review - let other readers know what you think

Introduction to random forests

A random forest is a supervised machine learning algorithm based on ensemble learning. It is used for both regression and classification problems. The general idea behind random forests is to build multiple decision trees and aggregate them to get an accurate result. A decision tree is a deterministic algorithm, which means if the same data is given to it, the same tree will be produced each time. They have a tendency to overfit, because they build the best tree possible with the given data, but may fail to generalize when unseen data is provided. All the decision trees that make up a random forest are different because we build each tree on a different random subset of our data. A random forest tends to be more accurate than a single decision tree because it minimizes overfitting.

The following diagram demonstrates bootstrap sampling being done...

The rest of the chapter is locked

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at £16.99/month. Cancel anytime

Authors (2)

Sarkar

Aurobindo Sarkar leads a team of data scientists and engineers at Session AI, developing cloud-based ML models for in-session marketing in e-commerce and retail. As a former CTO at multiple SaaS startups, he has architected secure, scalable, and highly available AWS cloud applications. His research interests now focus on AWS-based large-scale transformer models for NLP and HFT models for the futures and options market. Aurobindo holds a bachelor's degree in engineering from IIT Delhi, a master's in management from the Indian Institute of Science Bangalore, and a master's in computer science from New York University.

See other products by Sarkar

Natarajan

Vijayalakshmi Natarajan holds an ME in Computer Science, comes with 4 years of industry experience. She is a data science enthusiast and is a passionate trainer in the field of data science & data visualization. She takes keen interests in deep diving into Machine Learning techniques. Her specialization includes machine learning techniques in the field of image processing.

See other products by Natarajan