Subscription

Explore Products

Best Sellers

New Releases

Books

Videos

Audiobooks

Learning Hub

Conferences

Free Learning

You're reading from Ensemble Machine Learning Cookbook Over 35 practical recipes to explore ensemble machine learning techniques using Python

Product type Paperback

Published in Jan 2019

Publisher Packt

ISBN-13 9781789136609

Length 336 pages

Edition 1st Edition

Languages

Python

Tools

Scikit-learn

Concepts

Machine Learning

Authors (2):

Vijayalakshmi Natarajan

Dipayan Sarkar

View More author details

Table of Contents (14) Chapters

Preface

1. Get Closer to Your Data

2. Getting Started with Ensemble Machine Learning FREE CHAPTER

3. Resampling Methods

4. Statistical and Machine Learning Algorithms

5. Bag the Models with Bagging

6. When in Doubt, Use Random Forests

7. Boosting Model Performance with Boosting

8. Blend It with Stacking

9. Homogeneous Ensembles Using Keras

10. Heterogeneous Ensemble Classifiers Using H2O

11. Heterogeneous Ensemble for Text Classification Using NLP

12. Homogenous Ensemble for Multiclass Classification Using Keras

13. Other Books You May Enjoy

Leave a review - let other readers know what you think

Implementing a random forest for predicting credit card defaults using scikit-learn

The scikit-learn library implements random forests by providing two estimators: RandomForestClassifier and RandomForestRegressor. They take various parameters, some of which are explained as follows:

n_estimators: This parameter is the number of trees the algorithm builds before taking a maximum vote or the average prediction. In general, the higher the number of trees the better the performance and the accuracy of the predictions, but it also costs more in terms of computation.
max_features: This parameter is the maximum number of features that the random forest is allowed to try in an individual tree.
min_sample_leaf: This parameter determines the minimum number of leaves that are required to split an internal node.
n_jobs: This hyperparameter tells the engine how many jobs to run in parallel...

The rest of the chapter is locked

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at €18.99/month. Cancel anytime

Authors (2)

Sarkar

Aurobindo Sarkar leads a team of data scientists and engineers at Session AI, developing cloud-based ML models for in-session marketing in e-commerce and retail. As a former CTO at multiple SaaS startups, he has architected secure, scalable, and highly available AWS cloud applications. His research interests now focus on AWS-based large-scale transformer models for NLP and HFT models for the futures and options market. Aurobindo holds a bachelor's degree in engineering from IIT Delhi, a master's in management from the Indian Institute of Science Bangalore, and a master's in computer science from New York University.

See other products by Sarkar

Natarajan

Vijayalakshmi Natarajan holds an ME in Computer Science, comes with 4 years of industry experience. She is a data science enthusiast and is a passionate trainer in the field of data science & data visualization. She takes keen interests in deep diving into Machine Learning techniques. Her specialization includes machine learning techniques in the field of image processing.

See other products by Natarajan