Subscription

Explore Products

Best Sellers

New Releases

Books

Videos

Audiobooks

Learning Hub

Conferences

Free Learning

You're reading from Hands-On Ensemble Learning with Python Build highly optimized ensemble machine learning models using scikit-learn and Keras

Product type Paperback

Published in Jul 2019

Publisher Packt

ISBN-13 9781789612851

Length 298 pages

Edition 1st Edition

Languages

Python

Tools

Keras

Concepts

Machine Learning

Authors (2):

Konstantinos G. Margaritis

George Kyriakides

View More author details

Table of Contents (20) Chapters

Preface

1. Section 1: Introduction and Required Software Tools

2. A Machine Learning Refresher FREE CHAPTER

3. Getting Started with Ensemble Learning

4. Section 2: Non-Generative Methods

5. Voting

6. Stacking

7. Section 3: Generative Methods

8. Bagging

9. Boosting

10. Random Forests

11. Section 4: Clustering

12. Clustering

13. Section 5: Real World Applications

14. Classifying Fraudulent Transactions

15. Predicting Bitcoin Prices

16. Evaluating Sentiment on Twitter

17. Recommending Movies with Keras

18. Clustering World Happiness

19. Another Book You May Enjoy

Leave a review - let other readers know what you think

Using random forests

Finally, we will employ a random forest ensemble. Once again, using validation curves, we will determine the optimal ensemble size. From the following graph, we conclude that 50 trees provide the least possible variance in our model, thus we proceed with ensemble size 50:

Validation curves for random forest

We provide the training and validation code as follows, as well as the achieved performance for both datasets. The following code is responsible for loading the required libraries and data, and training and evaluating the ensemble on the original and filtered datasets. We first load the required libraries and data, while creating train and test splits:

# --- SECTION 1 ---
# Libraries and data loading
import numpy as np
import pandas as pd

from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split
from sklearn.utils...

The rest of the chapter is locked

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at €18.99/month. Cancel anytime

Authors (2)

Kyriakides

George Kyriakides is a Ph.D. researcher, studying distributed neural architecture search. His interests and experience include automated generation and optimization of predictive models for a wide array of applications, such as image recognition, time series analysis, and financial applications. He holds an M.Sc. in computational methods and applications, and a B.Sc. in applied informatics, both from the University of Macedonia, Thessaloniki, Greece.

See other products by Kyriakides

Margaritis

Konstantinos G. Margaritis has been a teacher and researcher in computer science for more than 30 years. His research interests include parallel and distributed computing as well as computational intelligence and machine learning. He holds an M.Eng. in electrical engineering (Aristotle University of Thessaloniki, Greece), as well as an M.Sc. and a Ph.D. in computer science (Loughborough University, UK). He is a professor at the Department of Applied Informatics, University of Macedonia, Thessaloniki, Greece.

See other products by Margaritis