You're reading from Hands-On Ensemble Learning with Python Build highly optimized ensemble machine learning models using scikit-learn and Keras

Product type Paperback

Published in Jul 2019

Publisher Packt

ISBN-13 9781789612851

Length 298 pages

Edition 1st Edition

Languages

Python

Tools

Keras

Concepts

Machine Learning

Authors (2):

Konstantinos G. Margaritis

George Kyriakides

Table of Contents (20) Chapters

Preface

1. Section 1: Introduction and Required Software Tools

2. A Machine Learning Refresher

3. Getting Started with Ensemble Learning

4. Section 2: Non-Generative Methods

5. Voting

6. Stacking

7. Section 3: Generative Methods

8. Bagging

9. Boosting

10. Random Forests

11. Section 4: Clustering

12. Clustering

13. Section 5: Real World Applications

14. Classifying Fraudulent Transactions

15. Predicting Bitcoin Prices

16. Evaluating Sentiment on Twitter

17. Recommending Movies with Keras

18. Clustering World Happiness

19. Another Book You May Enjoy

Bagging

In this section, we will classify the dataset using bagging. As we have previously shown, decision trees with a maximum depth of five are optimal; thus, we will use these trees as base learners in our bagging example.

We would like to optimize the ensemble's size. To do so, we generate validation curves on the original train set, testing ensemble sizes in the range [5, 30]. The curves are shown in the following graph:

Validation curves for the original train set, for various ensemble sizes

We observe that variance is minimized for an ensemble size of 10, so we will use ensembles of size 10.
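As a rough illustration of how such validation curves could be produced, here is a minimal sketch using scikit-learn's `validation_curve`. It is not the book's listing. The synthetic data from `make_classification`, the step of 5 between candidate sizes, the 5-fold cross-validation, and the accuracy metric are all assumptions made for the sake of a runnable example; only the tree depth of five and the [5, 30] range come from the text.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import BaggingClassifier
from sklearn.model_selection import validation_curve
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in data (assumption: the chapter's real dataset is not used here).
X, y = make_classification(n_samples=500, n_features=10, random_state=0)

# Base learner from the text: a decision tree with a maximum depth of five.
base_tree = DecisionTreeClassifier(max_depth=5, random_state=0)
ensemble = BaggingClassifier(base_tree, random_state=0)

# Candidate ensemble sizes inside the [5, 30] range (the step of 5 is an assumption).
sizes = np.arange(5, 31, 5)

# One row per candidate size, one column per cross-validation fold.
train_scores, val_scores = validation_curve(
    ensemble, X, y,
    param_name="n_estimators",
    param_range=sizes,
    cv=5,                # assumption: 5-fold cross-validation
    scoring="accuracy",  # assumption: the chapter may use a different metric
)

# The spread of the validation scores across folds is a proxy for variance.
for size, tr, va in zip(sizes, train_scores, val_scores):
    print(f"size={size:2d}  train={tr.mean():.3f}  "
          f"val={va.mean():.3f}  val_std={va.std():.3f}")
```

Plotting the mean scores with a band of one standard deviation around them gives curves of the kind described above. A narrow band around the validation curve indicates low variance for that ensemble size.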

The following code loads the data and libraries (Section 1), splits the data into train and test sets, and fits and evaluates the ensemble on the original dataset (Section 2) and the reduced-features dataset (Section 3):

# --- SECTION 1 ---
# Libraries and data loading
import numpy as...
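The listing above is cut off in this excerpt. The sketch below is not a reconstruction of it; it only illustrates the general pattern the text describes (split the data, fit a bagging ensemble of 10 depth-five trees, evaluate on the test set). The synthetic data, the 70/30 split, and the accuracy metric are assumptions, and the reduced-features variant is not covered.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import BaggingClassifier
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in data (assumption: not the chapter's dataset).
X, y = make_classification(n_samples=1000, n_features=10, random_state=0)

# Split into train and test sets (the 70/30 ratio is an assumption).
x_train, x_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0
)

# Bagging ensemble of 10 decision trees, each limited to a depth of five,
# matching the sizes chosen in the text.
ensemble = BaggingClassifier(
    DecisionTreeClassifier(max_depth=5),
    n_estimators=10,
    random_state=0,
)
ensemble.fit(x_train, y_train)

# Evaluate on the held-out test set (accuracy is an assumed metric).
test_accuracy = accuracy_score(y_test, ensemble.predict(x_test))
print(f"Bagging test accuracy: {test_accuracy:.3f}")
```

Each tree is trained on a bootstrap sample of the train set, and the ensemble combines the trees' predictions, which is what reduces variance relative to a single tree.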

Authors (2)

Kyriakides

George Kyriakides is a Ph.D. researcher, studying distributed neural architecture search. His interests and experience include automated generation and optimization of predictive models for a wide array of applications, such as image recognition, time series analysis, and financial applications. He holds an M.Sc. in computational methods and applications, and a B.Sc. in applied informatics, both from the University of Macedonia, Thessaloniki, Greece.

Margaritis

Konstantinos G. Margaritis has been a teacher and researcher in computer science for more than 30 years. His research interests include parallel and distributed computing as well as computational intelligence and machine learning. He holds an M.Eng. in electrical engineering (Aristotle University of Thessaloniki, Greece), as well as an M.Sc. and a Ph.D. in computer science (Loughborough University, UK). He is a professor at the Department of Applied Informatics, University of Macedonia, Thessaloniki, Greece.
