Subscription

Explore Products

Best Sellers

New Releases

Books

Videos

Audiobooks

Learning Hub

Conferences

Free Learning

You're reading from Hands-On Ensemble Learning with Python Build highly optimized ensemble machine learning models using scikit-learn and Keras

Product type Paperback

Published in Jul 2019

Publisher Packt

ISBN-13 9781789612851

Length 298 pages

Edition 1st Edition

Languages

Python

Tools

Keras

Concepts

Machine Learning

Authors (2):

Konstantinos G. Margaritis

George Kyriakides

View More author details

Table of Contents (20) Chapters

Preface

1. Section 1: Introduction and Required Software Tools

2. A Machine Learning Refresher FREE CHAPTER

3. Getting Started with Ensemble Learning

4. Section 2: Non-Generative Methods

5. Voting

6. Stacking

7. Section 3: Generative Methods

8. Bagging

9. Boosting

10. Random Forests

11. Section 4: Clustering

12. Clustering

13. Section 5: Real World Applications

14. Classifying Fraudulent Transactions

15. Predicting Bitcoin Prices

16. Evaluating Sentiment on Twitter

17. Recommending Movies with Keras

18. Clustering World Happiness

19. Another Book You May Enjoy

Leave a review - let other readers know what you think

Boosting

As we move on, we will start to utilize generative methods. The first generative method we will experiment with is boosting. We will first try to classify the datasets using AdaBoost. As AdaBoost resamples the dataset based on misclassifications, we expect that it will be able to handle our imbalanced dataset relatively well.

First, we must decide on the ensemble's size. We generate validation curves for a number of ensemble sizes depicted as follows:

Validation curves of various ensemble sizes for AdaBoost

As we can observe, 70 base learners provide the best trade-off between bias and variance. As such, we will proceed with ensembles of size 70.

The following code implements the training and evaluation for AdaBoost:

# --- SECTION 1 ---
# Libraries and data loading
import numpy as np
import pandas as pd
from sklearn.ensemble import AdaBoostClassifier
from sklearn.model_selection...

The rest of the chapter is locked

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at $19.99/month. Cancel anytime

Authors (2)

Kyriakides

George Kyriakides is a Ph.D. researcher, studying distributed neural architecture search. His interests and experience include automated generation and optimization of predictive models for a wide array of applications, such as image recognition, time series analysis, and financial applications. He holds an M.Sc. in computational methods and applications, and a B.Sc. in applied informatics, both from the University of Macedonia, Thessaloniki, Greece.

See other products by Kyriakides

Margaritis

Konstantinos G. Margaritis has been a teacher and researcher in computer science for more than 30 years. His research interests include parallel and distributed computing as well as computational intelligence and machine learning. He holds an M.Eng. in electrical engineering (Aristotle University of Thessaloniki, Greece), as well as an M.Sc. and a Ph.D. in computer science (Loughborough University, UK). He is a professor at the Department of Applied Informatics, University of Macedonia, Thessaloniki, Greece.

See other products by Margaritis