Explore Products

Best Sellers

New Releases

Books

Videos

Audiobooks

Learning Hub

Conferences

Free Learning

You're reading from Hands-On Ensemble Learning with Python Build highly optimized ensemble machine learning models using scikit-learn and Keras

Product type Paperback

Published in Jul 2019

Publisher Packt

ISBN-13 9781789612851

Length 298 pages

Edition 1st Edition

Languages

Python

Tools

Keras

Concepts

Machine Learning

Authors (2):

Konstantinos G. Margaritis

George Kyriakides

View More author details

Table of Contents (20) Chapters

Preface

1. Section 1: Introduction and Required Software Tools

2. A Machine Learning Refresher FREE CHAPTER

3. Getting Started with Ensemble Learning

4. Section 2: Non-Generative Methods

5. Voting

6. Stacking

7. Section 3: Generative Methods

8. Bagging

9. Boosting

10. Random Forests

11. Section 4: Clustering

12. Clustering

13. Section 5: Real World Applications

14. Classifying Fraudulent Transactions

15. Predicting Bitcoin Prices

16. Evaluating Sentiment on Twitter

17. Recommending Movies with Keras

18. Clustering World Happiness

19. Another Book You May Enjoy

Leave a review - let other readers know what you think

Creating the ensemble

In order to create the ensemble, we will utilize the openensembles library that we presented in Chapter 8, Clustering. As our dataset does not contain labels, we cannot use the homogeneity score in order to evaluate our clustering models. Instead, we will use the silhouette score, which evaluates how cohesive each cluster is and how separate different clusters are. First, we must load our dataset, which is provided in the WHR.csv file. The second file that we load, Regions.csv, contains the region that each country belongs to. We will utilize the data from 2017, as 2018 has a lot of missing data (for example, Delivery quality and Democratic quality are completely absent). We will fill any missing data using the median of the dataset. For our experiment, we will utilize the factors we presented earlier. We store them in the columns variable, for ease of reference...

The rest of the chapter is locked

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at $19.99/month. Cancel anytime

Authors (2)

Kyriakides

George Kyriakides is a Ph.D. researcher, studying distributed neural architecture search. His interests and experience include automated generation and optimization of predictive models for a wide array of applications, such as image recognition, time series analysis, and financial applications. He holds an M.Sc. in computational methods and applications, and a B.Sc. in applied informatics, both from the University of Macedonia, Thessaloniki, Greece.

See other products by Kyriakides

Margaritis

Konstantinos G. Margaritis has been a teacher and researcher in computer science for more than 30 years. His research interests include parallel and distributed computing as well as computational intelligence and machine learning. He holds an M.Eng. in electrical engineering (Aristotle University of Thessaloniki, Greece), as well as an M.Sc. and a Ph.D. in computer science (Loughborough University, UK). He is a professor at the Department of Applied Informatics, University of Macedonia, Thessaloniki, Greece.

See other products by Margaritis