There are a lot of ML tutorials on the internet, and their sample datasets are usually clean, formatted, and ready to be fed to algorithms, because the aim of many tutorials is to showcase the capabilities of a certain tool, library, or Software as a Service (SaaS) offering.
In reality, datasets come in different types and sizes. A recent industry survey conducted by Kaggle in 2017, titled The State of Data Science and Machine Learning, with over 16,000 responses, shows that the three most commonly used data types are relational data, text data, and image data.
Moreover, messy data sits at the top of the list of problems that people have to deal with, again based on the Kaggle survey. When a dataset is messy and needs a lot of special treatment before ML models can use it, you spend a considerable amount of time cleaning and manipulating the data and hand-crafting features to get it into the right shape. This is the most time-consuming part of any data science project.
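To make this concrete, the short pandas sketch below shows the kind of cleanup and hand-crafted feature this typically involves; the file name and column names (customers.csv, churned, age, country, and so on) are made up purely for illustration, so adapt them to your own data.

```python
import pandas as pd

# Hypothetical file and column names, used purely for illustration.
df = pd.read_csv("customers.csv")

# Remove exact duplicates and rows that are missing the target label.
df = df.drop_duplicates()
df = df.dropna(subset=["churned"])

# Fill missing numeric values and normalize an inconsistently formatted text field.
df["age"] = df["age"].fillna(df["age"].median())
df["country"] = df["country"].str.strip().str.lower()

# Hand-craft a simple feature from two raw columns.
df["spend_per_visit"] = df["total_spend"] / df["visits"].clip(lower=1)
```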
What about selecting performant ML models and optimizing their hyperparameters across the training, validation, and testing phases? These are also crucially important steps that can be carried out in many ways.
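As a minimal sketch of what those steps can look like in code, the scikit-learn example below tunes a random forest with a cross-validated grid search and then checks it on a held-out split; the synthetic dataset, estimator choice, and parameter grid are arbitrary examples rather than recommendations.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV, train_test_split

# Synthetic data stands in for a real dataset.
X, y = make_classification(n_samples=500, n_features=20, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0
)

# Cross-validated grid search over a small, illustrative hyperparameter grid.
param_grid = {"n_estimators": [100, 300], "max_depth": [None, 10]}
search = GridSearchCV(RandomForestClassifier(random_state=0), param_grid, cv=5)
search.fit(X_train, y_train)

# Evaluate the best configuration on the held-out test split.
print(search.best_params_, search.score(X_test, y_test))
```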
When the right combination of components comes together to process and model your dataset, that combination represents an ML pipeline, and the number of pipelines to experiment with can grow very quickly.
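In scikit-learn terms, for instance, such a combination can be expressed as a Pipeline object; the particular steps below (median imputation, standard scaling, logistic regression) are just one of many possible combinations.

```python
from sklearn.impute import SimpleImputer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

# One candidate pipeline: impute missing values, scale features, then classify.
pipeline = Pipeline([
    ("impute", SimpleImputer(strategy="median")),
    ("scale", StandardScaler()),
    ("model", LogisticRegression(max_iter=1000)),
])
# Every alternative imputer, scaler, or model multiplies the number of candidate pipelines.
```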
To build high-performing ML pipelines successfully, you should systematically work through the available options for every step, taking into account your limits in terms of time and hardware/software resources.
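One pragmatic way to run such a systematic sweep within a budget is a randomized search over both the pipeline steps and their hyperparameters, capping the number of candidates with n_iter to match your time and hardware limits; the pipeline and search space below are purely illustrative.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import RandomizedSearchCV
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import MinMaxScaler, StandardScaler

# Synthetic data stands in for a real dataset.
X, y = make_classification(n_samples=500, n_features=20, random_state=0)

pipe = Pipeline([
    ("scale", StandardScaler()),
    ("model", LogisticRegression(max_iter=1000)),
])

# Each dictionary varies a pipeline step and its hyperparameters;
# n_iter caps how many candidates are tried, acting as the budget.
param_distributions = [
    {
        "scale": [StandardScaler(), MinMaxScaler()],
        "model": [LogisticRegression(max_iter=1000)],
        "model__C": [0.1, 1.0, 10.0],
    },
    {
        "scale": [StandardScaler(), MinMaxScaler()],
        "model": [RandomForestClassifier(random_state=0)],
        "model__n_estimators": [100, 300],
    },
]
search = RandomizedSearchCV(pipe, param_distributions, n_iter=8, cv=3, random_state=0)
search.fit(X, y)
print(search.best_params_)
```

Raising or lowering n_iter (or the cross-validation folds) is one simple lever for trading search thoroughness against the time and compute you can afford.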