You're reading from Automated Machine Learning with AutoKeras Deep learning made accessible for everyone with just few lines of coding

Product type Paperback

Published in May 2021

Publisher Packt

ISBN-13 9781800567641

Length 194 pages

Edition 1st Edition

Languages

Python

Tools

AutoKeras

Concepts

Deep Learning

Author (1):

Luis Sobrecueva

View More author details

Table of Contents (15) Chapters

Preface

1. Section 1: AutoML Fundamentals

2. Chapter 1: Introduction to Automated Machine Learning FREE CHAPTER

3. Chapter 2: Getting Started with AutoKeras

4. Chapter 3: Automating the Machine Learning Pipeline with AutoKeras

5. Section 2: AutoKeras in Practice

6. Chapter 4: Image Classification and Regression Using AutoKeras

7. Chapter 5: Text Classification and Regression Using AutoKeras

8. Chapter 6: Working with Structured Data Using AutoKeras

9. Chapter 7: Sentiment Analysis Using AutoKeras

10. Chapter 8: Topic Classification Using AutoKeras

11. Section 3: Advanced AutoKeras

12. Chapter 9: Working with Multimodal and Multitasking Data

13. Chapter 10: Exporting and Visualizing the Models

14. Other Books You May Enjoy

Splitting your dataset for training and evaluation

To evaluate a model, you must divide your dataset into three subsets: a training set, a validation set, and a test set. During the training phase, AutoKeras will train your model with the training dataset, while using the validation dataset to evaluate its performance. Once you are ready, the final evaluation will be done using the test dataset.

Why you should split your dataset

Having a separate test dataset that is not used during training is really important to avoid information leaks.

As we mentioned previously, the validation set is used to tune the hyperparameters of your model based on the performance of the model, but some information about the validation data is filtered into the model. Due to this, you run the risk of ending up with a model that works artificially well with the validation data, because that's what you trained it for. However, the actual performance of the model is due to us using previously...