You're reading from Machine Learning Techniques for Text Apply modern techniques with Python for text processing, dimensionality reduction, classification, and evaluation

Product type Paperback

Published in Oct 2022

Publisher Packt

ISBN-13 9781803242385

Length 448 pages

Edition 1st Edition

Languages

Python

Concepts

Machine Learning

Author (1):

Nikos Tsourakis

View More author details

Table of Contents (13) Chapters

Preface

1. Chapter 1: Introducing Machine Learning for Text

2. Chapter 2: Detecting Spam Emails FREE CHAPTER

3. Chapter 3: Classifying Topics of Newsgroup Posts

4. Chapter 4: Extracting Sentiments from Product Reviews

5. Chapter 5: Recommending Music Titles

6. Chapter 6: Teaching Machines to Translate

7. Chapter 7: Summarizing Wikipedia Articles

8. Chapter 8: Detecting Hateful and Offensive Language

9. Chapter 9: Generating Text in Chatbots

10. Chapter 10: Clustering Speech-to-Text Transcriptions

11. Index

Why subscribe?

12. Other Books You May Enjoy

Creating validation sets

Throughout the book, we mentioned many times that we need to experiment with multiple configurations of the models to find the optimal one. The most typical pipeline is adjusting the hyperparameters and the topology of deep learning architecture, training on a set of samples, and testing on another set. For that reason, machine learning is a highly iterative process. This strategy engenders a particular risk, however. Evaluating different model configurations with a given test set over multiple rounds leads to a model tuned to work well with the specific set. As the number of epochs increases, we implicitly fit the model to the peculiarities of the test set and consequently get a too-optimistic performance in the end.

We need a way to validate our model performance during training while leaving the test set for the final evaluation. This role is undertaken by the validation set that helps us tune the model’s hyperparameters and configurations accordingly...

The rest of the chapter is locked

You're reading from Machine Learning Techniques for Text Apply modern techniques with Python for text processing, dimensionality reduction, classification, and evaluation

Table of Contents (13) Chapters

Creating validation sets

Unlock this book and the full library FREE for 7 days

Authors (1)