The point of splitting the dataset into training and testing sets was to simulate making predictions on data the model has never seen. As we said before, the goal is to generalize what we have learned from the observed data. The training MSE (or any metric calculated on the training dataset) can give us a biased view of the model's performance, especially because of the possibility of overfitting: performance metrics computed on the training dataset tend to be too optimistic. Let's take another look at our illustration of overfitting:
If we calculate the training MSE for these three cases, we will get the lowest (and hence apparently best) value for the third model, the degree-16 polynomial; as we can see, the curve passes through many of the points, making the error for those points exactly 0. However...
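To make this concrete, here is a minimal sketch of the comparison, assuming a small synthetic one-dimensional dataset and scikit-learn (neither of which comes from the example above). It fits polynomials of degree 1, 3, and 16 on a training split and prints the training and testing MSE for each; you should typically see the training MSE shrink toward 0 as the degree grows, while the testing MSE does not follow suit.

```python
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import PolynomialFeatures
from sklearn.linear_model import LinearRegression
from sklearn.pipeline import make_pipeline
from sklearn.metrics import mean_squared_error

# Hypothetical toy dataset: a noisy sine wave, standing in for the observed data.
rng = np.random.default_rng(0)
x = np.sort(rng.uniform(0, 1, 30))
y = np.sin(2 * np.pi * x) + rng.normal(scale=0.2, size=x.shape)
X = x.reshape(-1, 1)

# Hold out a test set to simulate data the model has not seen.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0
)

for degree in (1, 3, 16):
    # Polynomial regression of the given degree.
    model = make_pipeline(PolynomialFeatures(degree), LinearRegression())
    model.fit(X_train, y_train)
    train_mse = mean_squared_error(y_train, model.predict(X_train))
    test_mse = mean_squared_error(y_test, model.predict(X_test))
    print(f"degree {degree:2d}: train MSE = {train_mse:.4f}, test MSE = {test_mse:.4f}")
```

On a run like this, the degree-16 model usually posts a training MSE close to 0, exactly the optimistic picture described above, which is why we judge the models on the testing MSE instead.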