Subscription

Explore Products

Best Sellers

New Releases

Books

Videos

Audiobooks

Learning Hub

Conferences

Free Learning

You're reading from Hands-On Predictive Analytics with Python Master the complete predictive analytics process, from problem definition to model deployment

Product type Paperback

Published in Dec 2018

Publisher Packt

ISBN-13 9781789138719

Length 330 pages

Edition 1st Edition

Languages

Python

Tools

TensorFlow

Concepts

Predictive Analytics

Author (1):

Alvaro Fuentes

View More author details

Table of Contents (11) Chapters

Preface

1. The Predictive Analytics Process FREE CHAPTER

2. Problem Understanding and Data Preparation

3. Dataset Understanding – Exploratory Data Analysis

4. Predicting Numerical Values with Machine Learning

5. Predicting Categories with Machine Learning

6. Introducing Neural Nets for Predictive Analytics

7. Model Evaluation

8. Model Tuning and Improving Performance

9. Implementing a Model with Dash

10. Other Books You May Enjoy

Leave a review - let other readers know what you think

The k-fold cross-validation

So far, we have been evaluating our models in the test set. By now, it is clear why we do it; however, there is one point we have not discussed yet. Let's go back to the diamond prices problem. In this chapter, we have built a simple multiple linear regression model and we have calculated some metrics on the test set. Let's say that we will use the MAE for evaluating the model. When we calculated this metric, we got 733.67. Now let's repeat the same steps for model building:

Train-test split
Standardize the numeric features
Model training
Get predictions
Evaluate the model using the same metric

Here we have the code again:

## Train-test split
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.1, random_state=2)

## Standardize the numeric features 
scaler = StandardScaler()
scaler.fit(X_train[numerical_features])
X_train...

The rest of the chapter is locked

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at €18.99/month. Cancel anytime

Authors (1)

Alvaro Fuentes

Alvaro Fuentes is a senior data scientist with a background in applied mathematics and economics. He has more than 14 years of experience in various analytical roles and is an analytics consultant at one of the ‘Big Three' global management consulting firms, leading advanced analytics projects in different industries like banking, technology, and consumer goods. Alvaro is also an author and trainer in analytics and data science and has published courses and books, such as 'Become a Python Data Analyst' and 'Hands-On Predictive Analytics with Python'. He has also taught data science and related topics to thousands of students both on-site and online through different platforms such as Springboard, Simplilearn, Udemy, and BSG Institute, among others.

See other products by Alvaro Fuentes