Subscription

Explore Products

Best Sellers

New Releases

Books

Videos

Audiobooks

Learning Hub

Free Learning

You're reading from Python for Finance Cookbook Over 50 recipes for applying modern Python libraries to financial data analysis

Product type Paperback

Published in Jan 2020

Publisher Packt

ISBN-13 9781789618518

Length 432 pages

Edition 1st Edition

Languages

Python

Tools

Keras

Concepts

Data Analysis

Author (1):

Eryk Lewinson

View More author details

Table of Contents (12) Chapters

Preface

1. Financial Data and Preprocessing

2. Technical Analysis in Python FREE CHAPTER

3. Time Series Modeling

4. Multi-Factor Models

5. Modeling Volatility with GARCH Class Models

6. Monte Carlo Simulations in Finance

7. Asset Allocation in Python

8. Identifying Credit Default with Machine Learning

9. Advanced Machine Learning Models in Finance

10. Deep Learning in Finance

11. Other Books You May Enjoy

Leave a review - let other readers know what you think

Splitting data into training and test sets

Having completed the EDA, the next step is to split the dataset into training and test sets. The idea is to have two separate datasets:

Training set—On this part of the data, we train a machine learning model
Test set—This part of the data was not seen by the model during training, and is used to evaluate the performance

What we want to achieve by splitting the data is preventing overfitting. Overfitting is a phenomenon whereby a model finds too many patterns in data used for training and performs well only on that particular data. In other words, it fails to generalize to unseen data.

This is a very important step in the analysis, as doing it incorrectly can introduce bias, for example, in the form of data leakage. Data leakage can occur when, during the training phase, a model observes information to which it should...

The rest of the chapter is locked

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at $19.99/month. Cancel anytime

Authors (1)

Eryk Lewinson

Eryk Lewinson received his master's degree in Quantitative Finance from Erasmus University Rotterdam. In his professional career, he has gained experience in the practical application of data science methods while working in risk management and data science departments of two "big 4" companies, a Dutch neo-broker and most recently the Netherlands' largest online retailer. Outside of work, he has written over a hundred articles about topics related to data science, which have been viewed more than 3 million times. In his free time, he enjoys playing video games, reading books, and traveling with his girlfriend.

See other products by Eryk Lewinson