You're reading from Modern Time Series Forecasting with Python: Explore industry-ready time series forecasting using modern machine learning and deep learning

Product type Paperback

Published in Nov 2022

Publisher Packt

ISBN-13 9781803246802

Length 552 pages

Edition 1st Edition

Languages

Python

Tools

Time Series Forecasting

Concepts

Data Science

Author (1):

Manu Joseph

Validation strategies for datasets with multiple time series

All the strategies we have seen so far are perfectly valid for datasets with multiple time series, such as the London Smart Meters dataset we have been working with in this book. The insights we discussed in the last section also apply. Implementing these strategies can be slightly tricky, however, because the scikit-learn classes we discussed assume a single time series, sorted in temporal order. If the dataset holds multiple time series, the splits will be haphazard and messy, because a boundary defined by row position no longer corresponds to the same point in time for every series.
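To make the problem concrete, here is a minimal sketch that applies scikit-learn's `TimeSeriesSplit` (assumed here as a representative of the classes mentioned above) to a toy long-format dataset. The two series, the column names `series_id`, `timestamp`, and `y`, and the number of splits are all illustrative assumptions, not taken from the London Smart Meters data.

```python
import pandas as pd
from sklearn.model_selection import TimeSeriesSplit

# Two hypothetical series ("A" and "B") with six daily observations each,
# stacked in long format: all rows of A first, then all rows of B.
dates = pd.date_range("2022-01-01", periods=6, freq="D")
df = pd.concat(
    [
        pd.DataFrame({"series_id": sid, "timestamp": dates, "y": range(6)})
        for sid in ["A", "B"]
    ],
    ignore_index=True,
)

# TimeSeriesSplit splits purely by row position and knows nothing
# about series_id or timestamp.
splitter = TimeSeriesSplit(n_splits=2)
folds = list(splitter.split(df))

for fold, (train_idx, val_idx) in enumerate(folds):
    train, val = df.iloc[train_idx], df.iloc[val_idx]
    print(f"fold {fold}")
    print("  series in validation:", sorted(val["series_id"].unique()))
    print("  latest train timestamp:    ", train["timestamp"].max().date())
    print("  earliest validation timestamp:", val["timestamp"].min().date())
```

In the last fold of this toy setup, the validation set contains rows from series B only, while the training set already includes observations of series A that are later in time than the start of the validation period. That is the kind of messy split the paragraph above warns about.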

There are a couple of options we can adopt for datasets with multiple time series:

We can loop over the different time series, apply the methods we discussed to create the train-validation split for each one, and then concatenate the resulting sets across all the time series. However, that is not very efficient.
We can write some code and design the validation strategies...
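The first option above, looping over the series and concatenating the per-series splits, could look roughly like the following sketch. It assumes a long-format pandas DataFrame and uses a simple hold-out of the last few observations as the split rule; the helper name `holdout_per_series` and the column names are hypothetical and chosen only for illustration.

```python
import pandas as pd


def holdout_per_series(df, id_col, time_col, n_val):
    """Hypothetical helper: hold out the last n_val rows of every series."""
    if n_val <= 0:
        raise ValueError("n_val must be a positive integer")
    train_parts, val_parts = [], []
    # Loop over the individual series, one group per identifier.
    for _, group in df.groupby(id_col, sort=False):
        group = group.sort_values(time_col)  # enforce temporal order
        train_parts.append(group.iloc[:-n_val])
        val_parts.append(group.iloc[-n_val:])
    # Concatenate the per-series pieces back into two datasets.
    return pd.concat(train_parts), pd.concat(val_parts)


# Toy data: two illustrative series with six daily observations each.
dates = pd.date_range("2022-01-01", periods=6, freq="D")
data = pd.concat(
    [
        pd.DataFrame({"series_id": sid, "timestamp": dates, "y": range(6)})
        for sid in ["A", "B"]
    ],
    ignore_index=True,
)

train, val = holdout_per_series(data, "series_id", "timestamp", n_val=2)
print(train.groupby("series_id")["timestamp"].max())
print(val.groupby("series_id")["timestamp"].min())
```

Each series is split at its own temporal boundary, so no series has training rows that come after its validation rows. The explicit Python loop is the part that becomes slow when the dataset contains many series.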