You're reading from Machine Learning for Streaming Data with Python Rapidly build practical online machine learning solutions using River and other top key frameworks

Product type Paperback

Published in Jul 2022

Publisher Packt

ISBN-13 9781803248363

Length 258 pages

Edition 1st Edition

Languages

Python

Tools

River

Concepts

Machine Learning

Author (1):

Joos Korstanje

View More author details

Table of Contents (17) Chapters

Preface

1. Part 1: Introduction and Core Concepts of Streaming Data

2. Chapter 1: An Introduction to Streaming Data FREE CHAPTER

3. Chapter 2: Architectures for Streaming and Real-Time Machine Learning

4. Chapter 3: Data Analysis on Streaming Data

5. Part 2: Exploring Use Cases for Data Streaming

6. Chapter 4: Online Learning with River

7. Chapter 5: Online Anomaly Detection

8. Chapter 6: Online Classification

9. Chapter 7: Online Regression

10. Chapter 8: Reinforcement Learning

11. Part 3: Advanced Concepts and Best Practices around Streaming Data

12. Chapter 9: Drift and Drift Detection

13. Chapter 10: Feature Transformation and Scaling

14. Chapter 11: Catastrophic Forgetting

15. Chapter 12: Conclusion and Best Practices

16. Other Books You May Enjoy

Defining drift

It is a well-known and commonly observed problem that models tend to start performing worse with time. Whether your metric is accuracy, R2 score, F1 score, or anything else, you will see a slow but steady decrease in performance over time if you put models into production and do not update them.

Depending on your use case, this decrease may become problematic quickly or slowly. Some use cases need to have continuous, near-perfect predictions. In some use cases— for example, for specialized ML in which the models have a direct impact on life—you would be strongly shocked if you observed a 1 percent decrease. In other use cases, ML is used more as automation than as prediction, and the business partners may not even notice a 5 percent decrease.

Whether it is going to impact you is not the question here. What is sure, is that in general, you will see your models decreasing. The goal for this chapter is to make sure to find out why model performance is...