You're reading from Time Series Analysis with Python Cookbook Practical recipes for exploratory data analysis, data preparation, forecasting, and model evaluation

Product type Paperback

Published in Jun 2022

Publisher Packt

ISBN-13 9781801075541

Length 630 pages

Edition 1st Edition

Languages

Python

Tools

Keras

Concepts

Data Analysis

Author (1):

Tarek A. Atwan

View More author details

Table of Contents (18) Chapters

Preface

1. Chapter 1: Getting Started with Time Series Analysis

2. Chapter 2: Reading Time Series Data from Files FREE CHAPTER

3. Chapter 3: Reading Time Series Data from Databases

4. Chapter 4: Persisting Time Series Data to Files

5. Chapter 5: Persisting Time Series Data to Databases

6. Chapter 6: Working with Date and Time in Python

7. Chapter 7: Handling Missing Data

8. Chapter 8: Outlier Detection Using Statistical Methods

9. Chapter 9: Exploratory Data Analysis and Diagnosis

10. Chapter 10: Building Univariate Time Series Models Using Statistical Methods

11. Chapter 11: Additional Statistical Modeling Techniques for Time Series

12. Chapter 12: Forecasting Using Supervised Machine Learning

13. Chapter 13: Deep Learning for Time Series Forecasting

14. Chapter 14: Outlier Detection Using Unsupervised Machine Learning

15. Chapter 15: Advanced Techniques for Complex Time Series

16. Index

Why subscribe?

17. Other Books You May Enjoy

Detecting outliers using a modified z-score

In the Detecting outliers using a z-score recipe, you experienced how simple and intuitive the method is. But it has one major drawback: it assumes your data is normally distributed.

But, what if your data is not normally distributed? Luckily, there is a modified version of the z-score to work with non-normal data. The main difference between the regular z-score and the modified z-score is that we replace the mean with the median:

Where (tilde x) is the median of the dataset, and MAD is the median absolute deviation of the dataset:

The 0.6745 value is the standard deviation unit that corresponds to the 75th percentile (Q3) in a Gaussian distribution and is used as a normalization factor. In other words, it is used to approximate the standard deviation. This way, the units you obtain from this method are measured in standard deviation, similar to how you would interpret the regular...

The rest of the chapter is locked

You're reading from Time Series Analysis with Python Cookbook Practical recipes for exploratory data analysis, data preparation, forecasting, and model evaluation

Table of Contents (18) Chapters

Detecting outliers using a modified z-score

Authors (1)

Personalised recommendations for you

You're reading from Time Series Analysis with Python Cookbook Practical recipes for exploratory data analysis, data preparation, forecasting, and model evaluation

Table of Contents (18) Chapters

Detecting outliers using a modified z-score

Unlock this book and the full library FREE for 7 days

Authors (1)

Personalised recommendations for you