You're reading from Time Series Analysis with Python Cookbook Practical recipes for exploratory data analysis, data preparation, forecasting, and model evaluation

Product type Paperback

Published in Jun 2022

Publisher Packt

ISBN-13 9781801075541

Length 630 pages

Edition 1st Edition

Languages

Python

Tools

Keras

Concepts

Data Analysis

Author (1):

Tarek A. Atwan

View More author details

Table of Contents (18) Chapters

Preface

1. Chapter 1: Getting Started with Time Series Analysis

2. Chapter 2: Reading Time Series Data from Files FREE CHAPTER

3. Chapter 3: Reading Time Series Data from Databases

4. Chapter 4: Persisting Time Series Data to Files

5. Chapter 5: Persisting Time Series Data to Databases

6. Chapter 6: Working with Date and Time in Python

7. Chapter 7: Handling Missing Data

8. Chapter 8: Outlier Detection Using Statistical Methods

9. Chapter 9: Exploratory Data Analysis and Diagnosis

10. Chapter 10: Building Univariate Time Series Models Using Statistical Methods

11. Chapter 11: Additional Statistical Modeling Techniques for Time Series

12. Chapter 12: Forecasting Using Supervised Machine Learning

13. Chapter 13: Deep Learning for Time Series Forecasting

14. Chapter 14: Outlier Detection Using Unsupervised Machine Learning

15. Chapter 15: Advanced Techniques for Complex Time Series

16. Index

Why subscribe?

17. Other Books You May Enjoy

Detecting outliers using LOF

In the previous recipe, Detecting outliers using KNN, in the KNN algorithm, the decision scoring for detecting outliers was based on the distance between observations. A data point far from its KNN can be considered an outlier. Overall, the algorithm does a good job of capturing global outliers, but those far from the surrounding points may not do well with identifying local outliers.

This is where the LOF comes in to solve this limitation. Instead of using the distance between neighboring points, it uses density as a basis for scoring data points and detecting outliers. The LOF is considered a density-based algorithm. The idea behind the LOF is that outliers will be further from other data points and more isolated, and thus will be in low-density regions.

It is easier to illustrate this with an example: imagine a person standing in line in a small but busy Starbucks, and everyone is pretty much close to each other; then, we can say the person is...