Evaluating outlier detection with the Evaluate API
In the previous section, we touched on the fact that it can be hard for a user to know how to set the threshold on outlier scores that divides the data points in the dataset into normal and outlier categories. In this section, we will show how to approach this problem when you have a labeled dataset that records, for each point, the ground truth of whether that point is an outlier. Before we dive into the practical demonstration, let's take a moment to understand some key metrics used to evaluate the performance of an outlier detection algorithm.
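To make the thresholding step concrete, here is a minimal sketch of how continuous outlier scores are turned into binary normal/outlier predictions. The scores, labels, and threshold value are hypothetical, chosen only for illustration:

```python
import numpy as np

# Hypothetical outlier scores from a detection algorithm and the
# corresponding ground truth labels (1 = outlier, 0 = normal).
scores = np.array([0.05, 0.92, 0.11, 0.78, 0.30, 0.64])
labels = np.array([0, 1, 0, 1, 0, 1])

# Applying a threshold converts each continuous score into a
# binary prediction: scores at or above the threshold are outliers.
threshold = 0.5
predicted = (scores >= threshold).astype(int)

print(predicted.tolist())  # [0, 1, 0, 1, 0, 1]
```

With a labeled dataset like this, different threshold values can be compared against the ground truth to find one that best separates the two categories.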
One of the simplest ways to measure the performance of the algorithm is to count the data points it correctly predicted as outliers; in other words, the number of true positives (TPs). In addition, we also want to know the number of true negatives (TNs): how many normal data points were correctly predicted as normal. By extension...
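The two counts described above can be sketched in a few lines. The prediction and ground truth vectors here are hypothetical, chosen only to show the counting logic:

```python
# Hypothetical binary predictions and ground truth (1 = outlier, 0 = normal).
predicted = [0, 1, 0, 1, 0, 1, 1, 0]
actual    = [0, 1, 0, 0, 0, 1, 1, 1]

# True positives: points predicted as outliers that really are outliers.
tp = sum(1 for p, a in zip(predicted, actual) if p == 1 and a == 1)

# True negatives: normal points correctly predicted as normal.
tn = sum(1 for p, a in zip(predicted, actual) if p == 0 and a == 0)

print(tp, tn)  # 3 3
```

Note that the two mismatched pairs in this example (a normal point flagged as an outlier, and an outlier missed) are exactly the false positives and false negatives that the remaining counts capture.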