Hyperparameters
In the previous section, we gave a conceptual overview of how decision trees are constructed. In particular, we established that one criterion for deciding where a decision tree should be split (in other words, when a new path should be added to our conceptual flowchart) is the purity of the resulting nodes. We also noted that allowing the algorithm to focus exclusively on node purity as the criterion for constructing the tree quickly leads to trees that overfit the training data. Such trees are so finely tuned to the training data that they capture not only the most salient features for classifying a given data point but also the noise in the data, modeling it as though it were a real signal. Therefore, while a decision tree that is allowed to optimize for specific metrics without restriction will perform very well on the training data, it will neither perform well on the testing dataset nor generalize to unseen data.
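To make this concrete, here is a minimal pure-Python sketch (not any particular library's implementation, and the dataset, function names, and `max_depth` parameter are illustrative assumptions): it grows a tree on a small one-dimensional dataset containing noisy labels, using Gini impurity as the purity criterion. Grown without restriction, the tree keeps splitting until every node is pure, chasing the noisy points; capping its depth with a `max_depth` hyperparameter stops it after the single most informative split.

```python
from collections import Counter

def gini(labels):
    """Gini impurity: 1 - sum(p_k^2). A value of 0.0 means a perfectly pure node."""
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())

def best_split(xs, ys):
    """Find the threshold on the single feature that minimizes weighted Gini impurity."""
    best = None
    for t in sorted(set(xs)):
        left = [y for x, y in zip(xs, ys) if x <= t]
        right = [y for x, y in zip(xs, ys) if x > t]
        if not left or not right:
            continue  # a split must put at least one point on each side
        score = (len(left) * gini(left) + len(right) * gini(right)) / len(ys)
        if best is None or score < best[0]:
            best = (score, t)
    return best

def grow(xs, ys, depth=0, max_depth=None):
    """Recursively split until nodes are pure -- or until max_depth is reached."""
    split = best_split(xs, ys)
    if gini(ys) == 0.0 or split is None or (max_depth is not None and depth >= max_depth):
        return Counter(ys).most_common(1)[0][0]  # leaf: predict the majority class
    _, t = split
    left = [(x, y) for x, y in zip(xs, ys) if x <= t]
    right = [(x, y) for x, y in zip(xs, ys) if x > t]
    return (t,
            grow([x for x, _ in left], [y for _, y in left], depth + 1, max_depth),
            grow([x for x, _ in right], [y for _, y in right], depth + 1, max_depth))

def depth_of(node):
    """A leaf (a bare class label) has depth 0; an internal node adds one level."""
    if not isinstance(node, tuple):
        return 0
    return 1 + max(depth_of(node[1]), depth_of(node[2]))

# The dominant pattern is 'a' below x = 5 and 'b' above it,
# but two points (x = 4.5 and x = 9) carry noisy labels.
xs = [1, 2, 3, 4, 4.5, 6, 7, 8, 9, 10]
ys = ['a', 'a', 'a', 'a', 'b', 'b', 'b', 'b', 'a', 'b']

unrestricted = grow(xs, ys)             # splits until pure: models the noise too
capped = grow(xs, ys, max_depth=1)      # one split, capturing only the main boundary
print(depth_of(unrestricted), depth_of(capped))  # → 3 1
```

The unrestricted tree needs three levels of splits to carve out the two noisy points, while the depth-capped tree makes a single split near the true boundary. This is the essence of a hyperparameter such as a depth limit: it deliberately stops the purity-driven optimization before the tree starts treating noise as signal.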