You're reading from Mastering Prometheus Gain expert tips to monitoring your infrastructure, applications, and services

Product type Paperback

Published in Apr 2024

Publisher Packt

ISBN-13 9781805125662

Length 310 pages

Edition 1st Edition

Tools

Prometheus

Concepts

DevOps

Author (1):

William Hegedus

View More author details

Table of Contents (21) Chapters

Preface

1. Part 1: Fundamentals of Prometheus FREE CHAPTER

2. Chapter 1: Observability, Monitoring, and Prometheus

3. Chapter 2: Deploying Prometheus

4. Chapter 3: The Prometheus Data Model and PromQL

5. Chapter 4: Using Service Discovery

6. Chapter 5: Effective Alerting with Prometheus

7. Part 2: Scaling Prometheus

8. Chapter 6: Advancing Prometheus: Sharding, Federation, and High Availability

9. Chapter 7: Optimizing and Debugging Prometheus

10. Chapter 8: Enabling Systems Monitoring with the Node Exporter

11. Part 3: Extending Prometheus

12. Chapter 9: Utilizing Remote Storage Systems with Prometheus

13. Chapter 10: Extending Prometheus Globally with Thanos

14. Chapter 11: Jsonnet and Monitoring Mixins

15. Chapter 12: Utilizing Continuous Integration (CI) Pipelines with Prometheus

16. Chapter 13: Defining and Alerting on SLOs

17. Chapter 14: Integrating Prometheus with OpenTelemetry

18. Chapter 15: Beyond Prometheus

19. Index

Why subscribe?

20. Other Books You May Enjoy

Scrape jitter

Scrape jitter is the most common cause of oversized TSDB blocks that I have observed. Recall from Chapter 3 how – from the third scrape onwards – timestamp values are stored in the TSDB as they only store the delta of the delta of the sample timestamp. So long as this delta of the delta is 0, the TSDB’s compaction process can save a lot of space by effectively dropping the timestamp value from stored samples that all occur at a consistent delta. With millions of samples, this can add up to gigabytes of storage space in every TSDB block. However, when the delta of the delta is not consistent, this is referred to as scrape jitter.

Scrape jitter is a way to say that scrapes do not occur at consistent intervals. In Prometheus, this often means that they are off by just a few milliseconds. By default, Prometheus will automatically adjust timestamps that are within a 2ms tolerance.

Configuring timestamp adjustments

Whether or not timestamps are...