You're reading from Mastering Prometheus Gain expert tips to monitoring your infrastructure, applications, and services

Product type Paperback

Published in Apr 2024

Publisher Packt

ISBN-13 9781805125662

Length 310 pages

Edition 1st Edition

Tools

Prometheus

Concepts

DevOps

Author (1):

Hegedus

View More author details

Table of Contents (21) Chapters

Preface

1. Part 1: Fundamentals of Prometheus

2. Chapter 1: Observability, Monitoring, and Prometheus FREE CHAPTER

3. Chapter 2: Deploying Prometheus

4. Chapter 3: The Prometheus Data Model and PromQL

5. Chapter 4: Using Service Discovery

6. Chapter 5: Effective Alerting with Prometheus

7. Part 2: Scaling Prometheus

8. Chapter 6: Advancing Prometheus: Sharding, Federation, and High Availability

9. Chapter 7: Optimizing and Debugging Prometheus

10. Chapter 8: Enabling Systems Monitoring with the Node Exporter

11. Part 3: Extending Prometheus

12. Chapter 9: Utilizing Remote Storage Systems with Prometheus

13. Chapter 10: Extending Prometheus Globally with Thanos

14. Chapter 11: Jsonnet and Monitoring Mixins

15. Chapter 12: Utilizing Continuous Integration (CI) Pipelines with Prometheus

16. Chapter 13: Defining and Alerting on SLOs

17. Chapter 14: Integrating Prometheus with OpenTelemetry

18. Chapter 15: Beyond Prometheus

19. Index

Why subscribe?

20. Other Books You May Enjoy

Achieving high availability (HA) in Prometheus

Your monitoring environment needs to be one of your most resilient services. It can be a joke that there’s no such thing as 100% uptime, but your monitoring environment should come pretty darn close. After all, it’s what you depend on to let you know when your other services aren’t achieving their 99.9% uptime goal.

Thus far, we’ve only used Prometheus in a single-point-of-failure mode. If Prometheus goes down, all of its metrics and alerts go down with it. This gap in visibility and alerting is unacceptable. So, what can we do about it if Prometheus doesn’t have built-in HA like Alertmanager? The answer? Duplicate it.

Who watches the watchmen?

With an HA Prometheus setup, you can (and should) configure your Prometheus instances so that they monitor each other. Presuming they’re not running on the same physical hardware, unexpected failures should be isolated and you can be alerted to...

The rest of the chapter is locked

You're reading from Mastering Prometheus Gain expert tips to monitoring your infrastructure, applications, and services

Table of Contents (21) Chapters

Achieving high availability (HA) in Prometheus

Authors (1)

Personalised recommendations for you

You're reading from Mastering Prometheus Gain expert tips to monitoring your infrastructure, applications, and services

Table of Contents (21) Chapters

Achieving high availability (HA) in Prometheus

Unlock this book and the full library FREE for 7 days

Authors (1)

Personalised recommendations for you