Advancing Prometheus: Sharding, Federation, and High Availability
If you’re reading this book, chances are that you already had some experience with Prometheus before finding this book. If so, at some point, you’ve also probably run into the need to scale Prometheus beyond just a single Prometheus instance managing everything. There are a variety of solutions to this problem and in this chapter, we’ll cover a few of the built-in ones. As a bonus, we’ll look at how to make your Prometheus metrics highly available.
In this chapter, we’re going to cover the following main topics:
- Prometheus’ limitations
- Sharding Prometheus
- Federating Prometheus
- Achieving high availability (HA) in Prometheus
Let’s get started!