Hands-On Infrastructure Monitoring with Prometheus

You're reading from Hands-On Infrastructure Monitoring with Prometheus: Implement and scale queries, dashboards, and alerting across machines and containers

Product type Paperback
Published in May 2019
Publisher Packt
ISBN-13 9781789612349
Length 442 pages
Edition 1st Edition
Authors (3):
Pedro Araujo
Joel Bastos
Pedro Araújo
Table of Contents (21 chapters)

Preface
1. Section 1: Introduction
2. Monitoring Fundamentals
3. An Overview of the Prometheus Ecosystem
4. Setting Up a Test Environment
5. Section 2: Getting Started with Prometheus
6. Prometheus Metrics Fundamentals
7. Running a Prometheus Server
8. Exporters and Integrations
9. Prometheus Query Language - PromQL
10. Troubleshooting and Validation
11. Section 3: Dashboards and Alerts
12. Defining Alerting and Recording Rules
13. Discovering and Creating Grafana Dashboards
14. Understanding and Extending Alertmanager
15. Section 4: Scalability, Resilience, and Maintainability
16. Choosing the Right Service Discovery
17. Scaling and Federating Prometheus
18. Integrating Long-Term Storage with Prometheus
19. Assessments
20. Other Books You May Enjoy

Scaling with the help of sharding

With growth come more teams, more infrastructure, and more applications. Over time, running a single Prometheus server can become infeasible: changes to recording/alerting rules and scrape jobs become more frequent (thus requiring reloads which, depending on the configured scrape intervals, can take up to a couple of minutes), missed scrapes can start to happen as Prometheus becomes overwhelmed, or the person or team responsible for that instance may simply become an organizational bottleneck. When this happens, we need to rethink the architecture of our solution so that it scales accordingly. Thankfully, this is something the community has tackled time and time again, so there are some recommendations on how to approach this problem. These recommendations revolve around sharding.

In this context, sharding means splitting...
