Subscription

Explore Products

Best Sellers

New Releases

Books

Videos

Audiobooks

Learning Hub

Newsletter Hub

Free Learning

You're reading from Hands-On Infrastructure Monitoring with Prometheus Implement and scale queries, dashboards, and alerting across machines and containers

Product type Paperback

Published in May 2019

Publisher Packt

ISBN-13 9781789612349

Length 442 pages

Edition 1st Edition

Tools

Prometheus

Concepts

Application Monitoring

Authors (3):

Pedro Araujo

Joel Bastos

Pedro Ara√∫jo

View More author details

Table of Contents (21) Chapters

Preface

1. Section 1: Introduction FREE CHAPTER

2. Monitoring Fundamentals

3. An Overview of the Prometheus Ecosystem

4. Setting Up a Test Environment

5. Section 2: Getting Started with Prometheus

6. Prometheus Metrics Fundamentals

7. Running a Prometheus Server

8. Exporters and Integrations

9. Prometheus Query Language - PromQL

10. Troubleshooting and Validation

11. Section 3: Dashboards and Alerts

12. Defining Alerting and Recording Rules

13. Discovering and Creating Grafana Dashboards

14. Understanding and Extending Alertmanager

15. Section 4: Scalability, Resilience, and Maintainability

16. Choosing the Right Service Discovery

17. Scaling and Federating Prometheus

18. Integrating Long-Term Storage with Prometheus

19. Assessments

20. Other Books You May Enjoy

Leave a review - let other readers know what you think

Summary

In this chapter, we tackled issues concerning running Prometheus at scale. Even though a single Prometheus instance can get you a long way, it's a good idea to have the knowledge to grow if required. We've learned how vertical and horizontal sharding works, when to use sharding, and what benefits and concerns sharding brings. We were introduced to common patterns when federating Prometheus (hierarchical or cross-service), and how to choose between them depending on our requirements. Since, sometimes, we want more than the out-of-the-box federation, we were introduced to the Thanos project and how it solves the global view problem.

In the next chapter, we'll be tackling another common requirement and one that isn't a core concern of the Prometheus project, which is the long-term storage of time series.

The rest of the chapter is locked

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at €18.99/month. Cancel anytime

Authors (2)

Joel Bastos

Joel Bastos is an open source supporter and contributor, with a background in infrastructure security and automation. He is always striving for the standardization of processes, code maintainability, and code reusability. He has defined, led, and implemented critical, highly available, and fault-tolerant enterprise and web-scale infrastructures in several organizations, with Prometheus as the cornerstone. He has worked at two unicorn companies in Portugal and at one of the largest transaction-oriented gaming companies in the world. Previously, he has supported several governmental entities with projects such as the Public Key Infrastructure for the Portuguese citizen card. You can find his blogs at kintoandar and on Twitter with the handle @kintoandar.

See other products by Joel Bastos

Pedro Araújo

Pedro Arajo is a site reliability and automation engineer and has defined and implemented several standards for monitoring at scale. His contributions have been fundamental in connecting development teams to infrastructure. He is highly knowledgeable about infrastructure, but his passion is in the automation and management of large-scale, highly-transactional systems. Pedro has contributed to several open source projects, such as Riemann, OpenTSDB, Sensu, Prometheus, and Thanos. You can find him on Twitter with the handle @phcrva.

See other products by Pedro Araújo