Explore Products

Best Sellers

New Releases

Books

Videos

Audiobooks

Learning Hub

Free Learning

You're reading from Practical Site Reliability Engineering Automate the process of designing, developing, and delivering highly reliable apps and services with SRE

Product type Paperback

Published in Nov 2018

Publisher Packt

ISBN-13 9781788839563

Length 390 pages

Edition 1st Edition

Tools

Docker

Concepts

Configuration Management

Authors (3):

Pethuru Raj Chelliah

Shailender Singh

Shreyash Naithani

View More author details

Table of Contents (14) Chapters

Preface

1. Demystifying the Site Reliability Engineering Paradigm FREE CHAPTER

2. Microservices Architecture and Containers

3. Microservice Resiliency Patterns

4. DevOps as a Service

5. Container Cluster and Orchestration Platforms

6. Architectural and Design Patterns

7. Reliability Implementation Techniques

8. Realizing Reliable Systems - the Best Practices

9. Service Resiliency

10. Containers, Kubernetes, and Istio Monitoring

11. Post-Production Activities for Ensuring and Enhancing IT Reliability

12. Service Meshes and Container Orchestration Platforms

13. Other Books You May Enjoy

Leave a review - let other readers know what you think

Summary

Monitoring is not a one-time task. We should be regularly measuring what's going on with our Kubernetes pods or our microservices. Monitoring plays a crucial role in the microservice system, as we need to monitor all endpoints in our microservices. To achieve a higher quality product, we should be able to detect failures before our customer does. We should enable anomaly detection and notify our operation team to troubleshoot the problem. We have to set up the necessary monitoring and alerts on both the infrastructure side and the application side. In this chapter, we saw how to use Prometheus and Grafana metrics to create powerful dashboards and alerts.

In the next chapter, we will talk about post-production activities and best practices for ensuring and enhancing the IT reliability.

The rest of the chapter is locked

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at $19.99/month. Cancel anytime

Authors (3)

Pethuru Raj Chelliah

Pethuru Raj Chelliah (PhD) works as the chief architect at the Site Reliability Engineering Center of Excellence, Reliance Jio Infocomm Ltd. (RJIL), Bangalore. Previously, he worked as a cloud infrastructure architect at the IBM Global Cloud Center of Excellence, IBM India, Bangalore, for four years. He also had an extended stint as a TOGAF-certified enterprise architecture consultant in Wipro Consulting services division and as a lead architect in the corporate research division of Robert Bosch, Bangalore. He has more than 17 years of IT industry experience.

See other products by Pethuru Raj Chelliah

Singh

Contacted on 12/01/18 by Davis Anto

See other products by Singh

Naithani

Shreyash Naithani is currently a site reliability engineer at Microsoft R&D. Prior to Microsoft, he worked with both start-ups and mid-level companies. He completed his PG Diploma from the Centre for Development of Advanced Computing, Bengaluru, India, and is a computer science graduate from Punjab Technical University, India. In a short span of time, he has had the opportunity to work as a DevOps engineer with Python/C#, and as a tools developer, site/service reliability engineer, and Unix system administrator. During his leisure time, he loves to travel and binge watch series.

See other products by Naithani