You're reading from AWS Observability Handbook Monitor, trace, and alert your cloud applications with AWS' myriad observability tools

Product type Paperback

Published in Apr 2023

Publisher Packt

ISBN-13 9781804616710

Length 504 pages

Edition 1st Edition

Tools

AWS

Concepts

DevOps

Authors (2):

Fabio Oliveira

Phani Kumar Lingamallu

View More author details

Table of Contents (22) Chapters

Preface

1. Part 1: Getting Started with Observability on AWS

2. Chapter 1: Observability 101 FREE CHAPTER

3. Chapter 2: Overview of the Observability Landscape on AWS

4. Chapter 3: Gathering Operational Data and Alerting Using Amazon CloudWatch

5. Chapter 4: Implementing Distributed Tracing Using AWS X-Ray

6. Part 2: Automated and Machine Learning-Powered Observability on AWS

7. Chapter 5: Insights into Operational Data with CloudWatch

8. Chapter 6: Observability for Containerized Applications on AWS

9. Chapter 7: Observability for Serverless Applications on AWS

10. Chapter 8: End User Experience Monitoring on AWS

11. Part 3: Open Source Managed Services on AWS

12. Chapter 9: Collecting Metrics and Traces Using OpenTelemetry

13. Chapter 10: Deploying and Configuring an Amazon Managed Service for Prometheus

14. Chapter 11: Deploying the Elasticsearch, Logstash, and Kibana Stack Using Amazon OpenSearch Service

15. Part 4: Scaled Observability and Beyond

16. Chapter 12: Augmenting the Human Operator with Amazon DevOps Guru

17. Chapter 13: Observability Best Practices at Scale

18. Chapter 14: Be Well-Architected for Operational Excellence

19. Chapter 15: The Role of Observability in the Cloud Adoption Framework

20. Index

Why subscribe?

21. Other Books You May Enjoy

Benefits of observability

Adopting observability to analyze system performance used to be the job of sysadmins and ops teams, who cared most about the mean time to detect (MTTD) and mean time to resolve (MTTR). Today, more job roles than ever need to use observability data. With the rise of DevOps, CI/CD, and Agile methods, developers are often directly responsible for the performance of their apps in production. SREs and DevOps staff care about meeting service-level indicators (SLIs) and service-level objectives (SLOs). Information about systems and workloads is also used by business leaders in making decisions about capacity, spending, risk, and end user experience. Each stakeholder in an organization has different needs for what is monitored and how the resulting data is analyzed, reported, and displayed. Let’s try to understand the benefits of observability in the real world for different personas.

Understanding application health and performance to improve customer experience

The main observability goal is to know what is going on anywhere in your system to ensure the best possible experience for your end users. You want to detect problems quickly, investigate them efficiently, and remediate them as soon as possible to minimize downtime and other disruptions to your customers.

Improving developer productivity

Traditional debugging by analyzing logs or instrumenting breakpoints into code is tedious, repetitive, and time-consuming. It doesn’t scale well for production applications or those built using microservice or serverless architectures. To analyze performance across distributed applications, developers need to correlate metrics and traces to identify user impact from any source and to find broken or expensive code paths as quickly as possible. And they need to do all this without having to re-instrument their code when they want to add new observability tools to their kit.

Getting more insight with visualizations

Observability, especially at scale, can generate huge volumes of data that become difficult for humans to parse. Visualization tools help humans make sense of data by correlating observability data into intuitive graphic displays. However, having a bunch of graphs, charts, and more scattered across multiple tools and displays becomes a problem. It’s essential to centralize visual data into a single dashboard, giving you a unified view of your system’s critical information and performance.

Digital eperience monitoring

Digital Experience Monitoring (DEM) correlates infrastructure and operations metrics with business outcomes by focusing on the end user experience. It seeks to reduce the MTTR in the event of client-side performance issues by monitoring the client-side performance on web and mobile applications in real time. Resolution is assisted by the relevant debugging data such as error messages, stack traces, and user sessions to fix performance issues such as JavaScript errors, crashes, and latencies.

Controlling cost and planning capacity

A key advantage to operating in the cloud is that you can scale quickly to meet demand during peak load times. However, unplanned and uncontrolled growth can result in unexpected costs. Observability can help you find performance improvements, such as reducing the CPU footprint. Across a fleet of thousands or hundreds of thousands of instances, a slight percentage performance improvement in how much CPU an application uses can save millions of dollars. Similarly, by using observability to understand and predict your future capacity needs, you can take advantage of the cost savings available from reserve and spot pricing and avoid cost surprises.