Summary
In this chapter, we covered the intricate concept of observability and what an observable system is. There’s no price to knowing how to explain what monitoring and telemetry are and how they relate to reliability. Having a good grasp on APM and how it is used to measure reliability for applications is a must-have for SREs. You heard all about recent concepts related to monitoring and event technologies. You also acquired knowledge on how alerting is done by SREs and how observability is a guiding principle at the end of the day. Finally, you consolidated the knowledge from this chapter by going through the practical simulation lab available on GitHub and understanding how to further develop it.
In the next chapter, you will learn how to approach an issue by isolating possible causes and effects, and diagnosing an anomaly when the observability platform has detected one.