Being alerted versus being alarmed
Incident management is a common process used in many areas, from physical incidents such as fire and medical emergencies to computer security or service failure. While we may not handle life-threatening incidents in the computing world, the stress caused by bad incident management processes can be very significant, from anxiety and depression to complete burnout, and it can increase the chance of heart attacks and strokes. Our aim in this section is to explain how observability and Grafana’s tools fit into an incident management strategy, and how to use them to reduce the impact on your teams, the duration of incidents, and the frequency of incidents. We will explore the details of these concepts and the tools available in Grafana to support them further throughout the chapter.
There are a lot of great public resources available on the topic of incident management; here are some for you to explore if you wish:
- Emergency response...