Summary
In this chapter, we have seen how to establish a great incident management process, which will help you evaluate and improve your organization’s own process. We have also explored SLIs, SLOs, and SLAs and how to use them to see immediately whether a service is responding successfully. You have gained the skills to select appropriate SLIs, allowing you to share transparently with the rest of your organization whether the services you are responsible for are behaving as expected. In turn, this transparency helps the organization quickly identify where problems are and target resources to address them.
Finally, we looked at the tools offered by Grafana for incident management and saw how to configure and use them to support a great incident management process.
The next chapter will look at how we can use the tools provided by Grafana and OpenTelemetry to automate the processes of collecting, storing, and visualizing data in an observability platform.