Observability consumers – self-healing
Self-healing infrastructure, applications, and services are widely regarded as the next step in artificial intelligence for IT operations (AIOps) and automation. By implementing observability, developers and engineers can generate and ingest large volumes of data about an application or service. From this data, observability tools can detect performance deviations and service instability, and can identify potential outages. Once such a condition is detected, the information can be passed to a downstream system, such as an automation tool, which invokes workflows to resolve the issue or restore the health of the application or service.
Figure 5.7 – An observability future workflow
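The detect-then-remediate flow described above can be illustrated with a minimal Python sketch. It is not tied to any particular observability or automation product. The metric names, thresholds, and the functions `detect_deviation`, `restart_service`, `clear_cache`, and `self_heal` are all hypothetical and chosen only for illustration. In a real setup, the remediation functions would call out to an automation tool rather than return a string.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class Alert:
    """A detected deviation for one metric of one service."""
    service: str
    metric: str
    value: float
    threshold: float


def detect_deviation(service: str,
                     metrics: Dict[str, float],
                     thresholds: Dict[str, float]) -> List[Alert]:
    """Return an Alert for every metric that exceeds its threshold."""
    alerts = []
    for name, value in metrics.items():
        limit = thresholds.get(name)
        if limit is not None and value > limit:
            alerts.append(Alert(service, name, value, limit))
    return alerts


# Hypothetical remediation actions. Here they only describe the action;
# a real handler would trigger a workflow in an automation tool.
def restart_service(alert: Alert) -> str:
    return f"restart {alert.service}"


def clear_cache(alert: Alert) -> str:
    return f"clear cache for {alert.service}"


# Assumed mapping from metric name to remediation handler.
REMEDIATIONS: Dict[str, Callable[[Alert], str]] = {
    "error_rate": restart_service,
    "cache_miss_ratio": clear_cache,
}


def self_heal(alerts: List[Alert]) -> List[str]:
    """Dispatch each alert to its handler, or escalate if none is known."""
    actions = []
    for alert in alerts:
        handler = REMEDIATIONS.get(alert.metric)
        if handler is None:
            actions.append(
                f"escalate {alert.metric} on {alert.service} to on-call")
        else:
            actions.append(handler(alert))
    return actions


if __name__ == "__main__":
    observed = {"error_rate": 0.12, "latency_ms": 180.0}
    limits = {"error_rate": 0.05, "latency_ms": 500.0}
    print(self_heal(detect_deviation("checkout", observed, limits)))
```

Keeping detection separate from remediation mirrors the split between the observability tool and the downstream automation system. Alerts with no known handler are escalated to a person instead of being acted on automatically.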
Shell (sh) and Python scripts are commonly used to speed up recovery during outages by automating recovery steps such as restarting servers, diverting traffic to a different set of endpoints, clearing the cache...