Adapting to Change
It is recommended to frequently review how workload monitoring is implemented so you can update it if needed, based on significant events and changes. Among the things to review, look at your metrics, KPIs, and SLAs. Verify that these metrics are still meaningful to your current workload since business priorities change over time.
Additionally, auditing your monitoring setup is another safeguard to make sure that you know whether an application is meeting its reliability objectives. Establishing regular operational performance reviews and conducting knowledge-sharing sessions is a great way to enhance your organization’s ability to achieve higher operational performance.
For example, AWS service teams conduct internal weekly reviews assessing various teams’ operational performance. This allows them to share learnings among teams. Since there are too many reviews to go through each time, AWS has created an application that it has open sourced,...