Augmenting the Human Operator with Amazon DevOps Guru
Today’s applications are becoming increasingly distributed and complex. We learned in the previous chapters that we need the three pillars of Metrics, Logs, and Traces to achieve good observability. To visualize the data that’s been collected, we need dashboards that can correlate data and provide a drill-down view of the application, such as the CloudWatch service map. While this model is effective for less complex systems, as the volume and diversity of data increase, it becomes challenging to identify and troubleshoot issues manually. Developers or administrators may face difficulties in locating and resolving problems as they need to correlate information manually from multiple sources and tools. The constant alerts and notifications from different tools can also lead to alarm fatigue and difficulty in determining the most pressing issue. That’s where DevOps Guru steps in and comes to the rescue.
DevOps...