EKS Observability
Throughout the book, we’ve looked at how you build EKS clusters and deploy workloads. However, a critical part of any EKS deployment is observability. Observability is the ability to interpret logs and metrics from your cluster/workloads without which you can’t troubleshoot/resolve issues or understand capacity or performance. Observability also includes tracing, which allows you to follow a request as it moves through different EKS workloads (microservices), simplifying troubleshooting in a distributed system.
In this chapter, we are going to discuss the tools and techniques you can use to monitor your clusters and workloads natively on AWS or using third-party tools. We will cover the following topics:
- Monitoring clusters and Pods using native AWS tools
- Building dashboards with Managed Service for Prometheus and Grafana
- Tracing with OpenTelemetry
- Using machine learning with DevOps Guru