Monitoring Kubernetes Clusters
In the previous chapter, we looked at serverless computing and its manifestations on Kubernetes. A lot of innovation happens in this space and it's both super useful and fascinating to follow the evolution.
In this chapter, we're going to talk about how to make sure your systems are up and running, performing correctly, and how to respond to them when they aren't. In Chapter 3, High Availability and Reliability, we discussed related topics. The focus here is about knowing what's going on in your system and what practices and tools you can use.
The are many aspects to monitoring, such as logging, metrics, distributed tracing, error reporting, and alerting. Practices like auto-scaling and self-healing depend on monitoring to detect that there is a need to scale or to heal.
The topics we will cover in this chapter include:
- Understanding observability
- Logging with Kubernetes
- Recording metrics with...