Troubleshooting on-prem Kubernetes clusters
If you come from a systems administration or infrastructure background, troubleshooting Kubernetes clusters will come fairly naturally. At the end of the day, a Kubernetes cluster consists of servers, networking, infrastructure, and APIs, which are essentially what infrastructure engineers work on day to day.
If you’re a developer, some of these concepts, such as troubleshooting networks, may be new to you. However, you’ll already be familiar with other troubleshooting techniques, such as reading and analyzing logs.
The whole idea of troubleshooting a Kubernetes cluster is to look at two pieces:
- The cluster itself
- The Pods running inside the cluster
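As a rough illustration of these two pieces, the sketch below checks each one in turn: which nodes are not reporting Ready, and which Pods are not in a Running or Succeeded phase. It assumes `kubectl` is installed and pointed at the cluster; the function names (`not_ready_nodes`, `unhealthy_pods`, `kubectl_json`, `main`) are hypothetical helpers made up for this example, and a Pod phase of Running is only a coarse signal, not proof that the application is healthy.

```python
import json
import subprocess


def not_ready_nodes(node_list: dict) -> list:
    """Return names of nodes whose Ready condition is not "True"."""
    bad = []
    for node in node_list.get("items", []):
        conditions = node.get("status", {}).get("conditions", [])
        ready = any(
            c.get("type") == "Ready" and c.get("status") == "True"
            for c in conditions
        )
        if not ready:
            bad.append(node["metadata"]["name"])
    return bad


def unhealthy_pods(pod_list: dict) -> list:
    """Return "namespace/name (phase)" for Pods not Running or Succeeded."""
    bad = []
    for pod in pod_list.get("items", []):
        phase = pod.get("status", {}).get("phase", "Unknown")
        if phase not in ("Running", "Succeeded"):
            meta = pod["metadata"]
            namespace = meta.get("namespace", "default")
            bad.append(f"{namespace}/{meta['name']} ({phase})")
    return bad


def kubectl_json(*args: str) -> dict:
    """Run a kubectl command with JSON output and parse the result."""
    result = subprocess.run(
        ["kubectl", *args, "-o", "json"],
        check=True,
        capture_output=True,
        text=True,
    )
    return json.loads(result.stdout)


def main() -> None:
    # Piece 1: the cluster itself (node health).
    print("Nodes not Ready:", not_ready_nodes(kubectl_json("get", "nodes")))
    # Piece 2: the Pods running inside the cluster.
    pods = kubectl_json("get", "pods", "--all-namespaces")
    print("Pods needing attention:", unhealthy_pods(pods))
```

Calling `main()` against a live cluster prints both lists. Keeping the parsing separate from the `kubectl` call means the two checks can be tried on saved JSON output without access to the cluster.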
The cluster itself, including networking, servers, operating systems, and scalability, is best approached from an infrastructure perspective, where something such as the Certified Kubernetes Administrator (CKA) comes...