Summary
Troubleshooting performance issues is a complex matter, as it can take a long time to diagnose and analyze them. Of the three major components in an environment – storage, compute, and memory – storage is the slowest. There is always going to be a mismatch in their performance, and any degradation in disk performance can impact the overall operation of the system.
Keeping this objective in mind, we divided this chapter into two sections. In the first section, we explained the most important metrics that you should understand before troubleshooting any issues. We discussed the time-related metrics related to storage devices, CPU wait averages, disk saturation, and disk utilization, and the different access patterns when reading from or writing to physical disks.
In the second part, we saw the different ways in which we can analyze the metrics highlighted in the first section. There are a lot of mechanisms available in Linux that can assist to identify potential...