Imagine you get a call from your company's network operations center at 2:00 a.m. The person on the other end says: "Hi, we are facing a difficult issue that is impacting production services. We suspect it might be network-related. Can you check this for us?" For this type of urgent, open-ended question, what is the first thing you would do? Most of the time, the first thought is: what changed between the time the network was working and the time something went wrong? We would check our monitoring tool to see whether any key metrics have changed in the last few hours. Better yet, we may already have received monitoring alerts for metrics that deviated from their normal baseline.
Throughout this book, we have been discussing various ways to make systematic, predictable changes to our network, with the aim of keeping the network running as smoothly as possible. However, networks are not...