Troubleshooting, or debugging (more frequently used for specific software), is the act of trying to work out what's going wrong with a solution right now or what went wrong historically during an incident. That incident can be anything from the entire network going down in a data center, to working out why that one Docker container suddenly decided it was going to bounce itself up and down like a tennis ball.
No software is perfect. If it were, we'd all be out of jobs, and it's because of this simple statement of fact that you will almost inevitably find yourself in a situation at some point in your life where you're faced with a broken system and there's someone screaming at you to fix it.
You can do everything in your power before this point to try to ensure you never experience something such as a production outage, including...