Troubleshooting and logging in your EMR cluster
An Amazon EMR cluster has several components, such as open source software, custom application code, and AWS integrations, which can contribute to cluster failures or can take longer than expected to complete defined jobs. In this section, you will learn how you can troubleshoot these failures and what fixes can be applied.
When you are starting to implement big data applications in an EMR cluster, it's recommended to enable debugging on the cluster and also take a step-by-step approach to test your application with a smaller subset of data, which might help in debugging failures.
Let's dive deep into a few troubleshooting aspects that can help.
Tools available to debug your EMR cluster
We can divide the set of tools available for troubleshooting into the following three categories:
- Tools that display cluster details
- Tools to view cluster or application logs
- Tools that can be used to monitor cluster...