Summary
Over the course of this chapter, we got an overview of how to monitor cluster and job activities using a cluster's application interfaces, cluster metrics, and the CloudWatch console. We also saw how to enable auditing on cluster API activities using AWS CloudTrail.
Then, we dived deep into EMR cluster scaling capabilities, which includes EMR-managed scaling and autoscaling with custom policies. We also learned how they compare to each other.
Finally, we covered how to make our cluster highly scalable with multiple master nodes and what the supported applications are. We also learned how we can clone an existing cluster to replicate its configurations and steps.
That concludes this chapter! Hopefully, you got a good overview of monitoring, scaling, and high-availability aspects of the cluster, and in the next chapter, we can dive deep into security aspects of EMR.