Further reading
Following are a few resources you can refer to for further reading:
- Lambda function for transient EMR cluster use cases: https://docs.aws.amazon.com/prescriptive-guidance/latest/patterns/launch-a-spark-job-in-a-transient-emr-cluster-using-a-lambda-function.html
- More on AWS Glue crawler definition and execution: https://docs.aws.amazon.com/glue/latest/dg/add-crawler.html
- Optimize Spark performance in EMR: https://docs.aws.amazon.com/emr/latest/ReleaseGuide/emr-spark-performance.html