Summary
In this chapter, you started with an understanding of why to choose the cloud for big data analytics. You learned about Amazon EMR, the AWS Hadoop offering in the cloud, for which AWS also launched a serverless offering in 2021. You learned about EMR clusters, file systems, and security.
Further in this chapter, you were introduced to one of the most important services in the AWS stack: AWS Glue. You learned about the high-level components that comprise AWS Glue, such as the AWS Glue console, the AWS Glue Data Catalog, AWS Glue crawlers, and AWS Glue code generators. You then learned how these components are connected and how they can be used. Finally, you learned recommended best practices for architecting and implementing AWS Glue, as well as when to choose Glue and when to choose EMR.
Real-time insights are becoming essential to customer experience, and you learned about handling streaming data in the cloud. You learned about the AWS streaming data offering, Amazon Kinesis, and different...