Technical requirements
In this chapter, we will implement a real-time streaming pipeline using AWS analytics services. So, before getting started, you need to make sure that you have the following requirements ready:
- An AWS account with access to create Amazon S3, Amazon EMR, Amazon Athena, Amazon Cognito, and AWS Glue Catalog resources.
- An IAM user who has access to create IAM roles, which will be used to trigger AWS CloudFormation stack or execute jobs.
Refer to the following link for access to the book's GitHub repository: https://github.com/PacktPublishing/Simplify-Big-Data-Analytics-with-Amazon-EMR-/tree/main/chapter_10.
Now, let's dive deep into the use case and the hands-on implementation steps.
Check out the following video to see the Code in Action at https://bit.ly/3oIz89Q