Chapter 3: Common Use Cases and Architecture Patterns
This chapter provides an overview of common use cases and architecture patterns for Amazon Elastic MapReduce (EMR) and shows how EMR integrates with other AWS services to address them. The use cases include batch Extract, Transform, and Load (ETL), real-time streaming, clickstream analytics, interactive analytics with machine learning (ML), genomics data analysis, and log analytics.
This overview should give you a starting point for understanding which problems you can solve with Amazon EMR and for applying it to your own real-world big data use cases.
We will dive deep into the following topics in this chapter:
- Reference architecture for batch ETL workloads
- Reference architecture for clickstream analytics
- Reference architecture for interactive analytics and ML
- Reference architecture for real-time streaming analytics
- Reference architecture for genomics data analytics
- Reference architecture...