Interactive Analytics
In this chapter, we will look at the following key topics:
- Analytics using Amazon Athena
- Analytics using Presto, Trino, and Hive on Amazon EMR
One of the fundamental principles of building a modern data architecture on AWS is using purpose-built tools to solve specific use cases. An enterprise data platform, once fully built, has many components, each designed to solve a particular business use case.
In Chapter 2, Scalable Data Lakes, we went through the fundamentals of building a data lake on AWS using Amazon S3 as the storage layer and the AWS Glue Data Catalog as the technical metadata layer. Each layer of the data lake holds data that may be useful to different personas in an organization. The most basic ask from each of these personas is the ability to query datasets in the data lake using SQL so that they can derive insights from the data. Interactive analytics, using specific...