Querying the DynamoDB data using AWS EMR
In the previous recipe, we saw how to access DynamoDB data from AWS EMR. In this recipe, we will see how to query that data using AWS EMR.
Getting ready
To perform this recipe, you should have completed the previous recipe and still have your EMR cluster running.
How to do it…
Here, we will use productHiveTable, which we created in the previous recipe. In this recipe, we will see how easy it is to query the DynamoDB data using EMR:
- To get started, connect to your EMR cluster and start Hive.
- In our e-commerce application, we would like to query the product catalogue data in various ways. Because DynamoDB is a NoSQL database, its Query operation works only on hash and range keys (or on secondary indexes), which sometimes makes ad hoc querying difficult. With Hive, we can query the DynamoDB data using SQL-like statements instead.
- Let's start with our first query to count the total number of products in our DynamoDB table. For this, we need to execute the following query:
hive>...
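As a rough illustration of the kind of statement this step calls for, a count can be sketched in HiveQL as follows. This is a minimal sketch, not necessarily the exact query used in this recipe: it assumes that productHiveTable is the external Hive table mapped to the DynamoDB table in the previous recipe.

```sql
-- Sketch only: assumes productHiveTable is the external Hive table
-- mapped to the DynamoDB product table in the previous recipe.
-- Counts every item in the DynamoDB table through the Hive mapping.
SELECT COUNT(*) FROM productHiveTable;
```

Hive answers a query like this by reading the whole DynamoDB table, so it consumes the table's provisioned read capacity and can take a while on large tables.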