Hands-on – loading data into an Amazon Redshift cluster and running queries
In this hands-on exercise, we will create a new Redshift cluster and set up Redshift Spectrum so that we can query data in external tables on Amazon S3. We will then use Redshift Spectrum to read data from S3, load a subset of that data into a local table in Redshift, and run some complex queries.
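As a rough preview of the flow described above, the following Python sketch only builds the two kinds of SQL statements involved: one that registers an external (Spectrum) schema, and one that copies a subset of an external table into a local table. It does not connect to a cluster. Every name in it (the schema, database, table, and column names, the filter, and the IAM role ARN) is a hypothetical placeholder rather than a value from this exercise.

```python
# Minimal sketch: build the SQL for a Redshift Spectrum workflow.
# All identifiers and the role ARN below are hypothetical placeholders.


def external_schema_sql(schema: str, catalog_db: str, iam_role_arn: str) -> str:
    """Return SQL that maps a data catalog database to an external schema."""
    return (
        f"CREATE EXTERNAL SCHEMA IF NOT EXISTS {schema} "
        f"FROM DATA CATALOG DATABASE '{catalog_db}' "
        f"IAM_ROLE '{iam_role_arn}' "
        "CREATE EXTERNAL DATABASE IF NOT EXISTS;"
    )


def load_subset_sql(local_table: str, external_table: str,
                    columns: list[str], condition: str) -> str:
    """Return a CREATE TABLE AS statement that copies selected rows and
    columns from an external table into a local Redshift table."""
    column_list = ", ".join(columns)
    return (
        f"CREATE TABLE {local_table} AS "
        f"SELECT {column_list} FROM {external_table} "
        f"WHERE {condition};"
    )


if __name__ == "__main__":
    # Hypothetical values, for illustration only.
    print(external_schema_sql(
        "spectrum_schema",
        "example_catalog_db",
        "arn:aws:iam::123456789012:role/ExampleSpectrumRole",
    ))
    print(load_subset_sql(
        "public.listings_local",
        "spectrum_schema.listings",
        ["id", "name", "price"],
        "price IS NOT NULL",
    ))
```

The generated statements could then be run from any SQL client connected to the cluster. Keeping the statement text separate from execution makes it easy to review the SQL before anything touches the database.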
In this exercise, we will be setting up a Redshift cluster for a travel agency. Agents need to be able to find the best deals on accommodation in New York City and Jersey City that is close to specific popular tourist attractions, such as the Freedom Tower and the Empire State Building.
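To make the idea of "close to a tourist attraction" concrete, here is a small Python sketch that filters listings by great-circle distance from a landmark using the haversine formula. This is an illustration of the concept only, not the method used in the exercise. The landmark coordinates are approximate, and the sample listings are invented for demonstration.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius

# Approximate landmark coordinates as (latitude, longitude) in degrees.
EMPIRE_STATE_BUILDING = (40.7484, -73.9857)
FREEDOM_TOWER = (40.7127, -74.0134)


def haversine_km(lat1: float, lon1: float, lat2: float, lon2: float) -> float:
    """Great-circle distance in kilometers between two points."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    d_phi = phi2 - phi1
    d_lambda = math.radians(lon2 - lon1)
    a = (math.sin(d_phi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(d_lambda / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def listings_within(listings: list[dict], landmark: tuple[float, float],
                    radius_km: float) -> list[dict]:
    """Return listings within radius_km of the landmark, cheapest first."""
    nearby = [
        listing for listing in listings
        if haversine_km(listing["lat"], listing["lon"], *landmark) <= radius_km
    ]
    return sorted(nearby, key=lambda listing: listing["price"])


# Invented sample listings, for illustration only.
SAMPLE_LISTINGS = [
    {"name": "Midtown studio", "lat": 40.7505, "lon": -73.9934, "price": 180},
    {"name": "Midtown loft", "lat": 40.7480, "lon": -73.9870, "price": 150},
    {"name": "Jersey City flat", "lat": 40.7178, "lon": -74.0431, "price": 95},
]

if __name__ == "__main__":
    for listing in listings_within(SAMPLE_LISTINGS, EMPIRE_STATE_BUILDING, 1.0):
        print(listing["name"], listing["price"])
```

Sorting the filtered results by price mirrors the agents' goal of finding the best deal near a given attraction. In a data warehouse, the same filter would typically be expressed in SQL so that it runs where the data lives.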
Uploading our sample data to Amazon S3
For this exercise, we will use a dataset from an organization called Inside Airbnb (http://insideairbnb.com/about.html) that provides Airbnb data under the Creative Commons Attribution 4.0 International License (https://creativecommons...