Streaming data to Amazon Redshift via Amazon Kinesis Firehose
Streaming datasets are continuous datasets that originate from sources such as Internet of Things (IoT) devices, log files, and gaming systems. Ingesting streamed data into Amazon Redshift lets you run near-real-time analytics and combine the results with historical and operational data to produce actionable reporting. For example, in a manufacturing shop, analyzing data from several IoT sensors can help predict machinery failure and enable you to take preventive action.
In this recipe, we will simulate a streaming dataset using www.amazon.com product review data and ingest it into Amazon Redshift using Amazon Kinesis Firehose. Amazon Kinesis Firehose provides out-of-the-box integration to capture the streaming dataset and land it in an Amazon Redshift table.
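To make the idea of simulating a stream concrete, the following is a minimal Python sketch of a producer that pushes review records to a Firehose delivery stream. It is an illustration under stated assumptions, not the recipe's own tooling: the function names, the review fields, and the stream name are hypothetical, and newline-delimited JSON is just one common serialization that would need to match how the delivery stream loads data into the table. The `client` argument is assumed to be a boto3 Firehose client, whose `put_record_batch` call accepts at most 500 records per request.

```python
import json
from typing import Dict, Iterable, Iterator, List

# PutRecordBatch accepts at most 500 records per call.
MAX_BATCH = 500


def to_record(review: Dict) -> Dict:
    """Serialize one review as a newline-terminated JSON Firehose record."""
    return {"Data": (json.dumps(review) + "\n").encode("utf-8")}


def batches(records: Iterable[Dict], size: int = MAX_BATCH) -> Iterator[List[Dict]]:
    """Group records into lists of at most `size` items."""
    batch: List[Dict] = []
    for record in records:
        batch.append(record)
        if len(batch) == size:
            yield batch
            batch = []
    if batch:
        yield batch


def send_reviews(client, stream_name: str, reviews: Iterable[Dict]) -> int:
    """Send reviews in batches and return the number of rejected records.

    `client` is assumed to be created with boto3.client("firehose").
    """
    failed = 0
    for batch in batches(map(to_record, reviews)):
        response = client.put_record_batch(
            DeliveryStreamName=stream_name, Records=batch
        )
        # A batch can partially fail; FailedPutCount reports how many did.
        failed += response.get("FailedPutCount", 0)
    return failed


# Hypothetical usage (stream name and review fields are placeholders):
#   import boto3
#   client = boto3.client("firehose")
#   reviews = [{"product_id": "B000TEST", "star_rating": 5}]
#   send_reviews(client, "my-review-stream", reviews)
```

A non-zero return value means some records were rejected; a fuller producer would inspect the per-record responses and retry only those records.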
Getting ready
To complete this recipe, you will need the following:
- An Amazon Redshift cluster deployed in...