Hands-on – ingesting streaming data
Earlier in this chapter, we looked at two options for ingesting streaming data into AWS, namely Amazon Kinesis and Amazon MSK. AWS provides an open-source solution for streaming sample data to Amazon Kinesis; therefore, in this section, we will use the Amazon Kinesis service to ingest streaming data. To generate streaming data, we will use the AWS open-source Amazon Kinesis Data Generator (KDG).
In this section, we will perform the following tasks:
- Configure Amazon Kinesis Data Firehose to ingest streaming data, and write the data out to Amazon S3.
- Configure Amazon KDG to create mock streaming data.
To get started, let’s configure a new Kinesis Data Firehose instance to ingest streaming data and write it out to our Amazon S3 data lake.
Configuring Kinesis Data Firehose for streaming delivery to Amazon S3
Kinesis Data Firehose is designed to enable you to easily ingest data from streaming sources...