Streaming Data Ingestion
In this chapter, we will look at the following key topics:
- The need for streaming architectures and its challenges
- Streaming data ingestion using Amazon Kinesis
- Streaming data ingestion using Amazon MSK
- Streaming services usage patterns
Chapter 3, Batch Data Ingestion, was all about batch data ingestion, where we saw multiple ways of ingesting data in batches. Batch data ingestion is still the bedrock of many data pipelines since it helps to serve so many business use cases. For many such use cases, data analytics can be performed with data that’s not fresh – that is, data is not available for consumption in the analytics environment as soon as it’s produced in the source system. For a very long time, deriving reactive insights from data was fine as OLAP systems were meant to perform analytics on data that was typically a day old.
However, data in these modern times gets generated in large volumes and moves...