Developing a Stream Processing Solution
Chapter 5, Developing a Batch Processing Solution, built on the foundation of Spark, Azure Data Factory (ADF), and Synapse pipelines to show you how to process large amounts of data effectively.
In this chapter, you will cover another approach to processing data: stream processing, also known as real-time processing. Like batch processing, stream processing is an important segment of data pipelines. This chapter introduces the concepts and technologies involved in building a stream processing system, including Azure Stream Analytics (ASA), Azure Event Hubs, and Spark (from a streaming perspective).
You will learn how to build end-to-end streaming solutions using these technologies and explore important streaming concepts such as checkpointing, windowed aggregates, replaying older stream data, handling drift, and stream management concepts such as distributing streams across...