Part 2: Big Data Stack
In this part, you will dive into the core technologies that make up the modern data stack, a set of tools and architectures designed for building robust and scalable data pipelines. You will gain a solid understanding of the Lambda architecture and its components, and get hands-on experience with big data tools such as Apache Spark, Apache Airflow, and Apache Kafka.
This part contains the following chapters:
- Chapter 4, The Modern Data Stack
- Chapter 5, Big Data Processing with Apache Spark
- Chapter 6, Apache Airflow for Building Pipelines
- Chapter 7, Apache Kafka for Real-Time Events and Data Ingestion