Part 1: Fundamentals of Data Ingestion
In this part, you will be introduced to the fundamentals of data ingestion and data engineering, covering the basic definition of an ingestion pipeline, the common types of data sources, and the technologies involved.
This part has the following chapters:
- Chapter 1, Introduction to Data Ingestion
- Chapter 2, Principles of Data Access – Accessing Your Data
- Chapter 3, Data Discovery – Understanding Our Data Before Ingesting It
- Chapter 4, Reading CSV and JSON Files and Solving Problems
- Chapter 5, Ingesting Data from Structured and Unstructured Databases
- Chapter 6, Using PySpark with Defined and Non-Defined Schemas
- Chapter 7, Ingesting Analytical Data