The data lake, data warehouse, and data mart
To build a data architecture, an architect needs to understand the basic concepts of a data lake, a data warehouse, and a data mart, and the differences between them. In this section, we will cover the modern data architecture ecosystem and where each of the three fits into that landscape.
The following diagram depicts the landscape of a modern data architecture:
Figure 2.5 – Landscape of a modern data architecture
As we can see, various types of data are ingested into the data lake, where they land in the raw zone. The data lake consists of structured, semi-structured, and unstructured data ingested directly from the data sources. Data lakes also have a curated zone, which consists of cleansed, transformed, and sorted datasets that serve downstream data processing activities such as data analytics, advanced analytics, publishing as Data-as-a-Service, AI, and ML. The...