Summary
In this chapter, we emphasized the need to choose the right architecture for future-proofing a business. This choice determines how quickly new use cases can be onboarded and how productive data personas are in exploring and executing them. Traditional data warehouses and data lakes each have their own strengths and weaknesses, and the lakehouse is a happy amalgamation of the two technologies.
Warehouses store data in closed, proprietary formats, whereas a lakehouse prescribes an open data format. Our recommendation is to use Delta, which we consider the best open source data format available today. Warehouses cater mostly to structured data, with some support for semi-structured data, whereas a lakehouse supports all kinds of data, including unstructured. Because a lakehouse is built on cloud storage, which is highly scalable, durable, and cost-effective, it is not only highly scalable but also much cheaper and more performant than its warehouse counterpart. A warehouse was designed...