Summary
In this chapter, we explored what a data lake is and how it can help a large-scale organization. We also looked at the components and characteristics of a successful data lake. Additionally, we looked at the ultimate data lake (Google) and why its functionality is difficult to replicate. Even so, we explored what can be done to optimize the architecture of a data lake. Finally, we delved into the different metrics that can be tracked to keep your data lake under control.
In the next chapter, we will look at a variety of patterns that facilitate the creation of resilient applications and systems, covering availability, reliability, and scalability patterns.