Overview of AWS Data Lake solution
A data lake is a new architectural pattern that is a popular way to store and analyze data as it allows enterprises to easily ingest and store data in any format, both structured and unstructured. A modern data lake provides more agility and flexibility than traditional management systems and allows businesses to store all their data, structured and unstructured, in a central repository.
In Chapter 2, Exploring Any Data, we looked at various services that make up an AWS big data ecosystem. These services are building blocks for a data lake and are broadly classified into four major categories: collect, store, analyze, and orchestrate.
To jump start the build-out of a new data lake, AWS offers a data lake solution that has the key building blocks already packaged and deployed, along with an intuitive web application. This pre-packaged implementation allows customers to quickly realize the data lake concept and put a real web interface in front of the data...