Leveraging cloud object storage
Cloud object storage has become one of the most cost-effective ways to store large volumes of data. These storage services are now the foundation for data lakes and lakehouses, which support data engineering, data science, and data analysis workloads.
Each cloud provider offers its own object storage service, and you can pick whichever works best for you. On Azure, the storage service dedicated to big data analytics is Azure Data Lake Storage Gen2 (ADLS Gen2).
To create an ADLS Gen2 storage account to use throughout this book, follow the directions at this link: https://learn.microsoft.com/en-us/azure/storage/blobs/create-data-lake-storage-account.
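Once the storage account exists, analytics engines typically address data in it through an ABFS URI of the form abfss://<container>@<account>.dfs.core.windows.net/<path>. The following is a minimal sketch of assembling such a URI in Python. The helper name abfss_uri and the account and container names are hypothetical placeholders chosen for illustration, not values from this book.

```python
def abfss_uri(account: str, container: str, path: str = "") -> str:
    """Build an abfss:// URI for a location in an ADLS Gen2 storage account.

    `account` and `container` are placeholders; substitute the names you
    chose when creating your own storage account and container.
    """
    # Strip any leading slash so the URI does not contain a double slash.
    return f"abfss://{container}@{account}.dfs.core.windows.net/{path.lstrip('/')}"


# Hypothetical names, for illustration only.
example_uri = abfss_uri("mystorageacct", "lakehouse", "/raw/events")
print(example_uri)
```

Keeping the URI construction in one small function means that only one place needs to change if you later rename the account or container.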
Using Databricks
Databricks is a platform that simplifies the work of data engineers, data scientists, and data analysts by combining the capabilities those personas require into a single managed service. Because it is a managed service, you will not have to install components of your environment separately...