Chapter 1: Creating an Azure Databricks Service
Azure Databricks is a high-performance Apache Spark-based platform that has been optimized for the Microsoft Azure cloud.
It offers three environments for building and developing data applications:
- Databricks Data Science and Engineering: This provides an interactive workspace that enables collaboration between data engineers, data scientists, machine learning engineers, and business analysts and allows you to build big data pipelines.
- Databricks SQL: This allows you to run ad hoc SQL queries on your data lake and supports multiple visualization types to explore your query results.
- Databricks Machine Learning: Provides end-to-end machine learning environment for feature development, model training , experiment tracking along with model serving and management.
In this chapter, we will cover how to create an Azure Databricks service using the Azure portal, Azure CLI, and ARM templates. We will learn about different...