Setting up your environment
Before we begin our chapter, let’s take the time to set up our working environment.
The Databricks CLI
The first step is to install the databricks-cli
tool using the pip
Python package manager:
pip install databricks-cli
Let’s validate that everything has been installed correctly. If this command produces the tool version, then everything is working correctly:
Databricks -v
Now let’s set up authentication. First, go into the Databricks UI and generate a personal access token (PAT). The following command will ask for the host created for your Databricks instance and the created token:
databricks configure --token
We can quickly determine whether the CLI is set up correctly by running the following command, and if no error is returned, you have a working setup:
databricks fs ls
Git
Git will be used in this chapter and there are many ways to install it. I would recommend using https://git-scm.com/download...