Setting up your environment
Before we begin this chapter, let’s take some time to set up our working environment.
Databricks
As in the previous chapters, this chapter assumes you have a working version of Python 3.6 or above installed in your development environment. It also assumes you have set up an AWS account and that you have set up Databricks with that AWS account.
Databricks CLI
The first step is to install the databricks-cli
tool using the pip
Python package manager:
pip install databricks-cli
Let’s validate that everything has been installed correctly. If the following command produces the tool’s version, then everything is working correctly:
Databricks -v
Now, let’s set up authentication. First, go into the Databricks UI and generate a personal access token. The following command will ask for the host that was created for your Databricks instance and the token that was created:
databricks configure --token
We can...