Setting up your environment
Before we begin our chapter, let’s take the time to set up our working environment.
Databricks
As with many other chapters, this one assumes you have a working installation of Python 3.6 and the tooling from the preceding chapters in your development environment. We also assume that you have an AWS account and have set up Databricks with that account.
Databricks CLI
The first step is to install the databricks-cli tool using the pip Python package manager:
pip install databricks-cli
Let’s validate that everything has been installed correctly. If this command produces the tool version, then everything is working correctly:
databricks -v
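If you would rather run this check from a setup script, the same idea can be sketched in Python. This is a minimal sketch, not part of the Databricks tooling: the helper name cli_version is our own, and it assumes the executable is named databricks and is on your PATH.

```python
import shutil
import subprocess


def cli_version(executable="databricks", flag="-v"):
    """Return the text printed by `<executable> <flag>`, or None if unavailable.

    Hypothetical helper for illustration; the default executable name and
    flag mirror the command used in this chapter.
    """
    # Bail out early if the executable cannot be found on PATH.
    if shutil.which(executable) is None:
        return None
    try:
        result = subprocess.run(
            [executable, flag],
            capture_output=True,
            text=True,
            timeout=30,
            check=False,
        )
    except (OSError, subprocess.TimeoutExpired):
        return None
    if result.returncode != 0:
        return None
    # Some tools print their version to stderr rather than stdout.
    return (result.stdout or result.stderr).strip() or None


if __name__ == "__main__":
    version = cli_version()
    print(version if version else "databricks CLI not found on PATH")
```

The function returns None instead of raising, so a setup script can decide for itself whether a missing CLI is fatal.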
Now let’s set up authentication. First, go into the Databricks UI and generate a personal access token. The following command will ask for the host created for your Databricks instance and the created token:
databricks configure --token
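The configure command stores the host and token in a profile file, by default ~/.databrickscfg under a [DEFAULT] section. As a quick sanity check, the following sketch reads that file back and confirms both values are present. The function name read_databricks_profile is hypothetical, and the sketch assumes the default file location and the host and token key names.

```python
import configparser
from pathlib import Path


def read_databricks_profile(path=None, profile="DEFAULT"):
    """Return {'host': ..., 'token': ...} for a profile, or None if incomplete.

    Hypothetical helper for illustration; assumes the INI layout written by
    `databricks configure --token`.
    """
    if path is None:
        path = Path.home() / ".databrickscfg"
    # Disable interpolation so a '%' in a value is not treated specially.
    parser = configparser.ConfigParser(interpolation=None)
    if not parser.read(path):
        return None  # file missing or unreadable
    if profile not in parser:
        return None
    section = parser[profile]
    host = section.get("host")
    token = section.get("token")
    if not host or not token:
        return None
    return {"host": host, "token": token}


if __name__ == "__main__":
    cfg = read_databricks_profile()
    if cfg is None:
        print("No complete Databricks profile found")
    else:
        # Never print the token itself; report only that it is set.
        print(f"host={cfg['host']} token=<set>")
```

Treat the token like a password: the sketch reports only whether it is set and never echoes its value.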
We can quickly determine whether...