Setting up your environment
Before we begin this chapter, let’s take some time to set up our working environment.
Python, AWS, and Databricks
As in previous chapters, this chapter assumes you have a working version of Python 3.6 or higher installed in your development environment. It also assumes you have set up an AWS account and have set up Databricks with that AWS account.
If you do not have a working Databricks setup, please refer to the following guide to get started: https://docs.databricks.com/en/getting-started/index.html.
Databricks CLI
The first step is to install the databricks-cli
tool using the pip
Python package manager:
pip install databricks-cli
Let’s validate that everything has been installed correctly. If the following command produces the tool’s version, then everything is working correctly:
Databricks –v
Now, let’s set up authentication. First, go into the Databricks UI and generate a personal access...