Technical requirements
The tooling used in this chapter is tied to the tech stack chosen for the book. All vendors should offer a free trial account.
I will be using the following:
- Databricks
- AWS
- Tableau Desktop
- Python
- SQL
- dbt
Setting up your environment
Before we begin our chapter, let’s take the time to set up our working environment.
Python, AWS, and Databricks
As we have with previous chapters, this chapter assumes you have a working version of Python of 3.6 or above release installed in your development environment. We will also assume you have set up an AWS account and Databricks with that account.
Databricks CLI
The Databricks CLI is used to create our Databricks infrastructure; before we can create anything, we must first make sure it’s set up correctly.
Installation and setup
The first step is to install the databricks-cli
tool using the pip python
package manager:
pip install databricks-cli
Let...