Installing and configuring Apache Airflow
Apache Airflow performs the same role as Apache NiFi; however, it allows you to create your data flows using pure Python. If you are a strong Python developer, this is probably an ideal tool for you. It is currently one of the most popular open source data pipeline tools. What it lacks in a polished GUI – compared to NiFi – it more than makes up for in the power and freedom to create tasks.
Apache Airflow can be installed using pip. Before installing it, however, you can change the location of the Airflow install by exporting AIRFLOW_HOME. If you want Airflow to install to /opt/airflow, export the AIRFLOW_HOME variable, as shown:
export AIRFLOW_HOME=/opt/airflow
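To double-check which directory will be used before installing, a small Python check along the following lines may help. This is only a sketch: the helper name resolve_airflow_home is hypothetical and not part of Airflow, and the ~/airflow fallback simply mirrors the default location described in this section.

```python
import os
from pathlib import Path


def resolve_airflow_home(env=None):
    """Return the directory that would serve as the Airflow home.

    Hypothetical helper for illustration only; not part of Airflow's API.
    """
    # Use the real process environment unless a mapping is passed in.
    env = os.environ if env is None else env
    # Fall back to ~/airflow when AIRFLOW_HOME is not set.
    raw = env.get("AIRFLOW_HOME", "~/airflow")
    # Expand a leading ~ to the current user's home directory.
    return Path(raw).expanduser()


if __name__ == "__main__":
    print(resolve_airflow_home())
```

Running the script in the same shell where you exported the variable should print /opt/airflow; in a fresh shell without the export, it should print the airflow directory under your home directory.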
The default location for Airflow is ~/airflow, and for this book, this is the location I will use. The next consideration before installing Airflow is to determine which sub-packages you want to install. If you do not specify any, Airflow...