Part 1 – ETL tools in Python
In your local environment, open the Powerful_ETL_Tools_In_Python.ipynb
file using jupyter notebook
in the command line in your PyCharm terminal.
Bonobo
Bonobo (https://www.bonobo-project.org/) is a Python-based Extract, Transform, Load (ETL) framework that uses a simple and rather elegant approach to pipeline construction. Bonobo treats any callable (i.e., function) or iterable object in Python as a node, which the module can then organize into graphs and structures to execute each object with simplicity. Bonobo makes it incredibly easy to build, test, and deploy pipelines, which allows you to focus on the business logic of your pipeline and not the underlying infrastructure.
Figure 8.2: Bonobo is the Swiss Army knife for everyday data
Installing and using Bonobo in your environment
In your PyCharm terminal, install Bonobo using your pipenv
environment with the following command:
pipenv install bonobo
Head...