Technical requirements
In this chapter, we will use pandas
, matplotlib
, and featuretools
open source Python libraries. You can install featuretools
with pip
:
pip install featuretools
Additionally, you can do so with conda
:
conda install -c conda-forge featuretools
These commands install the basic featuretools
functionality, but we can install add-ons for creating features with natural language processing (NLP) or for using dask
as the backend instead of pandas
. For more information on how to install featuretools
add-ons, including graphviz
, check out their documentation here: https://docs.featuretools.com/en/v0.16.0/getting_started/install.html.
We will work with the Online Retail II dataset from the UCI Machine Learning Repository, which is available at https://archive.ics.uci.edu/ml/datasets/Online+Retail+II and licensed under a Creative Commons Attribution 4.0 International (CC BY 4.0) license: https://creativecommons.org/licenses/by/4.0/legalcode. The corresponding...