Technical requirements
In this chapter, we will use pandas, matplotlib, and the open source Python library Featuretools. You can install Featuretools with pip:
pip install featuretools
Alternatively, you can install it with conda:
conda install -c conda-forge featuretools
Make sure you have Featuretools version 1.14.0 or greater to run this notebook. The code was tested using versions 1.14.0 and 1.15.0.
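To confirm that the installed version meets this requirement, a quick check along the following lines may help. This is a minimal sketch using only the standard library; the helper names parse_version and meets_minimum are illustrative and not part of Featuretools.

```python
from importlib import metadata

# Minimum version stated in the text above.
MIN_VERSION = (1, 14, 0)


def parse_version(text):
    """Turn a string such as '1.15.0' into a tuple such as (1, 15, 0).

    Only the leading digits of the first three dot-separated parts are
    used, so a suffix such as 'rc1' is ignored.
    """
    parts = []
    for piece in text.split(".")[:3]:
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        parts.append(int(digits or 0))
    while len(parts) < 3:
        parts.append(0)
    return tuple(parts)


def meets_minimum(text, minimum=MIN_VERSION):
    """Return True if the version string is at least the minimum."""
    return parse_version(text) >= minimum


try:
    installed = metadata.version("featuretools")
except metadata.PackageNotFoundError:
    print("featuretools is not installed")
else:
    status = "OK" if meets_minimum(installed) else "older than 1.14.0"
    print(f"featuretools {installed}: {status}")
```

If the check reports an older version, upgrading with pip or conda before running the chapter's code should resolve it.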
Note
We will work with the Online Retail II dataset from the UCI Machine Learning Repository: https://archive.ics.uci.edu/ml/datasets/Online+Retail+II. Dua, D. and Graff, C. (2019). UCI Machine Learning Repository (http://archive.ics.uci.edu/ml). Irvine, CA: The University of California, School of Information and Computer Science.
To download the Online Retail II dataset, follow these steps:
- Go to https://archive.ics.uci.edu/ml/machine-learning-databases/00502/.
- Click on online_retail_II.xlsx to download the data.
- Save online_retail_II.xlsx to the folder...
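Once the steps above are done, a short check such as the following can confirm that the file is where you expect it and that the download completed. This is only a sketch: the check_download helper is hypothetical, and the "." folder is a placeholder that you should replace with the folder where you saved the file. It relies on the fact that an .xlsx file is a ZIP archive, so a truncated download usually fails the ZIP check.

```python
import zipfile
from pathlib import Path

FILENAME = "online_retail_II.xlsx"


def check_download(folder):
    """Return the dataset path if the file exists and is a readable
    ZIP archive (the container format of .xlsx files), else None."""
    path = Path(folder) / FILENAME
    if path.is_file() and zipfile.is_zipfile(path):
        return path
    return None


# Placeholder location (an assumption): replace "." with your own folder.
found = check_download(".")
if found:
    print(f"Found {found}")
else:
    print(f"{FILENAME} is missing or incomplete")
```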