Featuretools on a New Dataset
In this chapter, we have learned about Featuretools and how to build automated features using it. In the next activity, we will apply what we have learned to a new dataset. This dataset is a modified version of the adult dataset from the UCI Machine Learning Repository, Irvine, CA: University of California, School of Information and Computer Science, which can be found at https://packt.live/2Qr3ih6, in the adult.data
file. This dataset has various attributes of a working adult, such as age, occupation, education, and native. The task is to predict whether a particular adult will earn more than 50,000
in their yearly salary or not.
The details about the various attributes are available at the preceding link in the adult.names
file. This dataset has a mix of both categorical and numerical data and is a good dataset to try out what you have learned about Featuretools.