Exploring the Dataset
Real-life applications are crucial for cementing knowledge. This chapter therefore works through a real-life case study involving a classification task, applying the key steps that you learned about in the previous chapter in order to select the best-performing model.
To accomplish this, we will use the Census Income dataset, which is available from the UC Irvine Machine Learning Repository.
Note
The dataset that will be used in the following section, as well as in this chapter's activities, can be found in this book's GitHub repository at https://packt.live/2xUGShx.
Citation: Dua, D. and Graff, C. (2019). UCI Machine Learning Repository [http://archive.ics.uci.edu/ml]. Irvine, CA: University of California, School of Information and Computer Science.
You can download the dataset from this book's GitHub repository. Alternatively, to download the dataset from the original source, follow these steps:
- Visit the...