The dataset that we're going to be using for this chapter is the Autistic Spectrum Disorder Screening Data for Children Dataset provided by the UCI Machine Learning Repository, which can be found here: https://archive.ics.uci.edu/ml/datasets/Autistic+Spectrum+Disorder+Screening+Data+for+Children++. This dataset contains records of 292 patients or children that have been screened for autism. This contains details of their age, ethnicity, and familial history of autism. We will be using this dataset to predict whether these patients actually have autism.
Now, we have to download the ZIP file present in the repository and extract the contents. In this zip file, we will find two files. The first file contains a description of the attributes present in the dataset. This file mentions the different features that we'll be using for predictions in this...