In this section, we will attempt to clean and pre-process the dataset in our current project. This process can also be called exploratory data analysis. In general, the term exploratory data analysis denotes the process of exploring and analyzing a dataset at the same time.
As we have said before, in an iterative development process with data, we need to take incremental steps to learn about the specifics of a dataset and, from there, know how to analyze it better.
For example, a dataset attribute that contains continuous numerical values (such as length or area) should be handled differently than a discrete attribute (such as age or number of siblings) or even categorical data (such as city, country, or gender). In this case, we will apply various cleaning and pre-processing techniques to the attributes in our dataset per their data types.
Data...