This chapter will introduce a dataset from the regression problem category and teach us how to use exploratory data analysis techniques to analyze this data. We will learn and use the exploratory data analysis techniques of the scatter plot, 6-plot, linear correlation plot, linear intercept plot, linear slope plot, and linear residual standard deviation plot. We will also explore and analyze a real world dataset called the Glass Identification dataset from UCI.
The following topics will be covered in this chapter:
- Introducing and reading data
- Cleaning and tidying up data
- Mapping and understanding the underlying structure of the dataset and identifying the most important variables
- Testing assumptions and hypothesis, estimating parameters, and calculating the margins of error
- Creating a list of outliers or other anomalies
- Uncovering...