Creating Plots that Present Global Patterns in Data
In this section, we will study the context of plots that present global patterns in data, such as:
- Plots that show the variance in individual features in data, such as histograms
- Plots that show how different features present in data vary with respect to each other, such as scatter plots, line plots, and heatmaps
Most data scientists prefer to see such plots because they give an idea of the entire spectrum of values taken by the features of interest. Plots depicting global patterns are also useful because they make it easier to spot anomalies in data.
We will work with a dataset called mpg
. It was published by the StatLib library, maintained at Carnegie Mellon University, and is available in the seaborn
library. It was originally used to study the relationship of mileage – Miles Per Gallon (MPG) – with other features in the dataset; hence the name mpg
. Since the dataset contains 3 discrete features...