Outliers
You should recall that an outlier is a data point that is different from the majority of data points. When visualized, this data point is far away from the rest—hence, the name outlier. For example, if you have a set of 12 numbers, of which 11 are between 1 and 6 and 1 has the value of 37, that data point will be an outlier because it is extremely different and far away from the rest of the data points.
Boxplots are a type of visualization that are great for visualizing outliers. They provide us with a lot of information about our data, such as the median, the first quartile, the third quartile, the minimum and maximum values, as well as the existence of outliers.
Let's do a quick exercise based on the example of 12 numbers to understand how to spot an outlier from a boxplot.
Exercise 10.02: Identifying Outliers
In this exercise, you will create a small DataFrame with only 12 rows, each consisting of a random number. You will then plot this column...