Exploring continuous data
In the Exploring categorical data recipe, we defined both categorical and continuous data but explored only the former. The vehicles dataset we used in that recipe has a good mix of both types of data (as most datasets do), so we will reuse it here and shift our focus to continuous data.
Before going through this recipe, I advise you to get familiar with the techniques shown in the Plotting distributions of non-aggregated data recipe. The actual plotting calls will be the same, but this recipe applies them to a more “real-world” dataset instead of artificially created data.
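As a quick orientation before touching the real data, here is a minimal sketch of how you might separate the continuous columns from the categorical ones and summarize them. The toy DataFrame and its column names are made up for illustration and are not taken from the vehicles dataset, and treating every numeric column as continuous is only a rough first filter.

```python
import pandas as pd

# Hypothetical toy data; these column names are illustrative only
toy = pd.DataFrame({
    "make": ["A", "B", "A", "C"],                 # categorical
    "mpg": [21.5, 30.0, 18.2, 25.1],              # continuous
    "weight": [3200.0, 2500.0, 4100.0, 2900.0],   # continuous
})

# Assumption: numeric dtype is a good-enough proxy for "continuous" here
continuous = toy.select_dtypes(include="number")

# Count, mean, standard deviation, and quartiles for each numeric column
summary = continuous.describe()
print(summary)
```

On a real dataset you would still want to inspect the selected columns by hand, since integer codes and identifiers are numeric without being continuous.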
How to do it
Let’s start by loading the vehicles dataset:
df = pd.read_csv(
    "data/vehicles.csv.zip",
    dtype_backend="numpy_nullable",
    dtype={
        "rangeA": pd.StringDtype(),
        "mfrCode": pd.StringDtype(),
        "c240Dscr...