Summary statistics
Summary statistics include the mean, quartiles, and standard deviation. The .describe
method will calculate these measures on all of the numeric columns in a DataFrame.
How to do it…
- Load the dataset:
>>> import pandas as pd >>> import numpy as np >>> fueleco = pd.read_csv("data/vehicles.csv.zip") >>> fueleco barrels08 barrelsA08 ... phevHwy phevComb 0 15.695714 0.0 ... 0 0 1 29.964545 0.0 ... 0 0 2 12.207778 0.0 ... 0 0 3 29.964545 0.0 ... 0 0 4 17.347895 0.0 ... 0 0 ... ... ... ... ... ... 39096 14.982273 0.0 ... 0 0 39097 14.330870 0.0 ... 0 0 39098 15.695714 0.0 ... 0 0 39099 15.695714 0.0 ... 0 0 39100...