Producing histograms
Histograms can help you inspect the distribution of any variable quickly. Histograms bin the observations into groups and present the count of observations in each group on a bar chart. It is a powerful and easy way to examine issues with your data as well as inspect visually if the data follows some known distribution.
In this recipe, we will produce histograms of all the prices from our dataset.
Getting ready
To run this recipe, you will need pandas
and SQLAlchemy
to retrieve the data. Matplotlib
and Seaborn
handle the presentation layer. Matplotlib
is a 2D library for scientific data presentation. Seaborn builds on Matplotlib
and provides an easier way to produce statistical charts (such as histograms, among others).
How to do it…
We assume that the data can be accessed from the PostgreSQL database. Check the Storing and retrieving from a relational database recipe from Chapter 1, Preparing the Data. The following code will produce a histogram of the prices and...