Performing univariate analysis using a histogram
When visualizing one numeric variable in our dataset, there are various options to consider, and the histogram is one of them. A histogram is a bar graph-like representation that provides insights into our dataset’s underlying frequency distribution, usually a continuous dataset. The x-axis of a histogram represents continuous values that have been split into bins or intervals while the y-axis represents the number or percentage of occurrences for each bin.
With the histogram, we can quickly identify outliers, data spread, skewness, and more.
In this recipe, we will explore how to create histograms in seaborn
. The histplot
method in seaborn
can be used for this.
Getting ready
In this chapter, we will work with two datasets: the Amsterdam House Prices Data and the Palmer Archipelago (Antarctica) Penguins data, both from Kaggle.
Create a folder for this chapter and create a new Python script or Jupyter Notebook file...