Exploring the data
We will use the following Python modules in the next example:
- pandas: Python package for data analysis and data manipulation.
- NumPy: Python library that adds support for large, multi-dimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.
- Statsmodels: Python package that provides a complement to SciPy for statistical computations, including descriptive statistics and estimation and inference for statistical models. It provides classes and functions for the estimation of many different statistical models.
- Seaborn and Matplotlib: Python packages for effective data visualization.
We’ll get started using the following steps:
- The following block of code will load all the required packages, as well as load the data and show the first five rows of it. For readability purposes, we will limit the maximum number of rows to be shown to 20, set the limit of maximum...
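As a minimal sketch of the part of this step that is fully described (limiting the displayed rows to 20, loading the data, and showing the first five rows), something like the following could be used. The dataset here is a hypothetical in-memory stand-in, with made-up column names, because the chapter's actual file is not shown in this passage. Only pandas is imported to keep the sketch self-contained; the other packages listed above would be imported alongside it.

```python
import io

import pandas as pd

# Limit how many rows pandas prints, as described in the step above.
pd.set_option("display.max_rows", 20)

# Hypothetical stand-in data: the chapter's real dataset is not shown here,
# so a small in-memory CSV with invented columns is used for illustration.
csv_text = """customer_id,channel,spend
1,web,20.5
2,store,35.0
3,web,12.25
4,app,48.0
5,store,9.75
6,app,27.5
"""

# Load the data into a DataFrame (read_csv accepts a file path just as well).
data = pd.read_csv(io.StringIO(csv_text))

# Show the first five rows of the loaded data.
first_rows = data.head()
print(first_rows)
```

With a real file, the `io.StringIO(csv_text)` argument would simply be replaced by the path to that file.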