Doing Automated EDA using AutoViz
AutoViz is an Automated EDA library used for the automatic visualization of datasets. Unlike the previous libraries, it is built on top of the matplotlib
library. It provides a wide array of visuals to summarize and analyze datasets to provide quick insights. The library does most of the heavy lifting and requires minimal user input.
The reports generated by the AutoViz library typically provide the following:
- Data cleaning suggestions: They provide insights into missing values, unique values, and outliers. They also provide suggestions on how to handle outliers, irrelevant columns, rare categories, columns with constant values, and more. This can be useful for data cleaning.
- Univariate analysis: They use histograms, density plots, and violin plots to provide insights into the distribution of the data, outliers, and more.
- Bivariate analysis: They use scatterplots, heatmaps, and pair plots to provide insights into the relationship...