Using the DataExplorer package
The DataExplorer
R package is created to streamline the majority of data management and visualization responsibilities during the EDA process. EDA is a critical and primary stage in data analysis, during which analysts take their initial glimpse at the data to formulate meaningful hypotheses and determine subsequent action.
DataExplorer
provides a variety of functions to do the following:
- Scan and analyze data variables: The package can automatically scan and analyze each variable in a dataset, identifying its type, data distribution, outliers, and missing values.
- Visualize data:
DataExplorer
provides a variety of visualization functions to help analysts understand the relationships between variables and identify patterns in the data. These functions include histograms, scatter plots, box plots, heat maps, and correlation matrices. - Transform data:
DataExplorer
also provides functions to transform data, such as converting categorical...