Graphical EDA
You can include as Graphical Exploratory Data Analysis (Graphical EDA) any kind of technique that implies visualizing your data following some kind of conventional system. This can involve representing it on a Cartesian plot or following polar coordinates; the main point here is that you do not rely on any kind of summary, but just on your eyes to read the story your data has to tell.
Visualizing a variable distribution
One of the first things you can do when performing graphical EDA is to look at the distribution of your data, one variable at a time. To do that, two main types of plot come in to help:
- Histogram
- Boxplot
Histogram
A histogram is a special kind of bar plot, having on the x axis the unique values shown from your variable, or some kind of clusterization of these values, and on the y axis the frequency of these values. Let's plot a histogram for each of our three variables within the cash_flow_report
. We are going to use ggplot
, which somebody told me you should already...