After cleaning the dataset, it is always recommended to visualize it. This helps us gain an understanding of the different variables, how their values are distributed, and the correlations that exist between them (we will explore correlations in more detail in the next chapter). We can determine which variables are important to our analyses, which ones give us more information, and which ones can be discarded for being redundant.
We will start by looking a several bar plots, where we will either count the number of occurrences of each value (using a histogram), or we will show the percentage of each value with respect to the total (using a bar plot). To achieve this, perform the following steps:
- Right-click on any cell within the table to access the Quick Analysis option:
- In the pop-up window, we can choose the chart type. Select Clustered...