Data Analysis and Visualization with R and Python in Excel – A Case Study
In this final chapter, we are going to perform an analysis—visualization and a simple model—built with data from Excel and place all those outcomes back into it. This can be useful when there is a lot of data, or the calculations themselves are best suited to being done outside of Excel.
First, we will start with importing our data and then performing some data exploration via visualizations. For this chapter, we are going to use the diamonds
dataset from the R package called ggplot2
. We will view the data where the price is the outcome and look at it via different facets of the diamond’s characteristics. After the visualizations are done, we will perform some simple modeling to predict the price of a diamond based on its characteristics.
In this chapter, we’re going to cover the following main topics:
- Getting a visualization
- Performing a simple machine learning...