Examining the Relationships between Features and the Response
Accurate predictions of the response variable require good features: features that are clearly linked to the response in some way. Thus far, we've examined the relationship between a couple of features and the response variable, either by calculating a groupby/mean of the response variable or by fitting models directly, which is another way to explore such relationships. However, we have not yet systematically explored how all the features relate to the response variable. We will do that now, capitalizing on the hard work we put into exploring the features and making sure the data quality was good.
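As a reminder of what the groupby/mean approach looks like, here is a minimal sketch using pandas. The DataFrame, the column names feature_a and response, and the values are hypothetical toy data rather than the case study data, and the sketch assumes a binary 0/1 response, so that the group mean can be read as the rate of the positive class.

```python
import pandas as pd

# Hypothetical toy data: column names and values are illustrative
# assumptions, not taken from the case study.
df = pd.DataFrame({
    "feature_a": ["low", "low", "high", "high", "high", "low"],
    "response": [0, 1, 1, 1, 0, 0],
})

# For a 0/1 response, the mean within each feature level is the
# fraction of positive samples at that level.
response_rate = df.groupby("feature_a")["response"].mean()
print(response_rate)
```

A noticeable difference in the mean response between levels suggests the feature may carry information about the response. This check looks at one feature at a time, which is why a more systematic view across all features is useful.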
A popular way of getting a quick look at how all the features relate to the response variable, as well as how the features are related to each other, is to use a correlation plot. We will first create the correlation plot for the case study data, then discuss how to...