Studying the Relationship between Two Numeric Variables
To understand how we can study the relationship between two numeric variables, we can leverage scatter plots. It is a 2-dimensional visualization of the data, where each variable is plotted on an axis along its length. Relationships between the variables are easily identified by studying the trend across the visualization. Let's take a look at an example in the following exercise.
Exercise 30: Studying the Relationship between Employee Variance Rate and Number of Employees
Let's study the relationship between employee variance rate and the number of employees. Ideally, the number of employees should increase as the variation rate increases.
Perform the following steps to complete the exercise:
First, import the ggplot2 package using the following command:
library(ggplot2)
Create a DataFrame object, df, and use the bank-additional-full.csv file using the following command:
df <- read.csv("/Chapter 2/Data/bank-additional/bank-additional-full...