Technical requirements
Dataset: We will use the Census Income dataset (https://archive.ics.uci.edu/ml/datasets/Adult) for this chapter.
All the code can be found in the book’s GitHub repository: https://github.com/PacktPublishing/Data-Wrangling-with-R/tree/main/Part2/Chapter8. The package used in this chapter can be installed and loaded with the following two commands:
install.packages('tidyverse')
library(tidyverse)
Figure 8.1 shows the messages displayed once the package is loaded. Be aware that R may print some warning messages in red when you load packages, but they are not errors. They simply tell you which version of the R language the installed packages were built under.
Figure 8.1 – The tidyverse package will load eight libraries. The warning messages are not errors
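If you run the chapter's scripts more than once, you may prefer not to reinstall the package every time. The following is a minimal sketch of one common way to do that, not the approach used in the book's repository: it installs tidyverse only when it is missing and then loads it. The use of requireNamespace() as the check is an assumption made for this illustration.

```r
# Install tidyverse only if it is not already available on this machine.
# requireNamespace() returns FALSE (instead of raising an error) when the
# package cannot be found, so it works well as a guard.
if (!requireNamespace("tidyverse", quietly = TRUE)) {
  install.packages("tidyverse")
}

# Load the package; the startup messages shown in Figure 8.1 still appear.
library(tidyverse)

# Optional: check which version of tidyverse was loaded.
packageVersion("tidyverse")
```

With this guard in place, the script can be rerun safely: the installation step is skipped whenever the package is already present.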
In the next section, let’s remind ourselves about tidy data and why it is important for data wrangling with tidyverse.