The previous section looked at the high-level characteristics of data as defined by data types, formats, and sources. Data extraction, transformation, and load (ETL) is a generic term for the process of extracting (retrieving) data, applying transformations such as cleansing operations and aggregations, and finally loading the data onto a target system, if one is needed.
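The three ETL stages can be illustrated with a minimal base R sketch. The sample records, the column names (region, amount), and the temporary CSV file standing in for a target system are all assumptions made for illustration; a real pipeline would read from and write to actual sources.

```r
# Extract: read raw records. A small in-memory CSV stands in for a real source
# (hypothetical data; the empty field in the third line is a missing amount).
raw_lines <- c("region,amount", "north,100", "south,", "north,250", "south,75")
sales <- read.csv(text = raw_lines, stringsAsFactors = FALSE)

# Transform: cleanse by dropping rows with a missing amount,
# then aggregate the remaining amounts by region.
clean <- sales[!is.na(sales$amount), ]
totals <- aggregate(amount ~ region, data = clean, FUN = sum)

# Load: write the result to a target. A temporary CSV file stands in
# for the target system here.
target <- tempfile(fileext = ".csv")
write.csv(totals, target, row.names = FALSE)

print(totals)
```

Here the cleansing step simply discards incomplete rows. Depending on the data, imputing or flagging missing values may be more appropriate.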
Data extraction, transformation, and load
Basic tools of data wrangling
In this section, we'll cover some of the common data mining and aggregation operations that can be performed with base R data frames (data.frame), dplyr, and data.table.
First, we are going to learn how to use a few functions in Base R to perform basic manipulation operations. We'll then...