Variable creation and data transformation are two processes when defining data munging and data wrangling tasks. These tasks are important for proper data preparation and make it easier to analyze data for future tasks.
The functions that we will be exploring are as follows:
- Variable creation and recoding
- Data transformation
- Handling missing values
- Sorting, merging, and splitting datasets
- Aggregate by category (which means sums), which is similar to T-SQL aggregations and Windows functions
This part will cover some of the following functions, mainly focusing on data transformation, handling missing values, and splitting datasets:
RxDataSource, rxDataStep, rxDataStepXdf, RxFileSystem, rxFindFileInPath, rxFindPackage, rxFisherTest, RxForeachDoPar, rxGetInfo, rxGetInfoXdf, rxGetJobInfo, rxGetJobInfo, rxGetOption, rxGetVarInfo, rxGetVarNames...