Descriptive statistics give insights into understanding data. These are summary statistics that describe a given dataset by summarizing features and measures, such as central tendency and measure of spread (or variability). Central tendency includes calculation of the mean, median, mode, whereas measures of variability include range, quartiles, minimum and maximum value, variance and standard deviation, as well as skewness and kurtosis.
These statistics are covered byrx- functions in RevoScaleR package, which means that you can use all the computational advantages of the package by calling: rxSummary, rxCrossTabs, rxMarginals, rxQuantile, rxCube, and rxHistogram, without worrying about the performance, out of memory exceptions, or which R package holds the right function.
We will be using the[Sales].[vPersonDemographics] view in the AdventureWorks...