In this section, we will look at a few other data-related topics. The following will be covered:
- Checking data quality
- Reading binary files: Excel, SAS, and other data sources
- On-disk formats
- Working with web data
In this section, we will look at a few other data-related topics. The following will be covered:
There are several inbuilt functions as well as packages for checking the quality of data in R. The most commonly used among them is the summary function in base R:
## Packages Used: ## psych, pastecs, dataMaid, daff # install.packages(c("psych","pastecs","dataMaid","daff")) state <- data.frame(state.x77) state$State <- row.names(state) state summary(state)
The output of the preceding code is as follows:
library...