Chapter 14. Analyzing the R Community
In this final chapter, I will try to summarize what you have learned in the past 13 chapters. To this end, we will create an actual case study, independent from the previously used hflights
and mtcars
datasets, and will now try to estimate the size of the R community. This is a rather difficult task as there is no list of R users around the world; thus, we will have to build some predicting models on a number of partial datasets.
To this end, we will do the following in this chapter:
- Collect live data from different data sources on the Internet
- Cleanse the data and transform it to a standard format
- Run some quick descriptive, exploratory analysis methods
- Visualize the extracted data
- Build some log-linear models on the number of R users based on an independent list of names