Summary
In summary, the dplyr
package builds on the R language to make an even more expressive and concise language for data manipulation. In this chapter, the estimate for the total road length in 2011
is the same as in the previous chapter, but the code used to get there is more concise and easier to follow. This can mean less time spent on navigating numerous processing steps and variable names, and more time spent organizing the data.
This concludes the second section of this book, which dealt with a more formulated approach to data wrangling. If you've read up to this point, congratulations! You now have a broad understanding of the tools, approaches, and skills involved in manipulating data.
In the remaining part of the book, I will discuss advanced methods for retrieving and storing data. First, large sources of data are often made available through web interfaces called APIs. I will discuss how to use APIs to retrieve data in the next chapter.
Second, working with large amounts of data...