Introduction to stringr
There is a lot to learn about strings. Although R was created with statistics in mind, it has developed considerably over the years, and many libraries have emerged. As already mentioned, working with strings is a valuable skill, given how often you will need to deal with these objects in your daily work as a data scientist. Sentiment analysis of clients or social media, analysis of comments in feedback forms, analysis of textual information scraped from the internet, or simply parsing a city name out of an address are some of the many tasks that can be part of a data wrangling request.
To code along with this chapter, make sure that you have installed and loaded the following libraries. Of these, you may be missing the gutenbergr package, so I suggest that you run install.packages("gutenbergr") before trying to load it. We will use it for an exercise at the end of this chapter:
# Use install.packages("library_name...