Summary
We explored key tidy data topics in this chapter. These topics included handling duplicated data, either by dropping rows where the data are redundant, or aggregating by group. We also restructured data stored in a many-to-many format into a tidy format. Finally, we stepped through several ways of converting data from wide to long format, and back to wide when necessary. Up next is the final chapter of the book, where we will learn to automate data cleaning with user-defined functions, classes and pipelines.
Join our community on Discord
Join our community’s Discord space for discussions with the author and other readers:
https://discord.gg/p8uSgEAETX