Expanding your data manipulation and preparation skills
If you have read any introductory text on data science, you will have probably been told that data scientists spend the majority of their time cleaning data, reformatting it, and reshaping it.
As you have read this book, you will have probably seen this in action!
We saw several times how much code and mental effort, and importantly, domain knowledge, goes into just getting our data into a certain format. Once we have our data in a standardized format, for example, a long form (tidy) DataFrame, then our lives become easier.
You might want to learn more pandas and NumPy for a more complete set of techniques on reshaping your data however you want. As mentioned at the beginning of the chapter, learning new pandas techniques without a practical purpose in mind can help a lot in expanding your imagination. Learning regular expressions can help a lot in text analysis, because text is typically unstructured, and finding and...