Summary
There was a fair bit packed into this chapter, which covered several approaches to automating our data-cleaning work. We created functions for showing the structure of our data and generating descriptive statistics, as well as functions for restructuring and aggregating our data. We developed Python classes for handling data cleaning when we have a large number of variables, each requiring very different treatment, and saw how Python classes can make it easier to work directly with a JSON file. We also examined how to be more intentional with our data cleaning by checking our data against predefined targets. Finally, we explored how to automate our data cleaning with pipelines.