Summary
In this chapter, we played around with data pertaining to the quality of air in multiple localities of Beijing, China. We observed trends over different measures of time to see how the concentration of various pollutants differed.
In this book, we looked at several data cleaning, preparation, analysis, and visualization techniques and applied them to a diverse range of datasets from a variety of domains. We made informed decisions to delete or impute instances based on the data available, and tweaked existing features to create new ones by converting them into different formats and breaking them down into several features.
These processes helped us to derive additional insights from our data. Additionally, we learned to ensure that we ask our data the right questions and understand what information it can and cannot provide us with. It is important not to have unreasonable expectations from your data.
You are now equipped with the tools and knowledge required to...