Data collection
We’re lucky. I’ve already found our data and included it for your consideration. In the real world, we would have needed to have performed the normal step of data collection. While there are entire tomes on this topic – most 4-year scientific university degree programs focus heavily on this topic – I don’t plan on doing a deep dive here. However, I will at least give you an overview should you be new to what we’re trying to accomplish.
Downloading from an external source
This is the case for our example dataset since I downloaded it from Kaggle. When using a dataset downloaded from the internet, we should always make sure to check its copyright license. Most of the time, if it is in the public domain, we can freely use and distribute it without any worry. The example dataset we are using is an instance of this. On the other hand, if the dataset is copyrighted, we might still be able to use it by asking for permission from...