Organizing Data with Datasets
In his story The Adventure of the Copper Beeches, Arthur Conan Doyle has Sherlock Holmes shout “Data! Data! Data! I cannot make bricks without clay.” This mindset, which served the most famous detective in literature so well, should be adopted by every data scientist. For that reason, we begin the more technical part of this book with a chapter dedicated to data: specifically, in the Kaggle context, leveraging the power of the Kaggle Datasets functionality for our purposes.
In this chapter, we will cover the following topics:
- Setting up a dataset
- Gathering the data
- Working with datasets
- Using Kaggle Datasets in Google Colab
- Legal caveats