Getting Datasets
Often, one of the challenges in machine learning is obtaining sample datasets for experimentation. In machine learning, when you are just getting started with an algorithm, it is often useful to get started with a simple dataset that you can create yourself to test that the algorithm is working correctly according to your understanding. Once you clear this stage, it is time to work with a large dataset, and for this you would need to find the relevant source so that your machine learning model can be as realistic as possible.
Here are some places where you can get the sample dataset to practice your machine learning:
- Scikit‐learn's built‐in dataset
- Kaggle dataset
- UCI (University of California, Irvine) Machine Learning Repository
Let's take a look at each of these in the following sections.
Using the Scikit‐learn Dataset
Scikit‐learn comes with a few standard sample datasets, which makes learning machine learning easy. To load the...