In this recipe, we download and inspect the well-known Iris dataset in preparation for the upcoming streaming KMeans recipe, which lets you see classification/clustering in real-time.
The data is housed on the UCI machine learning repository, which is a great source of data to prototype algorithms on. You will notice that R bloggers tend to love this dataset.