Selecting the features for classifying the Zoo dataset
The UCI Machine Learning Repository (https://archive.ics.uci.edu/) maintains over 600 datasets as a service to the machine learning community. These datasets can be used for experimentation with various models and algorithms. A typical dataset contains a number of features (inputs) and the desired output, in theform of columns, with a description of their meaning.
In this section, we will use the UCI Zoo dataset (https://archive.ics.uci.edu/dataset/111/zoo). This dataset describes 101 different animals using the following 18 features:
No. |
Feature Name |
Data Type |
1 |
animal name |
unique for each instance |
2 |
hair |
boolean |
3 |
feathers ... |