We will start with the most commonly used machine learning technique: classification. As we reviewed in the first chapter, the main idea is to automatically build a mapping between the input variables and the outcome. In the following sections, we will look at how to load the data, select features, implement a basic classifier in Weka, and evaluate its performance.
Classification
Data
For this task, we will take a look at the ZOO database. The database contains 101 data entries of animals described with 18 attributes, as shown in the following table:
animal |
aquatic |
fins |
hair |
predator |
legs |
feathers |
toothed |
tail |
eggs |
backbone |
domestic |
milk |
breathes |
cat size |
airborne |
venomous |
type... |