Combining classifiers with voting
One way to improve classification performance is to combine classifiers. The simplest way to combine multiple classifiers is to use voting, and choose whichever label gets the most votes. For this style of voting, it's best to have an odd number of classifiers so that there are no ties. This means combining at least three classifiers together. The individual classifiers should also use different algorithms; the idea is that multiple algorithms are better than one, and the combination of many can compensate for individual bias. However, combining a poorly performing classifier with better performing classifiers is generally not a good idea, because the poor performance of one classifier can bring the total accuracy down.
Getting ready
As we need to have at least three trained classifiers to combine, we are going to use a NaiveBayesClassifier
class, a DecisionTreeClassifier
class, and a MaxentClassifier
class, all trained on the highest information words of...