Classifying text documents using Weka
We used Weka in Chapter 4, Learn from Data - Part 1, to classify data points that are not in text format. Weka is also a very useful tool for classifying text documents with machine-learning models. In this recipe, we will demonstrate how you can develop a document classification model using Weka 3.
Getting ready
To download Weka, go to http://www.cs.waikato.ac.nz/ml/weka/downloading.html, where you will find download options for Windows, Mac, and other operating systems such as Linux. Read through the options carefully and download the appropriate version. At the time of writing, 3.9.0 was the latest developer version, and because the author already had a version 1.8 JVM installed on his 64-bit Windows machine, he chose to download the self-extracting executable for 64-bit Windows without a Java VM.
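Before choosing between the installers with and without a bundled Java VM, it can help to confirm which JVM is already on your machine. The following is a minimal sketch, not part of Weka itself: the class name JvmCheck and its helper methods are hypothetical, the threshold of Java 8 is an assumption based on the author's own setup, and the sun.arch.data.model property is only available on some JVMs, so the code falls back to os.arch when it is missing.

```java
// Hypothetical helper for illustration; not part of the Weka distribution.
final class JvmCheck {

    // Extract the major version from a java.version string.
    // Handles the legacy "1.8.0_101" scheme as well as "9", "11.0.2", or "17-ea".
    static int majorVersion(String version) {
        String[] parts = version.split("[._\\-+]");
        int first = Integer.parseInt(parts[0]);
        if (first == 1 && parts.length > 1) {
            return Integer.parseInt(parts[1]); // "1.8.x" means Java 8
        }
        return first;
    }

    // Heuristic: trust the data model property when present,
    // otherwise guess from the architecture name (for example, "amd64").
    static boolean is64Bit(String dataModel, String arch) {
        if ("64".equals(dataModel)) {
            return true;
        }
        if ("32".equals(dataModel)) {
            return false;
        }
        return arch != null && arch.contains("64");
    }

    public static void main(String[] args) {
        String version = System.getProperty("java.version");
        String arch = System.getProperty("os.arch");
        // May be null on JVMs that do not define this property.
        String dataModel = System.getProperty("sun.arch.data.model");

        int major = majorVersion(version);
        System.out.println("Java version: " + version + " (major " + major + ")");
        System.out.println("64-bit JVM:   " + is64Bit(dataModel, arch));

        // Assumption: Java 8 is treated as sufficient, as in the author's setup.
        if (major >= 8) {
            System.out.println("An installer without a bundled Java VM may be enough.");
        } else {
            System.out.println("Consider an installer that bundles a Java VM.");
        }
    }
}
```

Compile and run it with javac JvmCheck.java followed by java JvmCheck. If no JVM is found at all, that alone suggests picking a download that includes one.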
After the download is complete, double-click on the executable file and follow the on-screen instructions. You need to install the full version...