In this section, we will build a classifier for large datasets, based on a decision tree algorithm, that you can apply to a real-world scenario.
Let's say you work for a company and want to identify your ideal market. You also want to know who the customers buying your product or service are. To answer these questions, you can draw on the large amount of data you have collected about past customers.
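To make the idea concrete before turning to a library, here is a minimal sketch of how an ID3-style decision tree decides which attribute to split on first: it picks the one with the highest information gain. The records, the attribute names, and the helper methods `entropy` and `information_gain` are all hypothetical and exist only for this illustration; they are not part of any gem.

```ruby
# Hypothetical historical customer records: [age group, education, bought?]
RECORDS = [
  ['young', 'college',     'yes'],
  ['young', 'high school', 'no'],
  ['older', 'college',     'yes'],
  ['older', 'high school', 'no']
].freeze

# Hypothetical attribute names, in the same order as the record columns.
ATTRIBUTES = ['Age', 'Education'].freeze

# Shannon entropy (in bits) of a list of class labels.
def entropy(labels)
  total = labels.size.to_f
  labels.group_by(&:itself).values.sum do |group|
    prob = group.size / total
    -prob * Math.log2(prob)
  end
end

# Reduction in entropy obtained by splitting the records on one column.
def information_gain(records, column)
  labels = records.map(&:last)
  remainder = records.group_by { |r| r[column] }.values.sum do |subset|
    (subset.size.to_f / records.size) * entropy(subset.map(&:last))
  end
  entropy(labels) - remainder
end

gains = ATTRIBUTES.each_index.map do |i|
  [ATTRIBUTES[i], information_gain(RECORDS, i)]
end.to_h

best = gains.max_by { |_, gain| gain }.first
puts "Information gain per attribute: #{gains}"
puts "Best first split: #{best}"
```

In this toy data, education separates buyers from non-buyers perfectly while age tells us nothing, so the tree would split on education first. Real customer data is rarely this clean, which is why we let a library do the work on a full training set.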
As in the previous lesson, let's require the rubygems and decisiontree libraries and set up some attributes. In this example, our attributes are demographic data, such as age, education, income, and marital status. We also have some training data, which the decision tree will learn from in order to make its predictions:
require 'rubygems'
require 'decisiontree'
attributes = ['...