Machine learning using Hbase
Before we dive deep into the details of Hbase/Hadoop, Mahout, and machine learning, it's vital to discuss and highlight some important concepts, which will be used in this chapter.
Data science—in software engineering terms—is an operation of a set of programs that churns a large quantity of data to evaluate supervised or unsupervised learning models and provides a valuable tool to data scientists or systems through which decisions can be made.
The most important aspect is applying programming/algorithms that provide a cleaner dataset on which the next program can be synthesized. This also allows us to look at different patterns that can change as the size of data grows. Thus, it's important to use some form of machine learning that can adapt as the size of data grows exponentially.
There are various use cases that can be solved with different types of machine learning techniques:
Supervised learning
Unsupervised learning
Recommender system
Model efficacy
The use of...