Chapter 7. Handling Big Data
In this chapter, we will cover the following recipes:
- Training an online logistic regression model using Apache Mahout
- Applying an online logistic regression model using Apache Mahout
- Solving simple text-mining problems with Apache Spark
- Clustering using KMeans algorithm with MLib
- Creating a linear regression model with MLib
- Classifying data points with a Random Forest model using MLib