In this chapter, we will develop, test, and evaluate supervised machine learning models for a variety of real-world use cases, using Python and Apache Spark together with Spark's machine learning library, MLlib. Specifically, we will train, test, and interpret the following types of supervised machine learning models:
- Univariate linear regression
- Multivariate linear regression
- Logistic regression
- Classification and regression trees
- Random forests