In this chapter, we will train and evaluate unsupervised machine learning models and apply them to a variety of real-world use cases, again using Python, Apache Spark, and Spark's machine learning library, MLlib. Specifically, we will develop and interpret the following types of models and techniques:
- Hierarchical clustering
- K-means clustering
- Principal component analysis