In this chapter, we will cover the following recipes:
- Doing term frequency with Spark - everything that counts
- Displaying similar words with Spark using Word2Vec
- Downloading a complete dump of Wikipedia for a real-life Spark ML project
- Using Latent Semantic Analysis for text analytics with Spark 2.0
- Topic modeling with Latent Dirichlet Allocation in Spark 2.0