If your objective is to use Spark as a tool for your machine learning projects, this section will introduce you to how it is possible to do so within R. Spark contains its own machine learning library that can be accessed by sparklyr, so it's pretty simple to work your machine learning projects on Spark. The website https://spark.rstudio.com/mlib/ provides a pretty good presentation about Spark Machine Learning library, so remember to visit it to discover the many available functions; they include an example workflow that is short and effective and shows how to sequence your project codes. We already talked about machine learning in Chapter 6, Machine Learning with R; in this section, I'm only going to develop the decision tree studies we did in that chapter, but now using the Spark machine learning library.
Using Spark machine learning or H2O Sparking Water
The Spark...