Similar to the previous recipe, we will use the DecisionTree() class to train and predict an outcome using a regression tree model. To refresh all these models is a variation on CART (Classification and Regression Tree), which comes in two modes. In this recipe, we explore the regression API for the decision tree implementation in Spark.
Solving Regression problems with Decision Trees in Spark 2.0
How to do it...
- Start a new project in IntelliJ or in an IDE of your choice. Make sure the necessary JAR files are included.
- Set up the package location where the program will reside:
package spark.ml.cookbook.chapter10
- Import the necessary packages for the Spark context to get access to the cluster and Log4j.Logger to reduce...