Running an RBM with DeepLearning4j over Spark
In this recipe, we'll see how to run a restricted Boltzmann machine for classifying the iris dataset.
Getting ready
To step through this recipe, you will need a running Spark cluster either in pseudo distributed mode or in one of the distributed modes, that is, standalone, YARN, or Mesos. Also, get familiar with ND4S, that is, n-dimensional arrays for Scala (Scala bindings for ND4J). ND4J and ND4S are scientific computing libraries for the JVM. Please visit http://nd4j.org/ for details. The pre-requisites to be installed are Java 7, IntelliJ, and the Maven or SBT build tool.
How to do it…
Start an application named
RBMWithSpark
. Initially, specify the following libraries in thebuild.sbt
file:libraryDependencies ++= Seq( "org.apache.spark" %% "spark-core" % "1.6.0", "org.apache.spark" %% "spark-mllib" % "1.6.0", "org.deeplearning4j" % "deeplearning4j-core" % "0.4-rc3.8", "org.deeplearning4j" ...