Preparing the RHadoop environment
As RHadoop requires an R and Hadoop integrated environment, we must first prepare an environment with both R and Hadoop installed. Instead of building a new Hadoop system, we can use the Cloudera QuickStart VM (the VM is free), which contains a single node Apache Hadoop Cluster and R. In this recipe, we will demonstrate how to download the Cloudera QuickStart VM.
Getting ready
To use the Cloudera QuickStart VM, it is suggested that you should prepare a 64-bit guest OS with either VMWare, VirtualBox, or the KVM installed.
If you choose to use VMWare, you should prepare a player compatible with WorkStation 8.x or higher: Player 4.x or higher, ESXi 5.x or higher, or Fusion 4.x or higher.
Note, 4 GB of RAM is required to start VM, with an available disk space of at least 3 GB.
How to do it...
Perform the following steps to set up a Hadoop environment using the Cloudera QuickStart VM:
- Visit the Cloudera QuickStart VM download site (you may need to update the link as...