There are several ways to install Hadoop. The most common ones are:
- Installing Hadoop from the source files from https://hadoop.apache.org
- Installing using open source distributions from commercial vendors such as Cloudera and Hortonworks
In this exercise, we will install the Cloudera Distribution of Apache Hadoop (CDH), an integrated platform consisting of several Hadoop and Apache-related products. Cloudera is a popular commercial Hadoop vendor that provides managed services for enterprise-scale Hadoop deployments in addition to its own release of Hadoop. In our case, we'll be installing the HDP Sandbox in a VM environment.