Installing CDH
With a good background knowledge of CDH and its components, let's go ahead and install CDH on a cluster. The remainder of this chapter is going to be hands-on, and we will try to cover everything that is needed to get a fully functional cluster that runs CDH5.
CDH can be installed using one of the following two methods:
- Installation using the operating system's package manager (
yum
/rpm
) - Installation using Cloudera Manager
In this section, we will cover installation using the operating system's package manager. Installation of CDH using Cloudera Manager will be covered in Chapter 5, Using Cloudera Manager.
For the installation, we will be using four servers that run CentOS 6.4 (64-bit) as the operating system.
The following diagram shows a simple four-node Hadoop cluster that runs MapReduce Version 1 (MRv1):
By performing the following installation instructions, we will try to configure our cluster to look like the one shown in the preceding diagram:
- The first step...