Prerequisites
The following are the prerequisites for installing a secure Hadoop cluster:
Root or sudo access for the user installing the cluster.
Hadoop cluster is configured and running in a non-secured mode.
Proper file permissions are assigned to local and Hadoop system directories.
Incase, we are building Kerberos from the source code, we will need the GCC compiler to compile the Kerberos source code. On RHEL/CentOS, run the
yum groupinstall 'Development Tools'
command to install all the dependencies.DNS resolutions and host mappings are working for all machines in the cluster. Kerberos doesn't work with IP. Reverse DNS lookup on all nodes should be working and returning the fully qualified hostname.
The ports required for Kerberos are port 88 for KDC and port 749 for admin services. Since all nodes will have to connect with KDC for authentication, port 88 should be open for all nodes in the cluster running the Hadoop daemons.
The name of the Kerberos realm that will be used for authenticating...