Configuring Hadoop users
In this recipe, we will configure dedicated users to run Hadoop services, which gives us better control over what each daemon can access.
In all the recipes so far, we have configured every service and daemon, whether HDFS, YARN, or Hive, to run as the user hadoop. This is not good practice for production clusters, because it makes it difficult to control services at a fine-grained level.
It is recommended to segregate services to run with different users, for example, HDFS daemons as hdfs:hadoop, YARN daemons as yarn:hadoop, and other services such as Hive or HBase with their own respective users.
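As a rough illustration of this layout, the following sketch prints the commands that could create one shared group and a separate user per service. It assumes that hdfs:hadoop and yarn:hadoop are read as user:group, and that the standard Linux groupadd and useradd tools are available. The function name plan_service_users is hypothetical and used only for this example. The script is a dry run that only echoes commands, so nothing changes on the system.

```shell
#!/bin/sh
# Dry-run sketch: prints the user-creation commands instead of running them.
# Assumption: "hdfs:hadoop" means user hdfs in group hadoop.
GROUP=hadoop

# plan_service_users is a hypothetical helper for illustration only.
plan_service_users() {
    # -f makes groupadd exit successfully if the group already exists
    echo "groupadd -f $GROUP"
    for user in "$@"; do
        # -g sets the primary group of the new service user
        echo "useradd -g $GROUP $user"
    done
}

plan_service_users hdfs yarn
```

After reviewing the printed commands, we could run them as root on each node. Users for other services, such as Hive or HBase, could be added by passing more names to the function.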
Getting ready
To step through the recipe in this section, we need a Hadoop cluster that is already configured, and it is assumed that readers are familiar with Hadoop installation and configuration. Refer to Chapter 1, Hadoop Architecture and Deployment, for the installation and configuration of a Hadoop cluster. In this recipe, we are just separating daemons to run with different users, rather than them...