In-transit encryption
In this recipe, we will configure in-transit encryption to secure data transferred between nodes during the shuffle phase. Mapper output is consumed by reducers, which can run on different nodes, so securing the transfer channel means securing the communication between mappers and reducers. We will also secure the RPC communication channel, although this introduces a slight overhead and should be set up only if it is absolutely necessary.
Getting ready
To complete this recipe, the user must have completed the previous recipe, Configure SSL in Hadoop. We will extend the configuration set up in that recipe by adding a few more options.
Note
It is recommended that users explore SSL and learn more about ciphers to understand their security and performance implications.
How to do it...
- Connect to the nn1.cluster1.com master node and switch to user hadoop.
- To enable RPC privacy, edit core-site.xml to add the following lines on each node in the cluster:<...
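
As a minimal sketch of what such an entry can look like, assuming the setting meant here is the standard Hadoop property hadoop.rpc.protection (its accepted values are authentication, integrity, and privacy, where privacy adds encryption on top of the other two):

```xml
<!-- Sketch only: assumes hadoop.rpc.protection is the property being set. -->
<!-- Place inside the existing <configuration> element of core-site.xml. -->
<property>
  <name>hadoop.rpc.protection</name>
  <!-- "privacy" = authentication + integrity checks + encryption of RPC traffic -->
  <value>privacy</value>
</property>
```

The value should be the same on every node, since clients and servers have to agree on the protection level for RPC connections to succeed.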