Automation of a secured Hadoop deployment
In a production environment, a Hadoop cluster can contain hundreds (sometimes even thousands) of nodes. Managing and configuring such a large cluster manually is laborious and error prone. Traditionally, enterprises used Chef, Puppet, or a similar solution for cluster configuration management and deployment. In this approach, organizations had to continuously update their Chef recipes to keep up with changes in Apache Hadoop releases. Instead, organizations now typically automate cluster deployment with the tool provided by the Hadoop distribution they work with. For example, in a Cloudera-based Hadoop distribution, organizations leverage Cloudera Manager to provide cluster deployment, automation, and management capabilities. For Hortonworks-based distributions, organizations prefer Ambari. Similarly, the Intel distribution has Intel Manager for Apache Hadoop. Each of these deployment managers supports secured Hadoop deployment. The approach...