In this chapter, we have gone through a step by step guide for installing and setting up a new virtual machine. Then, while using our new virtual machine, we have gone through the installation of Ubuntu to treat it as a dedicated server. We then learnt how to install and configure Apache Hadoop in a single node environment as well as in pseudo-distributed mode. In between, we have also gone through a step by step guide to the installation of the Hadoop prerequisite applications Java and SSH. Now, we are ready to go deeper into the big data world, and will be able to execute our programs.
In the next chapter, we will take a deep dive into the Hadoop echo system, considering the past and present of Hadoop, HDFS, MapReduce, and YARN in detail.