Chapter 2. Getting Started with Apache Hadoop and Apache Spark
In this chapter, we will understand the basics of Hadoop and Spark, how Spark is different from MapReduce, and get started with the installation of clusters and setting up the tools needed for analytics.
This chapter is divided into the following subtopics:
- Introducing Apache Hadoop
- Introducing Apache Spark
- Discussing why we use Hadoop with Spark
- Installing Hadoop and Spark clusters