Apart from the core components, Hadoop contains many tools and libraries existing on top of the core, collectively called the Hadoop ecosystem.
The following figure just lists a few commonly-used frameworks constituting the Hadoop ecosystem:
Figure 13: Hadoop Ecosystem (a few important components shown)
The following section tries to categorize these frameworks on top of Hadoop according to its core capability and briefly explains them. Getting into the details of each one is not possible and the section is intended for a skim knowledge so that you are well aware of Hadoop’s capabilities and choices.
The following section covers various frameworks in the Hadoop ecosystem, categorizing each into a capability. The various capabilities are these:
- Data access/processing components
- Data storage components
- Monitoring, management, and orchestration components
- Data integration components...