We have gone through the Apache Hadoop subsystem in detail in previous chapters. Although Hadoop is extensively known for its core components such as HDFS, MapReduce and YARN, it also offers a whole ecosystem that is supported by various components to ensure all your business needs are addressed end-to-end. One key reason behind this evolution is because Hadoop's core components offer processing and storage in a raw form, which requires an extensive amount of investment when building software from a grass-roots level.
The ecosystem components on top of Hadoop can therefore provide the rapid development of applications, ensuring better fault-tolerance, security, and performance over custom development done on Hadoop.
In this chapter, we cover the following topics:
- Understanding Hadoop's Ecosystem
- Working with Apache Kafka
- Writing...