The integration of Storm with YARN
Storm is a distributed computational and processing framework, which was developed to process streaming data in real time. It has been released as open source. It is useful for continuous monitoring of processes and running machine learning algorithms. Storm can process millions of records per second on a single node and is widely used for low-latency processing.
Storm has two main services: Nimbus
(master) and Supervisor
(slave). Storm requires Zookeeper component for co-ordination between the Nimbus
and Supervisor
services. The Storm bundle contains a storm.yaml
configuration file. The file contains information related to the Nimbus
server and the Zookeeper quorum. Similar to Spark, this book does not focus on the architecture and the components of Storm. To read more about Storm, you can refer to the official website at https://storm.apache.org/.
Companies such as GroupOn, The Weather Channel, Twitter, Yahoo, and so on are using Storm. To get the full...