Summary
YARN serves as a generic resource manager for distributed applications, handling resource scheduling and application execution across a cluster of nodes. It is being integrated with big data technologies such as Apache Storm and Apache Spark. In this chapter, you learned about the Spark-YARN architecture and how to submit Spark jobs to YARN. You also learned how Storm integrates with YARN and how to manage Storm services through YARN. The chapter closed with a brief overview of the integration of HAMA and Giraph with YARN.