Learning YARN: Moving beyond MapReduce - learn resource management and big data processing using YARN

Akhil Arora

Shrey Mehrotra

₹800 per month

4 (2 Ratings)

Paperback Aug 2015 278 pages 1st Edition

What do you get with a Packt Subscription?

Free for first 7 days. ₹800 p/m after that. Cancel any time!

Unlimited ad-free access to the largest independent learning library in tech. Access this title and thousands more!

50+ new titles added per month, including many first-to-market concepts and exclusive early access to books as they are being written.

Innovative learning tools, including AI book assistants, code context explainers, and text-to-speech.

Thousands of reference materials covering every tech concept you need to stay up to date.

Description

Today, enterprises generate huge volumes of data. To provide effective services and make smarter decisions from this data, they use big data analytics. In recent years, Hadoop has been used for massive data storage and efficient distributed processing of data. The Yet Another Resource Negotiator (YARN) framework solves the resource management design problems of the Hadoop 1.x framework by providing a more scalable, efficient, flexible, and highly available resource management framework for distributed data processing. This book starts with an overview of YARN features and explains how YARN provides a business solution for growing big data needs. You will learn to provision and manage single-node as well as multi-node Hadoop-YARN clusters. You will walk through YARN administration, life cycle management, application execution, REST APIs, schedulers, and the security framework. You will gain insights into YARN components and features such as ResourceManager, NodeManager, ApplicationMaster, Container, Timeline Server, High Availability, and Resource Localization. The book explains Hadoop-YARN commands and component configurations, and explores topics such as High Availability, Resource Localization, and log aggregation. You will then be ready to develop your own ApplicationMaster and execute it over a Hadoop-YARN cluster. Towards the end of the book, you will learn about the security architecture and the integration of YARN with big data technologies such as Spark and Storm. This book promises conceptual as well as practical knowledge of resource management using YARN.

Who is this book for?

This book is intended for those who want to understand what YARN is and how to use it efficiently for the resource management of large clusters. For cluster administrators, this book gives a detailed explanation of provisioning and managing YARN clusters. If you are a Java developer or an open source contributor, this book will help you drill down into the YARN architecture, write your own YARN applications, and understand the application execution phases. This book will also help big data engineers explore YARN integration with real-time analytics technologies such as Spark and Storm.

What you will learn

Explore YARN features and offerings
Manage big data clusters efficiently using the YARN framework
Create single-node as well as multi-node Hadoop-YARN clusters on Linux machines
Understand YARN components and their administration
Gain insights into application execution flow over a YARN cluster
Write your own distributed application and execute it over a YARN cluster
Work with schedulers and queues for efficient scheduling of applications
Integrate big data projects like Spark and Storm with YARN

Frequently bought together

Learning Docker, eBook, Jun 2015, 240 pages, 3.8 (5 Ratings): ₹799 ~~₹3276.99~~
Learning Hadoop 2, eBook, Feb 2015, 382 pages, 3.8 (4 Ratings): ₹799 ~~₹3276.99~~
Learning YARN, eBook, Aug 2015, 278 pages, 4 (2 Ratings): ₹799 ~~₹2919.99~~

Total ₹ 11,843.97

Xiao Zhang Sep 16, 2015

I'm one of the technical reviewers of this book. The book is written in very simple language and is very easy to understand. Even if you do not know anything about Big Data, Hadoop, and YARN, you will find this book very easy to follow, and you will be surprised how much you can learn from reading it. The book also has rich technical detail and sample code. It is very useful for technical practitioners who want to write their own YARN applications. It also has the latest information on YARN integration with Spark and Storm for real-time analytics. This book goes great with another YARN book I reviewed, YARN Essentials. Compared to YARN Essentials, this book has more hands-on technical details. But the authors did a very good job of explaining the concepts at the beginning of each chapter before going into technical details. You will gain a comprehensive understanding of the YARN architecture and also be able to do hands-on development and admin work. I especially enjoyed the chapter on YARN integration with Spark and Storm. Hadoop is known for its batch processing power. With new features such as YARN, Spark, and Storm available in Hadoop 2.0, we will be able to solve our challenges related to real-time analytics, which is critical for businesses to be successful in this Big Data era. I'm happy to have had the opportunity to work on this book, and I'm very pleased with the result. I believe this book is a great asset to anyone interested in YARN!

Amazon Verified review

K Hu Mar 08, 2018

This book is a simplified version of the online docs. It is a thin book that took me 2 days to read through, and it is definitely not worth 50+ bucks. 0 depth.

Learning YARN

Chapter 2. Setting up a Hadoop-YARN Cluster

Starting with the basics

The Hadoop-YARN single node installation

Prerequisites

Installation steps

Step 1...

An overview of web user interfaces

The Hadoop-YARN multi-node installation

Prerequisites

An overview of the Hortonworks and Cloudera installations
