Subscription

Explore Products

Best Sellers

New Releases

Books

Videos

Audiobooks

Learning Hub

Conferences

Free Learning

You're reading from Apache Hadoop 3 Quick Start Guide Learn about big data processing and analytics

Product type Paperback

Published in Oct 2018

Publisher Packt

ISBN-13 9781788999830

Length 220 pages

Edition 1st Edition

Languages

Java

Tools

Hadoop

Concepts

Big Data

Author (1):

Hrishikesh Vijay Karambelkar

View More author details

Table of Contents (10) Chapters

Preface

1. Hadoop 3.0 - Background and Introduction

2. Planning and Setting Up Hadoop Clusters FREE CHAPTER

3. Deep Dive into the Hadoop Distributed File System

4. Developing MapReduce Applications

5. Building Rich YARN Applications

6. Monitoring and Administration of a Hadoop Cluster

7. Demystifying Hadoop Ecosystem Components

8. Advanced Topics in Apache Hadoop

9. Other Books You May Enjoy

Leave a review - let other readers know what you think

Real-time streaming with Apache Storm

Apache Storm provides a distributed real-time computational capability for processing large amounts of data with high velocity. This is one of the reasons why it is being used primarily for real-time streaming data for rapid analytics. Storm is capable of processing over thousands of data records per second on a distributed cluster. Apache Storm runs on YARN framework and can connect with queues such as JMS and Kafka or to any type of database or it can listen to streaming APIs feeding information continuously, such as Twitter-streaming APIs and RSS feeds.

Apache Storm uses networks of spouts, bolts, and sinks called topology to address any kind of complex problems. Spouts represents a source where Storm is collecting information such as APIs, databases, or message queues. Bolts provide computation logic for an input stream and they produce...

The rest of the chapter is locked

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at $19.99/month. Cancel anytime

Authors (1)

Vijay Karambelkar

Hrishikesh Vijay Karambelkar is an innovator and an enterprise architect with 16 years of software design and development experience, specifically in the areas of big data, enterprise search, data analytics, text mining, and databases. He is passionate about architecting new software implementations for the next generation of software solutions for various industries, including oil and gas, chemicals, manufacturing, utilities, healthcare, and government infrastructure. In the past, he has authored three books for Packt Publishing: two editions of Scaling Big Data with Hadoop and Solr and one of Scaling Apache Solr. He has also worked with graph databases, and some of his work has been published at international conferences such as VLDB and ICDE.

See other products by Vijay Karambelkar