Search icon CANCEL
Arrow left icon
Explore Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Conferences
Free Learning
Arrow right icon
Arrow up icon
GO TO TOP
Cassandra Design Patterns

You're reading from   Cassandra Design Patterns Build real-world, industry-strength data storage solutions with time-tested design methodologies using Cassandra

Arrow left icon
Product type Paperback
Published in Nov 2015
Publisher
ISBN-13 9781785285707
Length 168 pages
Edition 2nd Edition
Languages
Arrow right icon
Author (1):
Arrow left icon
Rajanarayanan Thottuvaikkatumana Rajanarayanan Thottuvaikkatumana
Author Profile Icon Rajanarayanan Thottuvaikkatumana
Rajanarayanan Thottuvaikkatumana
Arrow right icon
View More author details
Toc

A brief overview

How can we use Cassandra and Spark together for data analysis? How can we use Map/Reduce-like processing when using Spark? What are the general data transformations that can be performed on the data stored in Cassandra using Spark? This is a very brief overview of these capabilities. All Spark-related discussions are centered around the programming aspects. The clustering, deployments, methods of running jobs, and so on are beyond the scope of this chapter.

The most important data abstraction in Spark is Resilient Distributed Dataset (RDD). For all practical purposes, RDD can be considered as an in-memory table of data coming from its data source. The data source can be text files, files stored in HDFS, Cassandra column families, HBase column families, and so on.

Note

RDD is immutable and hence it is highly reusable and can be cached. Because of the immutability of the RDD, there is an absolute guarantee on the final results because no other process can change its contents...

lock icon The rest of the chapter is locked
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at R$50/month. Cancel anytime