Mastering Apache Cassandra 3.x: An expert guide to improving database scalability and availability without compromising performance , Third Edition

What do you get with Print?

Instant access to your digital copy whilst your Print order is Shipped

Paperback book shipped to your preferred address

Redeem a companion digital copy on all Print orders

Access this title in our online reader with advanced features

DRM FREE - Read whenever, wherever and however you want

Cassandra Architecture

In this chapter, we will discuss the architecture behind Apache Cassandra in detail. We will discuss how Cassandra was designed and how it adheres to the Brewer's CAP theorem, which will give us insight into the reasons for its behavior. Specifically, this chapter will cover:

Problems that Cassandra was designed to solve
Cassandra's read and write paths
The role that horizontal scaling plays
How data is stored on-disk
How Cassandra handles failure scenarios

This chapter will help you to build a good foundation of understanding that will prove very helpful later on. Knowing how Apache Cassandra works under the hood helps for later tasks around operations. Building high-performing, scalable data models is also something that requires an understanding of the architecture, and your architecture can be the difference between an unsuccessful or a successful...

Key benefits

Write programs more efficiently using Cassandra's features with the help of examples

Configure Cassandra and fine-tune its parameters depending on your needs

Integrate Cassandra database with Apache Spark and build strong data analytics pipeline

Description

With ever-increasing rates of data creation, the demand for storing data fast and reliably becomes a need. Apache Cassandra is the perfect choice for building fault-tolerant and scalable databases. Mastering Apache Cassandra 3.x teaches you how to build and architect your clusters, configure and work with your nodes, and program in a high-throughput environment, helping you understand the power of Cassandra as per the new features. Once you’ve covered a brief recap of the basics, you’ll move on to deploying and monitoring a production setup and optimizing and integrating it with other software. You’ll work with the advanced features of CQL and the new storage engine in order to understand how they function on the server-side. You’ll explore the integration and interaction of Cassandra components, followed by discovering features such as token allocation algorithm, CQL3, vnodes, lightweight transactions, and data modelling in detail. Last but not least you will get to grips with Apache Spark. By the end of this book, you’ll be able to analyse big data, and build and manage high-performance databases for your application.

What you will learn

Write programs more efficiently using Cassandra s features more efficiently

Exploit the given infrastructure, improve performance, and tweak the Java Virtual Machine (JVM)

Use CQL3 in your application in order to simplify working with Cassandra

Configure Cassandra and fine-tune its parameters depending on your needs

Set up a cluster and learn how to scale it

Monitor a Cassandra cluster in different ways

Use Apache Spark and other big data processing tools

What do you get with Print?