Spark is a powerful, open source, general-purpose, unified cluster-computing analytics framework for large-scale data processing. It is known for high-performance, in-memory processing backed by an efficient execution engine and query optimizer. Spark provides APIs in four widely used languages, Scala, Java, Python, and R, and ships interactive shells for Scala, Python, and R. Spark is built on the foundation of the Resilient Distributed Dataset (RDD), a collection of data partitioned across the nodes of a cluster. This removes the resource ceiling of any single machine, making the system horizontally scalable, in principle without limit. With all this, it is no surprise that Spark is one of the largest open source projects in the data-processing community. Refer to the Apache Spark documentation for further information: http://spark.apache.org/docs/2.3.1/.




















































