Hadoop and Cassandra
In the age of big data analytics, there are hardly any data-rich companies that do not want their data to be extracted, evaluated, and inferred to provide more business inside. In the past, analyzing large datasets (structured or unstructured) that span terabytes or petabytes used to be expensive and a technically challenging task to a team; distributed computing was harder to keep track of, and hardware to support this kind of infrastructure was not financially feasible to everyone.
Note
This chapter does not cover Cassandra integration with Hive and Oozie. To learn about Cassandra integration with Oozie, visit http://wiki.apache.org/cassandra/HadoopSupport#Oozie.
There are ongoing efforts to bring Hive integration to Cassandra as its native part. If you are planning to use Cassandra with Hive, visit https://issues.apache.org/jira/browse/CASSANDRA-4131.
DataStax Enterprise editions have built-in Cassandra-enabled Hive MapReduce clients. Check them out at http://www.datastax...