Apache HBase provides a distributed, columnar key-value-based storage on Apache Hadoop. It is best suited when you need to perform read-writes randomly on large and varying data stores. HBase is capable of distributing and sharding its data across multiple nodes of Apache Hadoop, and it also provides high availability through its automatic failover from one region server to another. Apache HBase can be run in two modes: standalone and distributed. In the standalone mode, HBase does not use HDFS and instead uses a local directory by default, whereas the distributed mode works on HDFS.
Apache HBase stores its data across multiple rows and columns, where each row consists of a row key and a column containing one or more values. A value can be one or more attributes. Column families are sets of columns that are collocated together for performance reasons...