An Overview of HBase
HBase is designed based on a Google white paper, Big Table: A Distributed Storage System for Structured Data and defined as a sparse, distributed, persistent multidimensional sorted map. HBase is a columnar and partition oriented database, but is stored in key value pair of data. I know it's confusing and tricky, so let's look at the terms again in detail.
Sparse: HBase is columnar and partition oriented. Usually, a record may have many columns and many of them may have null data, or the values may be repeated. HBase can efficiently and effectively save the space in sparse data.
Distributed: Data is stored in multiple nodes, scattered across the cluster.
Persistent: Data is written and saved in the cluster.
Multidimensional: A row can have multiple versions or timestamps of values.
Map: Key-Value Pair links the data structure to store the data.
Sorted: The Key in the structure is stored in a sorted order for faster read and write optimization.
The HBase Data Model, as...