blog




  • Essay / Improving data locality and availability in the...

    HBase has a rigid master-slave architecture and its main objective is to be a scalable and efficient NoSQL database that helps in data storage. HBase has highly constant reads/writes, which makes it suitable for high-speed counter aggregation. There is automatic partitioning that makes it easier to divide regions as the volume of data increases in a particular region. HBase's automatic failover mechanism enables data availability with a higher probability as regions are reassigned among the rest of the regional servers. HBase ultimately stores all of its data in HDFS so that the data is stored permanently. HBase even supports many APIs such as Java Client API for programmatic access and Thrift/REST API as options for other programming options. HBase even supports the MapReduce framework for parallel processing of a large number of tasks. From the architecture diagram above, HBase is located on top of HDFS and it is certain that HBase uses HDFS as its underlying architecture. The HMaster is responsible for monitoring all Region Server instances in the cluster and is the interface for all...