Welcome to Apache HBase™

Apache HBase™ is the Hadoop database, a distributed, scalable, big data store.

When Would I Use Apache HBase?

Use Apache HBase when you need random, realtime read/write access to your Big Data. This project's goal is the hosting of very large tables -- billions of rows X millions of columns -- atop clusters of commodity hardware. Apache HBase is an open-source, distributed, versioned, column-oriented store modeled after Google's Bigtable: A Distributed Storage System for Structured Data by Chang et al. Just as Bigtable leverages the distributed data storage provided by the Google File System, Apache HBase provides Bigtable-like capabilities on top of Hadoop and HDFS.

Features

  • Linear and modular scalability.
  • Strictly consistent reads and writes.
  • Automatic and configurable sharding of tables
  • Automatic failover support between RegionServers.
  • Convenient base classes for backing Hadoop MapReduce jobs with Apache HBase tables.
  • Easy to use Java API for client access.
  • Block cache and Bloom Filters for real-time queries.
  • Query predicate push down via server side Filters
  • Thrift gateway and a REST-ful Web service that supports XML, Protobuf, and binary data encoding options
  • Extensible jruby-based (JIRB) shell
  • Support for exporting metrics via the Hadoop metrics subsystem to files or Ganglia; or via JMX

Where Can I Get More Information?

See the Architecture Overview, the Apache HBase Reference Guide FAQ, and the other documentation links on the left!

News

September 26, 2013 HBase Meetup at Arista Networks.in San Francisco

August 20th, 2013 HBase Meetup at Flurry.in San Francisco

July 16th, 2013 HBase Meetup at Twitter.in San Francisco

June 25th, 2013 Hadoop Summit Meetup.at San Jose Convention Center

June 14th, 2013 KijiCon: Building Big Data Apps in San Francisco.

June 13th, 2013 HBaseCon2013 in San Francisco. Submit an Abstract!

June 12th, 2013 HBaseConHackAthon at the Cloudera office in San Francisco.

Old News