HBase: Bigtable-like structured storage for Hadoop HDFS
Just as Google's
Bigtable leverages the distributed data storage provided by the
Google File System, HBase provides Bigtable-like capabilities on top of Hadoop Core. Data is organized into tables, rows and columns. An Iterator-like interface is available for scanning through a row range (and of course there is the ability to retrieve a column value for a specific key). Any particular column may have multiple versions for the same row key.
General Information
HBase Home Page -- hbase.org HBase Architecture -- (updated 2008/09/05)
FAQ and Troubleshooting
Support:
HBase Mailing Lists See also the
Hadoop Mailing Lists
HBase
news, presentations, articles, and
twitter. PoweredBy, a list of sites and applications powered by HBase
Administrators / Setup Guides and config
HBase and Performance
Tools for evaluating HBase performance and scalability
There are setup instructions and a JMeter Test Plan in
HADOOP-2625
Rolling Restart of HBase
Migrating between HBase versions
User Developer Documentation
HBase Shell -- Based on Ruby's IRB
JRuby interface to HBase -- obsoleted by the new (J)IRB shell
HBase non-java access
Using HBase with Hadoop MapReduce -- Obsoleted by
HBase MapReduce Package Summary
Developer Documentation
Roadmaps -- TODO: Update!
Design Documents
HBase Design Overview (Under Construction)
Glossary (Under Construction)
Obsolete/Stale
Discussion of new file format -- hfile has become the new store file format in hbase
HBase/Zookeeper integration documentation -- integrated
Using Bloom Filters -- removed in 0.20.x, to be reinstated in 0.21.x