This page is a list of projects that are related to HBase. Put new projects at the top of the list.
To add entries you need write permission to the wiki, which you can get by subscribing to the firstname.lastname@example.org mailing list and asking for permissions on the wiki account username you've registered yourself as. If you are using HBase in production you ought to consider getting involved in the development process anyway, by filing bugs, testing beta releases, reviewing the code and turning your notes into shared documentation. Your participation in this process will ensure your needs get met.
Projects that add new features to hbase:
HBase SEP -- the HBase Side Effect Processor, a system for asynchronously and reliably listening to HBase mutation events, based on HBase replication
Lily HBase Indexer -- indexes HBase content to Solr by listening to the replication stream (uses the HBase SEP)
Crux - HBase Reporting and Analysis with support for simple and composite keys, get and range scans, column based filtering, charting.
Omid - Lock-free transactional support on top of HBase providing Snapshot Isolation.
Parhely Home Page -- ORM for HBase
HBase-Writer -- Heritrix2 Processor for writing crawls to HBase
The Pigi Project is an ORM-like framework. It includes a configurable index system and a simple object to HBase mapping framework (or indexing for HBase if you like). Designed for use by web applications.
hbase-thrift generates and installs Perl and Python Thrift bindings for HBase.
OHM -- is a weakly relational ORM for HBase which provides Object Mapping and Column indexing. It has its own compiler capable of generating interface code for multiple languages. Currently C# (via the Thrift API), with support Java currently in development. The compiler is easily extensible to add support for other languages.
DataNucleus -- is a Java JDO/JPA/REST implementation. It supports HBase, and many other datastores.
Kundera -- Kundera is a JPA 2.0 based object-datastore mapping library for HBase, Cassandra and MongoDB.
Zohmg -- Time series data store that uses HBase as its backing store.
Grails Support -- Grails HBase plug-in
BigRecord -- is an active_record-based object mapping layer for Ruby on Rails.
hbase-stargate -- Ruby client for HBase Stargate.
Meetup.Beeno -- simple HBase Java "beans" mapping framework based on annotations. Includes a rudimentary high level query API that generates the appropriate server-side filters.
Spring Hadoop - The Spring Hadoop project provides support for writing Apache Hadoop applications that benefit from the features of Spring, Spring Batch and Spring Integration.
Spring Framework HBase Template -- Provides HBase data access templates similar to what is provided in Spring for JDBC, Hibernate, iBatis, etc. If you find this useful, please vote for its inclusion in the Spring Framework.
Clojure-HBase -- A library for convenient access to HBase from Clojure.
HBase indexing library -- A library for building and querying HBase-table-based indexes.
HBasene -- Lucene+HBase - Using HBase as the backing store for the TF-IDF representations needed by Lucene. Also, contains a library for constructing lucene indices from HBase schema.
JMXToolkit -- A HBase tailored JMX toolkit enabling monitoring with Cacti and checking with Nagios or similar.
IHBASE -- Provides faster scans by indexing regions, each region has it's own index. The indexed columns are user-defined and indexes can be intersected or joined in a single query.
HBASE EC2 scripts -- This collection of bash scripts allows you to run HBase clusters on Amazon.com's Elastic Compute Cloud (EC2) service with best practices baked in.
Stargate -- Enhanced RESTful interface.
HBase-trx -- Transactional (JTA) and indexed extensions of HBase.
HBase Thrift Python client Debian package -- Debian packages for the HBase Thrift Python client (see readme for sources.list setup)
capjure -- capjure is a persistence helper for HBase. It is written in the Clojure language, and supports persisting of native hash-maps.
HBaseHUT -- (High Update Throughput for HBase) It focuses on write performance during records update (by avoiding doing Get on every Put to update record).
HBaseWD -- HBase Writes Distributor spreads records over the cluster even when their keys are sequential, while still allowing fast range scans over them
HBase UI Tool & Util - Hbase UI client & simple util module -- it can handle hbase more easily like jdo(not persistence api)
Example HBase Applications
HBase powered feed aggregator by Savu Andrei -- 200909