Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/rayokota/awesome-hbase

A curated list of awesome HBase projects and resources.
https://github.com/rayokota/awesome-hbase

List: awesome-hbase

awesome awesome-list hbase

Last synced: 3 months ago
JSON representation

A curated list of awesome HBase projects and resources.

Awesome Lists containing this project

README

        

# Awesome HBase [![Awesome](https://cdn.rawgit.com/sindresorhus/awesome/d7305f38d29fed78fa85652e3a63e154dd8e8829/media/badge.svg)](https://github.com/sindresorhus/awesome)

[](http://hbase.apache.org/)

A curated list of awesome HBase projects and resources.

[HBase](http://hbase.apache.org) is a distributed, scalable, big data store.

## Contents

- [Projects](#projects)
- [Clients](#clients)
- [Cloud](#cloud)
- [Frameworks](#frameworks)
- [Datasets](#datasets)
- [Document](#document)
- [Entity/JPA](#entityjpa)
- [Geospatial](#geospatial)
- [Graph](#graph)
- [SQL/OLAP](#sqlolap)
- [Time Series](#time-series)
- [Infrastructure](#infrastructure)
- [Secondary Indices](#secondary-indices)
- [Transactions](#transactions)
- [Integrations](#integrations)
- [Tools](#tools)
- [Miscellaneous](#miscellaneous)

- [Resources](#resources)
- [Books](#books)
- [Papers](#papers)
- [Community](#community)


## Projects

### Clients

* [asynchbase](https://github.com/OpenTSDB/asynchbase) - Fully asynchronous, non-blocking HBase client.
* [gohbase](https://github.com/tsuna/gohbase) - Pure Go client for HBase.
* [happybase](https://github.com/wbolster/happybase) - Python client for HBase.

### Cloud

* [Amazon EMR](https://aws.amazon.com/emr/) - Amazon's Hadoop/HBase offering on AWS.
* [Azure HDInsight](https://azure.microsoft.com/en-us/services/hdinsight/) - Microsoft's Hadoop/HBase offering on Azure.
* [Cloudera Director](https://www.cloudera.com/products/product-components/cloudera-director.html) - Run Hadoop/HBase clusters on AWS, Azure or Google Cloud.
* [Google Cloud Bigtable](https://cloud.google.com/bigtable/) - High-performance NoSQL database service accessible via HBase client API.
* [Hortonworks Cloudbreak](https://hortonworks.com/open-source/cloudbreak/) - Provision Hadoop/HBase clusters on AWS, Azure, Google Cloud, or OpenStack.

### Frameworks

#### Datasets

* [Kite](http://kitesdk.org) - High-level data layer for Hadoop/HBase.

#### Document

* [HDocDB](https://github.com/rayokota/hdocdb) - HBase as a JSON document database.

#### Entity/JPA

* [DataNucleus](http://www.datanucleus.org) - JPA persistence layer with support for HBase.
* [Gora](http://gora.apache.org) - Persistence library for big data with support for HBase.
* [HBase ORM](https://github.com/flipkart-incubator/hbase-orm) - A production-grade HBase ORM library.
* [HEntityDB](https://github.com/rayokota/hentitydb) - HBase as an entity database.
* [Kundera](https://github.com/impetus-opensource/Kundera) - JPA client with support for HBase.

#### Geospatial

* [GeoMesa](http://www.geomesa.org/) - Spatial-temporal database with support for Accumulo, HBase, Cassandra, and Kafka.

#### Graph
* [Gradoop](https://github.com/dbs-leipzig/gradoop) - Research framework for scalable graph analytics built on Flink and HBase.
* [HGraphDB](https://github.com/rayokota/hgraphdb) - HBase as a TinkerPop graph database.
* [HugeGraph](https://github.com/apache/incubator-hugegraph) - A graph database that supports more than 10+ billion data, high performance and scalability.
* [JanusGraph](http://janusgraph.org/) - Scalable graph database with support for Cassandra, HBase, Google Cloud Bigtable, and BerkeleyDB.
* [NebulaGraph](https://github.com/vesoft-inc/nebula) - A high performance distributed Graph database.
* [S2Graph](http://s2graph.incubator.apache.org) - High-performance distributed graph database built on HBase.

#### SQL/OLAP

* [AntsDB](http://antsdb.com/) - AntsDB is a low latency, high concurrency, MySQL compliant SQL layer for HBase.
* [EsgynDB](https://esgyn.com/) - Commercial SQL engine providing ACID transactions and BI analytics on top of Hadoop, based on Trafodian.
* [Kylin](http://kylin.apache.org) - Extreme OLAP engine for big data that stores data in HBase.
* [LeanXScale](http://www.leanxcale.com) - Commercial full ACID full SQL product built on Hadoop/HBase.
* [Phoenix](https://phoenix.apache.org) - SQL layer on top of HBase.
* [Splice Machine](https://www.splicemachine.com) - Commercial RDBMS built on top of HBase.
* [Trafodian](http://trafodion.apache.org) - Transactional SQL-on-Hadoop/HBase.

#### Time Series

* [Axibase](http://axibase.com/products/axibase-time-series-database/) - Distributed time series database built on HBase.
* [OpenTSDB](http://opentsdb.net) - Scalable time series database built on HBase.
* [Warp 10](http://www.warp10.io) - Time series database for sensor data.

### Infrastructure

#### Secondary Indices

* [hindex](https://github.com/Huawei-Hadoop/hindex) - Secondary index for HBase.
* [Lily HBase Indexer](http://ngdata.github.io/hbase-indexer/) - Quickly and easily search for content stored in HBase.

#### Transactions

* [Haeinsa](https://github.com/VCNC/haeinsa) - Multi-row/multi-table transaction library for HBase.
* [HBase-QoD](https://github.com/algarecu/hbase-0.94.8-qod) - Vector-field consistency for HBase fine-grained transactional inter-DC replication.
* [Omid](https://github.com/apache/incubator-omid) - Transactional support for HBase.
* [Tephra](http://tephra.incubator.apache.org) - Globally consistent transactions on top of HBase.
* [Themis](https://github.com/XiaoMi/themis) - Cross-row/cross-table transactions on HBase based on Google's Percolator.

### Integrations

* [Apex](https://github.com/apache/apex-malhar/tree/master/contrib/src/test/java/org/apache/apex/malhar/contrib/hbase) - Apex-HBase connector.
* [Beam](https://github.com/apache/beam/tree/master/sdks/java/io/hbase) - Beam HBase integration.
* [Camel](http://camel.apache.org/hbase.html) - Camel HBase component.
* [Cascading](https://github.com/Cascading/cascading.hbase) - HBase adapters for Cascading.
* [Cascalog](https://github.com/sorenmacbeth/hbase-cascalog) - Wrapper around Cascading.HBase for use in Cascalog.
* [Crunch](https://github.com/apache/crunch/tree/master/crunch-hbase) - HBase adapters for Crunch.
* [Drill](https://drill.apache.org/docs/querying-hbase/) - HBase storage plugin for Drill.
* [Elasticsearch](https://github.com/mallocator/Elasticsearch-HBase-River) - Elasticsearch import river for HBase.
* [Flink](https://github.com/apache/flink/tree/master/flink-connectors/flink-connector-hbase-2.2) - Flink-HBase connector.
* [Gearpump](https://github.com/apache/incubator-gearpump/tree/master/external/hbase) - Gearpump integration for HBase.
* [Giraph](https://github.com/apache/giraph/tree/trunk/giraph-hbase) - Giraph input and output formats for HBase.
* [HAWQ](https://hawq.apache.org/docs/userguide/2.3.0.0-incubating/pxf/HBasePXF.html) - HAWQ PXF external tables on HBase.
* [Hive](https://cwiki.apache.org/confluence/display/Hive/HBaseIntegration) - Hive HBase integration.
* [Impala](https://www.cloudera.com/documentation/enterprise/latest/topics/impala_hbase.html) - Impala support for querying HBase tables.
* [Kafka](https://github.com/apache/hbase-connectors/tree/master/kafka) - HBase Kafka proxy.
* [Pig](https://github.com/apache/pig/tree/trunk/src/org/apache/pig/backend/hadoop/hbase) - Pig HBase integration.
* [Presto](https://github.com/analysys/presto-hbase-connector) - Presto-HBase connector.
* [Pulsar](http://pulsar.apache.org/docs/en/io-hbase/) - HBase connector for Pulsar.
* [Ranger](https://cwiki.apache.org/confluence/display/RANGER/HBase+Plugin) - HBase plugin for Apache Ranger.
* [Spark](https://github.com/hortonworks-spark/shc) - Spark-HBase connector.
* [Spring for Apache Hadoop](https://projects.spring.io/spring-hadoop/) - Spring-Hadoop integration, including HBase support.
* [Storm](https://github.com/apache/storm/tree/master/external/storm-hbase) - Storm/Trident integration for HBase.
* [Tajo](https://tajo.apache.org/docs/current/hbase_integration.html) - Tajo integration with HBase.
* [Zeppelin](https://zeppelin.apache.org/docs/0.6.2/interpreter/hbase.html) - HBase shell interpreter for Apache Zeppelin.

### Tools

* [Ambari](https://ambari.apache.org) - Software for provisioning, managing, and monitor Hadoop/HBase clusters.
* [Cloudera Manager](https://www.cloudera.com/products/product-components/cloudera-manager.html) - Tool for managing Hadoop/HBase in production.
* [DbSchema](http://www.dbschema.com/index.html) - Diagram-oriented database designer with support for HBase.
* [Hannibal](https://github.com/sentric/hannibal) - Tool to monitor and maintain HBase clusters.
* [h-rider](https://github.com/NiceSystems/hrider) - GUI for viewing and manipulating data in HBase.
* [Hue](http://gethue.com) - Smart analytics workbench that includes an HBase browser.
* [Sematext SPM](http://sematext.com/spm) - Tool for [monitoring HBase](http://sematext.com/spm/integrations/hbase-monitoring), HDFS, etc.

### Miscellaneous

* [HubSpot HBase support](https://github.com/HubSpot/hbase-support) - Configs and tools for HBase at HubSpot, including Hystrix integration and coprocessors.

## Resources

### Books

* [HBase in Action](https://www.manning.com/books/hbase-in-action) - Experience-driven guide that shows you how to use HBase.
* [HBase: The Definitive Guide](http://shop.oreilly.com/product/0636920014348.do) - Comprehensive guide to HBase.
* [Architecting HBase Applications](http://shop.oreilly.com/product/0636920035688.do) - Includes HBase principles, cluster guidelines, and in-depth case studies.
* [HBase Administration Cookbook](https://www.packtpub.com/big-data-and-business-intelligence/hbase-administration-cookbook) - How to master HBase configuration and administration.
* [HBase Essentials](https://www.packtpub.com/big-data-and-business-intelligence/hbase-essentials) - A practical guide to using HBase.
* [HBase Design Patterns](https://www.packtpub.com/big-data-and-business-intelligence/hbase-design-patterns) - Successful patterns to develop scalable applications with HBase.
* [Learning HBase](https://www.packtpub.com/big-data-and-business-intelligence/learning-hbase) - Learn the fundamentals of HBase administration and development.
* [HBase High Performance Cookbook](https://www.packtpub.com/big-data-and-business-intelligence/hbase-high-performance-cookbook) - Exciting projects that teach you how to use HBase.
* [Apache HBase Primer](http://www.apress.com/us/book/9781484224236) - A compact guide to HBase essentials.
* [Pro Apache Phoenix](http://www.apress.com/us/book/9781484223697) - Basic and best practices for using Phoenix.

### Papers

* [Bigtable: A Distributed Storage System for Structured Data](https://static.googleusercontent.com/media/research.google.com/en//archive/bigtable-osdi06.pdf) - The inspiration for HBase.
* [Apache Hadoop Goes Realtime at Facebook](https://pdfs.semanticscholar.org/865a/215390cd49af9e4941e03107120e631dcaa0.pdf) - How Facebook deployed HBase to production.

### Community

* [Blog](https://blogs.apache.org/hbase/)
* [Mailing Lists](http://hbase.apache.org/mail-lists.html)
* [Reddit](https://www.reddit.com/r/hbase/)
* [Stack Overflow](https://stackoverflow.com/questions/tagged/hbase)
* [Twitter](https://twitter.com/HBase)

## License



Public Domain Mark