Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

https://github.com/sameeragarwal/blinkdb

BlinkDB: Sub-Second Approximate Queries on Very Large Data.
https://github.com/sameeragarwal/blinkdb

Last synced: 2 months ago
JSON representation

BlinkDB: Sub-Second Approximate Queries on Very Large Data.

Host: GitHub
URL: https://github.com/sameeragarwal/blinkdb
Owner: sameeragarwal
License: apache-2.0
Created: 2011-10-07T00:24:33.000Z (over 12 years ago)
Default Branch: alpha-0.2.0
Last Pushed: 2014-02-06T07:21:39.000Z (over 10 years ago)
Last Synced: 2024-01-24T15:12:46.129Z (5 months ago)
Language: Scala
Homepage: http://blinkdb.cs.berkeley.edu/
Size: 302 MB
Stars: 658
Watchers: 95
Forks: 126
Open Issues: 9
Metadata Files:
- Readme: README.md
- License: LICENSE

Lists

awesome-db - BlinkDB - BlinkDB: Sub-Second Approximate Queries on Very Large Data [website] (http://blinkdb.cs.berkeley.edu/) (Scala)
awesome-db - BlinkDB - BlinkDB: Sub-Second Approximate Queries on Very Large Data [website] (http://blinkdb.cs.berkeley.edu/) (Scala)
awesome-database - BlinkDB - BlinkDB: Sub-Second Approximate Queries on Very Large Data [Website] (http://blinkdb.cs.berkeley.edu/). (#SCALA) (Scala / Embeddable engines <a name="embeddable-engines"></a>)

README

![BlinkDB](http://blinkdb.org/figures/blinkdb-logo-withaffiliations.png)
#### Queries with Bounded Errors and Bounded Response Times on Very Large Data

BlinkDB is a large-scale data warehouse system built on Shark and Spark and is designed to be
compatible with Apache Hive. It can answer HiveQL queries up to 200-300 times faster than Hive
by executing them on user-specified samples of data and providing approximate answers that are
augmented with meaningful error bars. BlinkDB 0.1.0 is an alpha developer release that supports
creating/deleting samples on any input table and/or materialized view and executing approximate
HiveQL queries with those aggregates that have statistical closed forms (i.e., AVG, SUM, COUNT,
VAR and STDEV).

#### BlinkDB requires:
* Scala 2.10.x
* Spark 0.9.x

### For current documentation, see the [BlinkDB Wiki](https://github.com/sameeragarwal/blinkdb/wiki).
### For more information about the BlinkDB Project, see the [BlinkDB Website](http://blinkdb.cs.berkeley.edu).