Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/sameeragarwal/blinkdb
BlinkDB: Sub-Second Approximate Queries on Very Large Data.
https://github.com/sameeragarwal/blinkdb
Last synced: 2 months ago
JSON representation
BlinkDB: Sub-Second Approximate Queries on Very Large Data.
- Host: GitHub
- URL: https://github.com/sameeragarwal/blinkdb
- Owner: sameeragarwal
- License: apache-2.0
- Created: 2011-10-07T00:24:33.000Z (over 12 years ago)
- Default Branch: alpha-0.2.0
- Last Pushed: 2014-02-06T07:21:39.000Z (over 10 years ago)
- Last Synced: 2024-01-24T15:12:46.129Z (5 months ago)
- Language: Scala
- Homepage: http://blinkdb.cs.berkeley.edu/
- Size: 302 MB
- Stars: 658
- Watchers: 95
- Forks: 126
- Open Issues: 9
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Lists
- awesome-db - BlinkDB - BlinkDB: Sub-Second Approximate Queries on Very Large Data [website] (http://blinkdb.cs.berkeley.edu/) (Scala)
- awesome-db - BlinkDB - BlinkDB: Sub-Second Approximate Queries on Very Large Data [website] (http://blinkdb.cs.berkeley.edu/) (Scala)
- awesome-database - BlinkDB - BlinkDB: Sub-Second Approximate Queries on Very Large Data [Website] (http://blinkdb.cs.berkeley.edu/). (#SCALA) (Scala / Embeddable engines <a name="embeddable-engines"></a>)
README
![BlinkDB](http://blinkdb.org/figures/blinkdb-logo-withaffiliations.png)
#### Queries with Bounded Errors and Bounded Response Times on Very Large DataBlinkDB is a large-scale data warehouse system built on Shark and Spark and is designed to be
compatible with Apache Hive. It can answer HiveQL queries up to 200-300 times faster than Hive
by executing them on user-specified samples of data and providing approximate answers that are
augmented with meaningful error bars. BlinkDB 0.1.0 is an alpha developer release that supports
creating/deleting samples on any input table and/or materialized view and executing approximate
HiveQL queries with those aggregates that have statistical closed forms (i.e., AVG, SUM, COUNT,
VAR and STDEV).#### BlinkDB requires:
* Scala 2.10.x
* Spark 0.9.x### For current documentation, see the [BlinkDB Wiki](https://github.com/sameeragarwal/blinkdb/wiki).
### For more information about the BlinkDB Project, see the [BlinkDB Website](http://blinkdb.cs.berkeley.edu).