Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/hydradatabase/hydra
Hydra: Column-oriented Postgres. Add scalable analytics to your project in minutes.
https://github.com/hydradatabase/hydra
data-warehouse datawarehouse postgres postgresql postgresql-extension
Last synced: about 17 hours ago
JSON representation
Hydra: Column-oriented Postgres. Add scalable analytics to your project in minutes.
- Host: GitHub
- URL: https://github.com/hydradatabase/hydra
- Owner: hydradatabase
- License: apache-2.0
- Created: 2022-07-22T16:42:51.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2024-10-07T17:19:52.000Z (2 months ago)
- Last Synced: 2024-11-27T14:04:50.186Z (15 days ago)
- Topics: data-warehouse, datawarehouse, postgres, postgresql, postgresql-extension
- Language: C
- Homepage: https://www.hydra.so
- Size: 4.52 MB
- Stars: 2,853
- Watchers: 31
- Forks: 79
- Open Issues: 20
-
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- License: LICENSE
Awesome Lists containing this project
- stars - hydradatabase/hydra - oriented Postgres. Add scalable analytics to your project in minutes. (HarmonyOS / Windows Manager)
- awesome-repositories - hydradatabase/hydra - Hydra: Column-oriented Postgres. Add scalable analytics to your project in minutes. (C)
README
[![Hydra - the open source data warehouse](https://raw.githubusercontent.com/hydradatabase/hydra/main/.images/header.png)](https://hydra.so)
# šš¤ Hydra
**Hydra is open source, column-oriented Postgres. You can query billions of rows instantly on Postgres without code changes. Parallelized analytics in minutes, not weeks.**
## šĀ Quick start
Try theĀ [Hydra Free Tier](https://dashboard.hydra.so/signup)Ā to create a column-oriented Postgres instance. Then connect to it with your preferred Postgres client (psql, dbeaver, etc).
Alternatively, you can run HydraĀ [locally](#run-locally).
## šŖĀ Benchmarks - fastest Postgres aggregates on earth
Benchmarks were run on a c6a.4xlarge (16 vCPU, 32 GB RAM) with 500 GB of GP2 storage.
Results in seconds, smaller is better.![Hydra - the open source data warehouse](https://raw.githubusercontent.com/hydradatabase/hydra/main/.images/ReadmeBenchmarks.png)
[Review Clickbench](https://tinyurl.com/clickbench)Ā for comprehensive results and the list of 42 queries tested.
This benchmark represents typical workload in the following areas: clickstream and traffic analysis, web analytics, machine-generated data, structured logs, and events data.
![Hydra - the open source data warehouse](https://raw.githubusercontent.com/hydradatabase/hydra/main/.images/FasterthanPG.png)
### For our continuous benchmark results, seeĀ [BENCHMARKS](https://github.com/hydradatabase/hydra/blob/main/BENCHMARKS.md).
## šĀ FAQs
View complete answers in [our documentation](https://docs.hydra.so/concepts/faqs).
### Q: Why is Hydra so fast?
A: Columnar storage, query parallelization, vectorized execution, column-level caching, and tuning Postgres.
### Q: How do I start using the columnar format on Postgres?
A: Data is loaded into columnar format by default. Use Postgres normally.
### Q: What operations is Hydra meant for? Provide examples.
A: Aggregates (COUNT, SUM, AVG), WHERE clauses, bulk INSERTS, UPDATE, DELETEā¦
### Q: What is columnar not meant for?
A: Frequent large updates, small transactionsā¦
### Q: What Postgres features are unsupported on columnar?
* Logical replication.
* Columnar tables donāt typically use indexes, only supporting btree and hash indexes, and their associated constraints.### Q: Is Hydra a fork?
A: Hydra is a Postgres extension, not a fork. Hydra makes use of `tableam` (table access method API), which was added in Postgres 12 released in 2019.
## š¤Ā Community and Status
- [x] Alpha: Limited to select design partners
- [x] Public Alpha: available for use, but with noted frictions
- [x] Hydra 1.0 beta: Stable for non-enterprise use cases
- [x] Hydra 1.0 Release: Generally Available (GA) and ready for production use## :technologist: Developer resources
- [CHANGELOG](https://github.com/hydradatabase/hydra/blob/main/CHANGELOG.md) for details of recent changes
- [GitHub Issues](https://github.com/hydradatabase/hydra/issues)Ā for bugs and missing features
- [Discord](https://hydra.so/discord)Ā discussion with the Community and Hydra team
- [Docs](https://docs.hydra.so/)Ā for Hydra features and warehouse ops## š»Ā Run locally
The HydraĀ [Docker image](https://github.com/hydradatabase/hydra/pkgs/container/hydra)Ā is a drop-in replacement forĀ [postgres Docker image](https://hub.docker.com/_/postgres).
You can try out Hydra locally usingĀ [docker-compose](https://docs.docker.com/compose/).
```bash
git clone https://github.com/hydradatabase/hydra && cd hydra
cp .env.example .env
docker compose up
psql postgres://postgres:[email protected]:5432
```## šĀ License
Hydra is only possible by building on the shoulders of giants.
The code in this repo is licensed under:
- [AGPL 3.0](https://github.com/hydradatabase/hydra/tree/main/columnar/LICENSE)Ā forĀ [Hydra Columnar](https://github.com/hydradatabase/hydra/tree/main/columnar)
- All other code isĀ [Apache 2.0](https://github.com/hydradatabase/hydra/blob/main/LICENSE)The docker image is built on theĀ [Postgres docker image](https://hub.docker.com/_/postgres/), which contains a large number of open source projects, including:
- Postgres -Ā [the Postgres license](https://www.postgresql.org/about/licence/)
- Debian or Alpine Linux image, depending on the image used
- Hydra includes the following additional software in the image:
- multicorn - BSD license
- mysql_fdw - MIT-style license
- parquet_s3_fdw - MIT-style license
- pgsql-http - MIT licenseAs for any pre-built image usage, it is the image user's responsibility to ensure that any use of this image complies with any relevant licenses for all software contained within.