Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/hydradatabase/columnar
Postgres-native columnar storage extension
https://github.com/hydradatabase/columnar
data-warehouse datawarehouse postgres postgresql postgresql-extension
Last synced: 2 days ago
JSON representation
Postgres-native columnar storage extension
- Host: GitHub
- URL: https://github.com/hydradatabase/columnar
- Owner: hydradatabase
- License: apache-2.0
- Created: 2022-07-22T16:42:51.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2025-02-10T14:58:06.000Z (4 days ago)
- Last Synced: 2025-02-10T15:43:38.315Z (4 days ago)
- Topics: data-warehouse, datawarehouse, postgres, postgresql, postgresql-extension
- Language: C
- Homepage:
- Size: 4.52 MB
- Stars: 2,893
- Watchers: 32
- Forks: 80
- Open Issues: 28
-
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- License: LICENSE
Awesome Lists containing this project
- stars - hydradatabase/columnar - native columnar storage extension (HarmonyOS / Windows Manager)
README
[![Hydra - the open source data warehouse](https://raw.githubusercontent.com/hydradatabase/hydra/main/.images/header.png)](https://hydra.so)
# šš¤ Hydra Columnar
Hydra Colummnar is open source, column-oriented Postgres.
## ļæ½Ā Run locally
The [Docker image](https://github.com/hydradatabase/hydra/pkgs/container/hydra)Ā is a drop-in replacement forĀ [postgres Docker image](https://hub.docker.com/_/postgres).
You can try out Hydra locally usingĀ [docker-compose](https://docs.docker.com/compose/).
```bash
git clone https://github.com/hydradatabase/columnar && cd columnar
cp .env.example .env
docker compose up
psql postgres://postgres:[email protected]:5432
```## ļ潚ŖĀ Benchmarks
Benchmarks were run on a c6a.4xlarge (16 vCPU, 32 GB RAM) with 500 GB of GP2 storage.
Results in seconds, smaller is better.![Hydra - the open source data warehouse](https://raw.githubusercontent.com/hydradatabase/hydra/main/.images/ReadmeBenchmarks.png)
[Review Clickbench](https://tinyurl.com/clickbench)Ā for comprehensive results and the list of 42 queries tested.
This benchmark represents typical workload in the following areas: clickstream and traffic analysis, web analytics, machine-generated data, structured logs, and events data.
![Hydra - the open source data warehouse](https://raw.githubusercontent.com/hydradatabase/columnar/main/.images/FasterthanPG.png)
For our continuous benchmark results, seeĀ [BENCHMARKS](https://github.com/hydradatabase/hydra/blob/main/BENCHMARKS.md).
## šĀ FAQs
View complete answers in [our documentation](https://columnar.docs.hydra.so/concepts/faqs).
### Q: Why is Hydra so fast?
A: Columnar storage, query parallelization, vectorized execution, column-level caching, and tuning Postgres.
### Q: How do I start using the columnar format on Postgres?
A: Data is loaded into columnar format by default. Use Postgres normally.
### Q: What operations is Hydra meant for? Provide examples.
A: Aggregates (COUNT, SUM, AVG), WHERE clauses, bulk INSERTS, UPDATE, DELETEā¦
### Q: What is columnar not meant for?
A: Frequent large updates, small transactionsā¦
### Q: What Postgres features are unsupported on columnar?
* Logical replication.
* Columnar tables donāt typically use indexes, only supporting btree and hash indexes, and their associated constraints.### Q: Is Hydra a fork?
A: Hydra is a Postgres extension, not a fork. Hydra makes use of `tableam` (table access method API), which was added in Postgres 12 released in 2019.
## š¤Ā Community and Status
- [x] Alpha: Limited to select design partners
- [x] Public Alpha: available for use, but with noted frictions
- [x] Hydra 1.0 beta: Stable for non-enterprise use cases
- [x] Hydra 1.0 Release: Generally Available (GA) and ready for production use## :technologist: Developer resources
- [CHANGELOG](https://github.com/hydradatabase/columnar/blob/main/CHANGELOG.md) for details of recent changes
- [GitHub Issues](https://github.com/hydradatabase/columnar/issues)Ā for bugs and missing features
- [Docs](https://columnar.docs.hydra.so/)Ā for Hydra features and warehouse ops## šĀ License
Hydra is only possible by building on the shoulders of giants.
The code in this repo is licensed under:
- [AGPL 3.0](https://github.com/hydradatabase/hydra/tree/main/columnar/LICENSE)Ā forĀ [Hydra Columnar](https://github.com/hydradatabase/hydra/tree/main/columnar)
- All other code isĀ [Apache 2.0](https://github.com/hydradatabase/hydra/blob/main/LICENSE)The docker image is built on theĀ [Postgres docker image](https://hub.docker.com/_/postgres/), which contains a large number of open source projects, including:
- Postgres -Ā [the Postgres license](https://www.postgresql.org/about/licence/)
- Debian or Alpine Linux image, depending on the image used
- Hydra includes the following additional software in the image:
- multicorn - BSD license
- mysql_fdw - MIT-style license
- parquet_s3_fdw - MIT-style license
- pgsql-http - MIT licenseAs for any pre-built image usage, it is the image user's responsibility to ensure that any use of this image complies with any relevant licenses for all software contained within.