Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/hydradatabase/hydra

Hydra: Column-oriented Postgres. Add scalable analytics to your project in minutes.
https://github.com/hydradatabase/hydra

data-warehouse datawarehouse postgres postgresql postgresql-extension

Last synced: about 17 hours ago
JSON representation

Hydra: Column-oriented Postgres. Add scalable analytics to your project in minutes.

Awesome Lists containing this project

README

        

[![Hydra - the open source data warehouse](https://raw.githubusercontent.com/hydradatabase/hydra/main/.images/header.png)](https://hydra.so)

# šŸ˜šŸ¤˜ Hydra

**Hydra is open source, column-oriented Postgres. You can query billions of rows instantly on Postgres without code changes. Parallelized analytics in minutes, not weeks.**

## šŸš€Ā Quick start

Try theĀ [Hydra Free Tier](https://dashboard.hydra.so/signup)Ā to create a column-oriented Postgres instance. Then connect to it with your preferred Postgres client (psql, dbeaver, etc).

Alternatively, you can run HydraĀ [locally](#run-locally).

## šŸ’ŖĀ Benchmarks - fastest Postgres aggregates on earth

Benchmarks were run on a c6a.4xlarge (16 vCPU, 32 GB RAM) with 500 GB of GP2 storage.
Results in seconds, smaller is better.

![Hydra - the open source data warehouse](https://raw.githubusercontent.com/hydradatabase/hydra/main/.images/ReadmeBenchmarks.png)

[Review Clickbench](https://tinyurl.com/clickbench)Ā for comprehensive results and the list of 42 queries tested.

This benchmark represents typical workload in the following areas: clickstream and traffic analysis, web analytics, machine-generated data, structured logs, and events data.

![Hydra - the open source data warehouse](https://raw.githubusercontent.com/hydradatabase/hydra/main/.images/FasterthanPG.png)

### For our continuous benchmark results, seeĀ [BENCHMARKS](https://github.com/hydradatabase/hydra/blob/main/BENCHMARKS.md).

## šŸ™‹Ā FAQs

View complete answers in [our documentation](https://docs.hydra.so/concepts/faqs).

### Q: Why is Hydra so fast?

A: Columnar storage, query parallelization, vectorized execution, column-level caching, and tuning Postgres.

### Q: How do I start using the columnar format on Postgres?

A: Data is loaded into columnar format by default. Use Postgres normally.

### Q: What operations is Hydra meant for? Provide examples.

A: Aggregates (COUNT, SUM, AVG), WHERE clauses, bulk INSERTS, UPDATE, DELETEā€¦

### Q: What is columnar not meant for?

A: Frequent large updates, small transactionsā€¦

### Q: What Postgres features are unsupported on columnar?

* Logical replication.
* Columnar tables donā€™t typically use indexes, only supporting btree and hash indexes, and their associated constraints.

### Q: Is Hydra a fork?

A: Hydra is a Postgres extension, not a fork. Hydra makes use of `tableam` (table access method API), which was added in Postgres 12 released in 2019.

## šŸ¤Ā Community and Status

- [x] Alpha: Limited to select design partners
- [x] Public Alpha: available for use, but with noted frictions
- [x] Hydra 1.0 beta: Stable for non-enterprise use cases
- [x] Hydra 1.0 Release: Generally Available (GA) and ready for production use

## :technologist: Developer resources

- [CHANGELOG](https://github.com/hydradatabase/hydra/blob/main/CHANGELOG.md) for details of recent changes
- [GitHub Issues](https://github.com/hydradatabase/hydra/issues)Ā for bugs and missing features
- [Discord](https://hydra.so/discord)Ā discussion with the Community and Hydra team
- [Docs](https://docs.hydra.so/)Ā for Hydra features and warehouse ops

## šŸ’»Ā Run locally

The HydraĀ [Docker image](https://github.com/hydradatabase/hydra/pkgs/container/hydra)Ā is a drop-in replacement forĀ [postgres Docker image](https://hub.docker.com/_/postgres).

You can try out Hydra locally usingĀ [docker-compose](https://docs.docker.com/compose/).

```bash
git clone https://github.com/hydradatabase/hydra && cd hydra
cp .env.example .env
docker compose up
psql postgres://postgres:[email protected]:5432
```

## šŸ“Ā License

Hydra is only possible by building on the shoulders of giants.

The code in this repo is licensed under:

- [AGPL 3.0](https://github.com/hydradatabase/hydra/tree/main/columnar/LICENSE)Ā forĀ [Hydra Columnar](https://github.com/hydradatabase/hydra/tree/main/columnar)
- All other code isĀ [Apache 2.0](https://github.com/hydradatabase/hydra/blob/main/LICENSE)

The docker image is built on theĀ [Postgres docker image](https://hub.docker.com/_/postgres/), which contains a large number of open source projects, including:

- Postgres -Ā [the Postgres license](https://www.postgresql.org/about/licence/)
- Debian or Alpine Linux image, depending on the image used
- Hydra includes the following additional software in the image:
- multicorn - BSD license
- mysql_fdw - MIT-style license
- parquet_s3_fdw - MIT-style license
- pgsql-http - MIT license

As for any pre-built image usage, it is the image user's responsibility to ensure that any use of this image complies with any relevant licenses for all software contained within.