{"id":18080599,"url":"https://github.com/anskarl/druid-docker-cluster","last_synced_at":"2025-08-30T23:37:32.287Z","repository":{"id":81364032,"uuid":"188568109","full_name":"anskarl/druid-docker-cluster","owner":"anskarl","description":"Dockerized Apache Druid for testing and development","archived":false,"fork":false,"pushed_at":"2020-01-24T15:39:15.000Z","size":1908,"stargazers_count":23,"open_issues_count":1,"forks_count":9,"subscribers_count":2,"default_branch":"master","last_synced_at":"2025-04-12T14:22:45.648Z","etag":null,"topics":["docker","docker-compose","docker-image","druid"],"latest_commit_sha":null,"homepage":null,"language":"Shell","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"apache-2.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/anskarl.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE.TXT","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null}},"created_at":"2019-05-25T13:13:32.000Z","updated_at":"2023-11-09T09:42:02.000Z","dependencies_parsed_at":null,"dependency_job_id":"0ff08cc4-ff32-472c-ab4c-802b6e22712d","html_url":"https://github.com/anskarl/druid-docker-cluster","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/anskarl/druid-docker-cluster","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/anskarl%2Fdruid-docker-cluster","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/anskarl%2Fdruid-docker-cluster/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/anskarl%2Fdruid-docker-cluster/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/anskarl%2Fdruid-docker-cluster/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/anskarl","download_url":"https://codeload.github.com/anskarl/druid-docker-cluster/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/anskarl%2Fdruid-docker-cluster/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":265297036,"owners_count":23742585,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["docker","docker-compose","docker-image","druid"],"created_at":"2024-10-31T13:08:58.689Z","updated_at":"2025-07-14T13:06:57.414Z","avatar_url":"https://github.com/anskarl.png","language":"Shell","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Dockerized Apache Druid cluster\n\nThis project demostrates how you can setup a Dockerized example/development [Apache Druid](http://druid.io/) cluster.\n\n\nThe cluster is being composed of the following components:\n\n- S3 Compatible Object Storage [**MinIO**](https://min.io) for Deep storage\n- [**PostgreSQL**](https://www.postgresql.org/) for metadata storage \n- [**Zookeeper**](https://zookeeper.apache.org/) for internal service discovery, coordination, and leader election\n- [**Apache Druid**](http://druid.io/) platform:\n  \n  * **Middle Manager** to handle the ingestion of data into the cluster\n  * **Historical** to handle the storage and querying on “historical” data\n  * **Broker** to receive queries from external clients\n  * **Coordinator** to assign segments to Historical nodes\n  * **Overlord** to assign ingestion tasks to Middle Managers and to coordinate segment publishing\n  * **Router** provides a unified API gateway in front of Brokers, Overlords and Coordinators\n\n### Instructions to build Druid image\n\n```\nmake image\n```\n\nor by using docker-compose\n\n```\ndocker-compose build\n```\n\nYou can also specify the version of Druid to build, for example:\n\n```\nmake DRUID_VERSION=0.14.1-incubating image\n```\n\nor by using docker-compose\n\n```\ndocker-compose build --build-arg ARG_DRUID_VERSION=0.14.1-incubating\n```\n\n### Run the cluster\n\n```\ndocker-compose up\n```\n\nor to run in the backgroumd:\n\n```\ndocker-compose up -d\n```\n\nAfter a while the Druid console should be available in [http://localhost:8888](http://localhost:8888)\n\n\n### Load example data\n\nFor example data we are using a subset of the [NYC Taxi \u0026 Limousine Commission - Trip Record Data](https://www1.nyc.gov/site/tlc/about/tlc-trip-record-data.page), specifically from months 2015-01 to 2015-03.\n\n```\ncd dataset\n./03-load_to_druid.sh\n```\n\nPlease note that you can download data for different months and adjust the sample size by adjusting the parameters of `./dataset/01-download.sh` and `./dataset/02-create_sample_tripdata.sh`.\n\nThe schema of the dataset and the indexing task is being defined in `./dataset/yellow_tripdata-index.json`\n\n...enjoy :)","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fanskarl%2Fdruid-docker-cluster","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fanskarl%2Fdruid-docker-cluster","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fanskarl%2Fdruid-docker-cluster/lists"}