{"id":50584857,"url":"https://github.com/kreuzberg-dev/kreuzberg-cloud","last_synced_at":"2026-06-05T05:02:49.446Z","repository":{"id":356875110,"uuid":"1103412415","full_name":"kreuzberg-dev/kreuzberg-cloud","owner":"kreuzberg-dev","description":"Cloud-native document extraction platform — SaaS at kreuzberg.dev or self-host on any Kubernetes cluster. 90+ formats, REST API, webhooks. Built on Kreuzberg.","archived":false,"fork":false,"pushed_at":"2026-06-01T17:45:20.000Z","size":17905,"stargazers_count":14,"open_issues_count":5,"forks_count":0,"subscribers_count":0,"default_branch":"development","last_synced_at":"2026-06-01T18:23:55.415Z","etag":null,"topics":["api","axum","busl","cloud-native","document-extraction","document-processing","helm","kreuzberg","kubernetes","microservices","nats","nextjs","ocr","pdf","postgresql","rust","saas","self-hosted","text-extraction"],"latest_commit_sha":null,"homepage":"https://kreuzberg.dev","language":"Rust","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"other","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/kreuzberg-dev.png","metadata":{"files":{"readme":"README.md","changelog":"CHANGELOG.md","contributing":"CONTRIBUTING.md","funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":".github/CODEOWNERS","security":"SECURITY.md","support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null,"notice":null,"maintainers":null,"copyright":null,"agents":null,"dco":null,"cla":null}},"created_at":"2025-11-24T20:52:51.000Z","updated_at":"2026-06-01T17:45:25.000Z","dependencies_parsed_at":null,"dependency_job_id":null,"html_url":"https://github.com/kreuzberg-dev/kreuzberg-cloud","commit_stats":null,"previous_names":["kreuzberg-dev/kreuzberg-cloud"],"tags_count":1,"template":false,"template_full_name":null,"purl":"pkg:github/kreuzberg-dev/kreuzberg-cloud","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/kreuzberg-dev%2Fkreuzberg-cloud","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/kreuzberg-dev%2Fkreuzberg-cloud/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/kreuzberg-dev%2Fkreuzberg-cloud/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/kreuzberg-dev%2Fkreuzberg-cloud/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/kreuzberg-dev","download_url":"https://codeload.github.com/kreuzberg-dev/kreuzberg-cloud/tar.gz/refs/heads/development","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/kreuzberg-dev%2Fkreuzberg-cloud/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":33930311,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-05-26T15:22:16.424Z","status":"online","status_checked_at":"2026-06-05T02:00:06.157Z","response_time":120,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["api","axum","busl","cloud-native","document-extraction","document-processing","helm","kreuzberg","kubernetes","microservices","nats","nextjs","ocr","pdf","postgresql","rust","saas","self-hosted","text-extraction"],"created_at":"2026-06-05T05:02:48.693Z","updated_at":"2026-06-05T05:02:49.441Z","avatar_url":"https://github.com/kreuzberg-dev.png","language":"Rust","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Kreuzberg Cloud\n\n\u003cimg width=\"1500\" height=\"500\" alt=\"x- Banner\" src=\"https://github.com/user-attachments/assets/af332cf6-79a0-430f-94c6-44cd1fecbe47\" /\u003e\n\n\u003cdiv align=\"center\"\u003e\n  \u003ca href=\"https://kreuzberg.dev/login\"\u003eSaaS\u003c/a\u003e •\n  \u003ca href=\"https://docs.kreuzberg.cloud\"\u003eDocs\u003c/a\u003e •\n  \u003ca href=\"https://github.com/kreuzberg-dev/kreuzberg-cloud/blob/main/LICENSE\"\u003eLicense (BUSL-1.1)\u003c/a\u003e •\n  \u003ca href=\"https://github.com/kreuzberg-dev/kreuzberg-cloud/actions/workflows/ci-rust.yaml\"\u003e\u003cimg src=\"https://img.shields.io/github/actions/workflow/status/kreuzberg-dev/kreuzberg-cloud/ci-rust.yaml?label=CI\u0026color=007ec6\" alt=\"CI\"\u003e\u003c/a\u003e\n\u003c/div\u003e\n\nREST API and webhook service for extracting text, metadata, tables, and code intelligence from documents. Wraps the [Kreuzberg](https://github.com/kreuzberg-dev/kreuzberg) extraction core with multi-tenant project isolation, presigned uploads, signed webhook delivery, and Stripe-metered billing.\n\n10,000 free pages on signup at [kreuzberg.dev/login](https://kreuzberg.dev/login).\n\n## Features\n\n- REST API with presigned uploads, polling, and bulk job submission.\n- Signed webhook delivery (HMAC-SHA256) for completion and failure events.\n- Multi-tenant project isolation enforced at the PostgreSQL row level.\n- Code-aware extraction across 306 programming languages via tree-sitter.\n- BUSL-1.1 source available; self-host on Kubernetes via Helm or use the managed service.\n\n## Quick Start\n\n```bash\ncurl -X POST https://api.kreuzberg.dev/v1/extract \\\n  -H \"Authorization: Bearer kz_...\" \\\n  -F \"file=@document.pdf\" \\\n  -F 'webhook={\"url\":\"\"}'\n```\n\n### SDKs\n\n```bash\npip install kreuzberg-cloud-sdk                                # Python\npnpm add @kreuzberg/cloud                                      # TypeScript\ngo get github.com/kreuzberg-dev/kreuzberg-cloud-sdk/go         # Go\npub add kreuzberg_cloud_sdk                                    # Dart\n```\n\nAPI reference and language guides: [docs.kreuzberg.cloud](https://docs.kreuzberg.cloud).\n\n## Self-hosting\n\nKreuzberg Cloud is available as a managed service at [kreuzberg.dev/login](https://kreuzberg.dev/login), and as open-source software under the BUSL-1.1 license for on-premise and bring-your-own-cloud deployments. This repository contains everything needed to run Kreuzberg Cloud in your infrastructure: Helm charts, microservices, database schema, and observability dashboards. See [`docs/self-hosting.md`](docs/self-hosting.md) for prerequisites, quick-start deployment instructions, authentication options, and customization checkpoints.\n\n## Architecture\n\n| Service | Purpose |\n|---------|---------|\n| **api** | Public REST API — job submission, polling, results, usage |\n| **backend** | Management API — projects, members, API keys, webhooks |\n| **billing** | Stripe integration — metered billing and quota enforcement |\n| **worker** | Document processing via Kreuzberg; scales to zero with KEDA |\n| **webhook** | Signed HTTP delivery of completion/failure events |\n\nDetails: [`docs/concepts/architecture.md`](docs/concepts/architecture.md). OpenAPI spec: [`services/api/spec/openapi.json`](services/api/spec/openapi.json). Security policy: [SECURITY.md](SECURITY.md).\n\n## Part of Kreuzberg.dev\n\n- [Kreuzberg](https://github.com/kreuzberg-dev/kreuzberg) — document intelligence: text, tables, metadata from 91+ formats with optional OCR. Self-host counterpart to Kreuzberg Cloud.\n- [kreuzcrawl](https://github.com/kreuzberg-dev/kreuzcrawl) — web crawling and scraping with HTML→Markdown and headless-Chrome fallback.\n- [html-to-markdown](https://github.com/kreuzberg-dev/html-to-markdown) — fast, lossless HTML→Markdown engine.\n- [liter-llm](https://github.com/kreuzberg-dev/liter-llm) — universal LLM API client with native bindings for 14 languages and 143 providers.\n- [tree-sitter-language-pack](https://github.com/kreuzberg-dev/tree-sitter-language-pack) — tree-sitter grammars and code-intelligence primitives.\n- [Discord](https://discord.gg/xt9WY3GnKR) — community, roadmap, announcements.\n\n## License\n\n[Business Source License 1.1](LICENSE). The source is available for review and non-production use. Production use requires a commercial license or use of the Kreuzberg-operated managed service.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fkreuzberg-dev%2Fkreuzberg-cloud","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fkreuzberg-dev%2Fkreuzberg-cloud","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fkreuzberg-dev%2Fkreuzberg-cloud/lists"}