{"id":48025261,"url":"https://github.com/benzsevern/goldenpipe","last_synced_at":"2026-04-04T13:49:59.469Z","repository":{"id":347110273,"uuid":"1192053481","full_name":"benzsevern/goldenpipe","owner":"benzsevern","description":"Golden Suite orchestrator — chains validation, transformation, and entity resolution. 4 MCP tools on Smithery.","archived":false,"fork":false,"pushed_at":"2026-04-03T14:51:02.000Z","size":131,"stargazers_count":0,"open_issues_count":1,"forks_count":0,"subscribers_count":0,"default_branch":"main","last_synced_at":"2026-04-03T18:35:26.628Z","etag":null,"topics":["a2a","agent","cli","data-engineering","data-orchestration","data-pipeline","data-quality","etl","fastapi","golden-suite","mcp","mcp-server","orchestration","pipeline","pluggable","polars","python","remote-mcp","tui","yaml"],"latest_commit_sha":null,"homepage":"https://smithery.ai/servers/benzsevern/goldenpipe","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/benzsevern.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null,"notice":null,"maintainers":null,"copyright":null,"agents":null,"dco":null,"cla":null}},"created_at":"2026-03-25T21:06:40.000Z","updated_at":"2026-04-03T14:51:10.000Z","dependencies_parsed_at":null,"dependency_job_id":null,"html_url":"https://github.com/benzsevern/goldenpipe","commit_stats":null,"previous_names":["benzsevern/goldenpipe"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/benzsevern/goldenpipe","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/benzsevern%2Fgoldenpipe","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/benzsevern%2Fgoldenpipe/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/benzsevern%2Fgoldenpipe/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/benzsevern%2Fgoldenpipe/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/benzsevern","download_url":"https://codeload.github.com/benzsevern/goldenpipe/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/benzsevern%2Fgoldenpipe/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":31402276,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-04-04T10:20:44.708Z","status":"ssl_error","status_checked_at":"2026-04-04T10:20:06.846Z","response_time":60,"last_error":"SSL_read: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["a2a","agent","cli","data-engineering","data-orchestration","data-pipeline","data-quality","etl","fastapi","golden-suite","mcp","mcp-server","orchestration","pipeline","pluggable","polars","python","remote-mcp","tui","yaml"],"created_at":"2026-04-04T13:49:59.205Z","updated_at":"2026-04-04T13:49:59.429Z","avatar_url":"https://github.com/benzsevern.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"\u003c!-- mcp-name: io.github.benzsevern/goldenpipe --\u003e\n# GoldenPipe\n\n**Golden Suite orchestrator** -- Check quality, fix issues, deduplicate records. One command.\n\n[![PyPI](https://img.shields.io/pypi/v/goldenpipe?color=d4a017)](https://pypi.org/project/goldenpipe/)\n[![CI](https://github.com/benzsevern/goldenpipe/actions/workflows/test.yml/badge.svg)](https://github.com/benzsevern/goldenpipe/actions/workflows/test.yml)\n[![codecov](https://codecov.io/gh/benzsevern/goldenpipe/graph/badge.svg)](https://codecov.io/gh/benzsevern/goldenpipe)\n[![Downloads](https://static.pepy.tech/badge/goldenpipe/month)](https://pepy.tech/project/goldenpipe)\n[![Python 3.11+](https://img.shields.io/badge/python-3.11%2B-blue)](https://python.org)\n[![License: MIT](https://img.shields.io/badge/license-MIT-green)](LICENSE)\n[![Docs](https://img.shields.io/badge/docs-benzsevern.github.io%2Fgoldenpipe-d4a017)](https://benzsevern.github.io/goldenpipe/)\n[![DQBench Pipeline](https://img.shields.io/badge/DQBench%20Pipeline-88.07-gold)](https://github.com/benzsevern/dqbench)\n[![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/benzsevern/goldenpipe/blob/main/scripts/goldenpipe_demo.ipynb)\n\n## What It Does\n\n```\nRaw Data\n  | GoldenCheck   -- profile \u0026 discover quality issues\n  | GoldenFlow    -- fix issues, standardize, reshape\n  | GoldenMatch   -- deduplicate, match, create golden records\n  v\nGolden Records\n```\n\nGoldenPipe orchestrates the full pipeline with adaptive logic:\n- **Skips** transformation if no quality issues found\n- **Routes** to privacy-preserving matching if sensitive fields detected\n- **Reports** reasoning for every decision\n\n## Install\n\n```bash\npip install goldenpipe\n```\n\n## Quick Start\n\n```python\nimport goldenpipe as gp\n\nresult = gp.run(\"customers.csv\")\n\nprint(result.status)        # \"success\"\nprint(result.check)         # Quality findings\nprint(result.transform)     # What was fixed\nprint(result.match)         # Deduplicated clusters\nprint(result.reasoning)     # Why each decision was made\n```\n\n## CLI\n\n```bash\ngoldenpipe run customers.csv                # Full pipeline\ngoldenpipe run customers.csv --verbose      # Show reasoning\ngoldenpipe run customers.csv --skip-flow    # Check + Match only\ngoldenpipe run customers.csv --strategy pprl  # Force privacy mode\ngoldenpipe run customers.csv -o golden.csv  # Save golden records\n```\n\n## Remote MCP Server\n\nGoldenPipe is available as a hosted MCP server on [Smithery](https://smithery.ai/servers/benzsevern/goldenpipe) — connect from any MCP client without installing anything.\n\n**Claude Desktop / Claude Code:**\n```json\n{\n  \"mcpServers\": {\n    \"goldenpipe\": {\n      \"url\": \"https://goldenpipe-mcp-production.up.railway.app/mcp/\"\n    }\n  }\n}\n```\n\n**Local server:**\n```bash\npip install goldenpipe[mcp]\ngoldenpipe mcp-serve\n```\n\n4 tools available: list pipeline stages, validate wiring, run full check-transform-match pipeline, explain configs.\n\n## Part of the Golden Suite\n\n| Tool | Purpose | Install |\n|------|---------|---------|\n| [GoldenCheck](https://github.com/benzsevern/goldencheck) | Validate \u0026 profile data quality | `pip install goldencheck` |\n| [GoldenFlow](https://github.com/benzsevern/goldenflow) | Transform \u0026 standardize data | `pip install goldenflow` |\n| [GoldenMatch](https://github.com/benzsevern/goldenmatch) | Deduplicate \u0026 match records | `pip install goldenmatch` |\n| [GoldenPipe](https://github.com/benzsevern/goldenpipe) | Orchestrate the full pipeline | `pip install goldenpipe` |\n\n## License\n\nMIT\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fbenzsevern%2Fgoldenpipe","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fbenzsevern%2Fgoldenpipe","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fbenzsevern%2Fgoldenpipe/lists"}