{"id":29420730,"url":"https://github.com/buchananja/dpyp","last_synced_at":"2026-04-18T17:32:40.593Z","repository":{"id":219001983,"uuid":"747914759","full_name":"buchananja/dpyp","owner":"buchananja","description":"A convenience tool for small-scale data pipelines in Python","archived":false,"fork":false,"pushed_at":"2024-07-03T11:19:56.000Z","size":4493,"stargazers_count":1,"open_issues_count":2,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-07-05T00:48:10.058Z","etag":null,"topics":["data","data-analysis","data-cleaning","data-engineering","data-pipeline","data-preprocessing","data-processing","data-science","pandas","pipeline"],"latest_commit_sha":null,"homepage":"https://pypi.org/project/dpyp/","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/buchananja.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":"CONTRIBUTING.md","funding":null,"license":"LICENSE.md","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2024-01-24T22:17:58.000Z","updated_at":"2024-07-13T13:56:22.000Z","dependencies_parsed_at":"2024-02-05T19:54:58.721Z","dependency_job_id":"80668473-3fc5-4247-bde8-8aeaad7ae8e7","html_url":"https://github.com/buchananja/dpyp","commit_stats":null,"previous_names":["buchananja/pyping-module","buchananja/pyped","buchananja/dpypr"],"tags_count":1,"template":false,"template_full_name":null,"purl":"pkg:github/buchananja/dpyp","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/buchananja%2Fdpyp","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/buchananja%2Fdpyp/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/buchananja%2Fdpyp/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/buchananja%2Fdpyp/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/buchananja","download_url":"https://codeload.github.com/buchananja/dpyp/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/buchananja%2Fdpyp/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":264923905,"owners_count":23683835,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["data","data-analysis","data-cleaning","data-engineering","data-pipeline","data-preprocessing","data-processing","data-science","pandas","pipeline"],"created_at":"2025-07-12T02:12:42.069Z","updated_at":"2026-04-18T17:32:35.573Z","avatar_url":"https://github.com/buchananja.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# **dpyp**\n*A convenience tool for small-scale data pipelines in Python*\n\n\u003cp align = \"center\"\u003e\n  \u003cimg src = \"docs/images/dpyp_logo.svg\" alt = \"image\" width = \"350\" height = \"350\"\u003e\n\u003c/p\u003e\n\n## About\ndpyp is a data-pipeline convenience tool containing functionality for reading and writing batches, cleaning data, diagnosing pipelines, manipulating text, and calculating fields in Python.\n\n[PyPI](https://pypi.org/project/dpyp/)\n\n\n## Usage\n- dpyp consists of seven modules: 'calculate', 'clean', 'diagnose', 'read', 'text', 'write', and 'transform'.\n- Designed for use in small-scale Python pipelines with an emphasis on batch-processing via 'data-dictionaries'.\n- Batch processing of data via dictionaries allows iterative functions to improve readability and ease of use.\n- Built using a combination of base Python and pandas for writing robust small-scale pipelines with text manipulation capabilities.\n\n## Dependencies\n- pandas\n- pyarrow\n- numpy\n\n## Installation\n```bash\npip install dpyp\n```\n\n## License\nSee [LICENSE.md](LICENSE.md)\n\n## Contributing\nSee [CONTRIBUTING.md](CONTRIBUTING.md)\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fbuchananja%2Fdpyp","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fbuchananja%2Fdpyp","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fbuchananja%2Fdpyp/lists"}