{"id":29507943,"url":"https://github.com/divvun/divvun-pipeline","last_synced_at":"2025-10-24T23:44:31.428Z","repository":{"id":66192751,"uuid":"207825683","full_name":"divvun/divvun-pipeline","owner":"divvun","description":"Experimental asynchronous, parallel pipelines for linguistic resources","archived":false,"fork":false,"pushed_at":"2019-09-27T09:47:35.000Z","size":104,"stargazers_count":1,"open_issues_count":0,"forks_count":0,"subscribers_count":2,"default_branch":"master","last_synced_at":"2025-07-29T05:51:28.707Z","etag":null,"topics":["async","linguistics","parallel","pipeline","rust"],"latest_commit_sha":null,"homepage":"","language":"Rust","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/divvun.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null}},"created_at":"2019-09-11T13:57:43.000Z","updated_at":"2022-04-01T07:39:29.000Z","dependencies_parsed_at":"2023-02-25T06:45:56.310Z","dependency_job_id":null,"html_url":"https://github.com/divvun/divvun-pipeline","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/divvun/divvun-pipeline","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/divvun%2Fdivvun-pipeline","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/divvun%2Fdivvun-pipeline/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/divvun%2Fdivvun-pipeline/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/divvun%2Fdivvun-pipeline/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/divvun","download_url":"https://codeload.github.com/divvun/divvun-pipeline/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/divvun%2Fdivvun-pipeline/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":280883921,"owners_count":26407607,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-10-24T02:00:06.418Z","response_time":73,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["async","linguistics","parallel","pipeline","rust"],"created_at":"2025-07-16T04:06:57.783Z","updated_at":"2025-10-24T23:44:31.160Z","avatar_url":"https://github.com/divvun.png","language":"Rust","funding_links":[],"categories":[],"sub_categories":[],"readme":"# divvun-pipeline\n\nAsynchronous pipelines for linguistic resources.\n\n## Usage\n\nCan run pipeline with:\n\n`cargo run --bin divvun-pipeline -- myfile.zpipe`\n\n`cargo run --bin divvun-pipeline divvun-pipeline/tests/pipeline.zpipe`\n\nTo test text input and output:\n\n`cargo run --bin zinput-convert -- --text \"this is my awesome string that should come back the same\" | cargo run --bin zoutput-convert`\n\nInput, output, divvun-pipeline:\n\n`cargo run --bin zinput-convert -- --text \"this is my awesome string that should come back the same\" | cargo run --bin divvun-pipeline divvun-pipeline/tests/pipeline.zpipe`\n\nTo generate a 0 compression pipeline zip file (on Unix):\n\n`zip -0 -r pipeline.zpipe pipeline.json yummy_resource`\n\nIf you just do `zip -0 -r pipeline.zpipe unzipped`, it will have the actual folder `unzipped` there, which is not supported\n\n## Testing\n\nTo run tests:\n\nOn Mac:\n\n`./test.sh`\n\nOn Linux:\n\n`./test-linux.sh`\n\nOn Windows:\n\nExercise for the reader, but probably modify one of the other files to refer to `.dll` files.\n\n### Hfst Module\nSee modules/hfst/README.md about getting the latest hfst binaries to compile. Get the se.zcheck file from somewhere and extract to the folder se_zcheck.\n\nUnignore the test in divvun-pipeline/tests/hfst.rs and run that after compiling \u0026 copying everything with ./test.sh\n\n## Building on Linux\nThere's a Dockerfile that sets up a Debian build environment. It can be built, run and attached to the current directory with:\n`docker build -t divvun-pipeline . \u0026\u0026 docker run --rm -ti -v ${PWD}:/opt divvun-pipeline /bin/bash`\n\nParticularly of importance for building some dependencies (hfst for example) is a modern clang version, which Debian (stretch) does not appear to have.\n\n## License\n\nThe `divvun-pipeline` and `divvun-schema` crates of this project are licensed under either of\n\n * Apache License, Version 2.0 ([LICENSE-APACHE](LICENSE-APACHE) or http://www.apache.org/licenses/LICENSE-2.0)\n * MIT license ([LICENSE-MIT](LICENSE-MIT) or http://opensource.org/licenses/MIT)\n\nat your option.\n\n---\n\nFor `modules/`, please see each module's license respectively.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdivvun%2Fdivvun-pipeline","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fdivvun%2Fdivvun-pipeline","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdivvun%2Fdivvun-pipeline/lists"}