{"id":15520721,"url":"https://github.com/proycon/hyphertool","last_synced_at":"2026-02-21T21:35:28.087Z","repository":{"id":221142076,"uuid":"753602674","full_name":"proycon/hyphertool","owner":"proycon","description":"Command-line tool for syllabification and hyphenisation for multiple languages","archived":false,"fork":false,"pushed_at":"2025-02-21T10:38:20.000Z","size":26,"stargazers_count":1,"open_issues_count":1,"forks_count":0,"subscribers_count":1,"default_branch":"master","last_synced_at":"2026-02-15T13:29:15.068Z","etag":null,"topics":["dehyphenation","hyphenation","nlp","syllabification"],"latest_commit_sha":null,"homepage":"https://codeberg.org/proycon/hyphertool","language":"Rust","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"gpl-3.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/proycon.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null,"notice":null,"maintainers":null,"copyright":null,"agents":null,"dco":null,"cla":null}},"created_at":"2024-02-06T12:58:25.000Z","updated_at":"2025-02-21T10:45:22.000Z","dependencies_parsed_at":"2024-02-06T13:27:12.372Z","dependency_job_id":"73d50def-3735-489c-99e7-ca16107e4402","html_url":"https://github.com/proycon/hyphertool","commit_stats":null,"previous_names":["proycon/hyphertool"],"tags_count":2,"template":false,"template_full_name":null,"purl":"pkg:github/proycon/hyphertool","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/proycon%2Fhyphertool","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/proycon%2Fhyphertool/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/proycon%2Fhyphertool/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/proycon%2Fhyphertool/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/proycon","download_url":"https://codeload.github.com/proycon/hyphertool/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/proycon%2Fhyphertool/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":29694211,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-02-21T18:18:25.093Z","status":"ssl_error","status_checked_at":"2026-02-21T18:18:22.435Z","response_time":107,"last_error":"SSL_read: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["dehyphenation","hyphenation","nlp","syllabification"],"created_at":"2024-10-02T10:29:03.884Z","updated_at":"2026-02-21T21:35:28.050Z","avatar_url":"https://github.com/proycon.png","language":"Rust","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Introduction\n\nThis is a command-line tool for syllabification and hyphenisation, it supports multiple languages.\nIt is just a thin wrapper on top of the [hypher](https://crates.io/crates/hypher) for Rust, the hyphenation rules it uses for\nthe various languages are derived from TeX.\n\n## Install\n\nEnsure ``cargo`` and ``rustc`` are installed on your system, then:\n\n```\n$ cargo install hyphertool\n```\n\n## Usage\n\n### Syllabification\n\nGiven a text file, output the text with all syllables explicitly marked. The delimiter can be set via ``--delimiter``:\n\n```\n$ hyphertool --language nl test.txt\nDit is een test-be-stand. Kan je dit be-stand mooi voor mij ver-wer-ken?\nIk hoop op een po-si-tief re-sul-taat.\n```\n\n### Hyphenation\n\nWrap a text on a certain width. Note that the width is in unicode points (not bytes), and has no regard for double-spaced characters:\n\n```\n$ hyphertool --language nl --width 15 test.txt\nDit is een test-\nbestand. Kan je\ndit bestand mooi\nvoor mij verwer-\nken?\nIk hoop op\neen positief re-\nsultaat.\n```\n\n### Stand-off syllabification\n\nOutput all syllables with stand-off offsets to the text. Offsets are 0-indexed\nunicode character points, the end is non-inclusive. Output will be in TSV (Tab\nSeparated Values) which can be easily imported in other software like [stam\nimport](https://github.com/annotation/stam-tools) for further analysis.\n\n```\n$ hyphertool --language nl --standoff test.txt\nText\tBeginOffset\tEndOffset\nDit\t0\t3\nis\t4\t6\neen\t7\t10\ntest\t11\t15\nbe\t15\t17\nstand\t17\t22\nKan\t24\t27\nje\t28\t30\ndit\t31\t34\nbe\t35\t37\nstand\t37\t42\nmooi\t43\t47\nvoor\t48\t52\nmij\t53\t56\nver\t57\t60\nwer\t60\t63\nken\t63\t66\nIk\t68\t70\nhoop\t71\t75\nop\t76\t78\neen\t79\t82\npo\t83\t85\nsi\t85\t87\ntief\t87\t91\nre\t92\t94\nsul\t94\t97\ntaat\t97\t101\n```\n\n### Dehyphenation\n\nThis tool can also be used for simple dehyphenation. The `--language` parameter is not actually implemented for this so this is not a lexical-informed dehyphenation. The mode is triggered by specifying one or more characters to dehyphenate on via `--dehyphenate`. We take the output of our earlier hyphenation test as input again:\n\n```\n$ hyphertool --language nl --dehyphenate - test2.txt                                                       ❌2 \nDit is een testbestand. Kan je\ndit bestand mooi\nvoor mij verwerken?\nIk hoop op\neen positief resultaat.\n```\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fproycon%2Fhyphertool","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fproycon%2Fhyphertool","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fproycon%2Fhyphertool/lists"}