{"id":20273103,"url":"https://github.com/910jqk/add-jyutping","last_synced_at":"2025-10-09T11:12:53.083Z","repository":{"id":71156618,"uuid":"102344690","full_name":"910JQK/add-jyutping","owner":"910JQK","description":"OCR 粤拼字幕生成工具","archived":false,"fork":false,"pushed_at":"2017-09-04T14:43:38.000Z","size":482,"stargazers_count":7,"open_issues_count":1,"forks_count":0,"subscribers_count":1,"default_branch":"master","last_synced_at":"2025-04-11T04:42:52.832Z","etag":null,"topics":["cantonese","jyutping","language-learning","ocr","opencv","python"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/910JQK.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2017-09-04T09:37:23.000Z","updated_at":"2025-01-10T07:43:55.000Z","dependencies_parsed_at":null,"dependency_job_id":"685bdba4-f8b3-4859-b131-332b2d31e943","html_url":"https://github.com/910JQK/add-jyutping","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/910JQK/add-jyutping","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/910JQK%2Fadd-jyutping","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/910JQK%2Fadd-jyutping/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/910JQK%2Fadd-jyutping/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/910JQK%2Fadd-jyutping/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/910JQK","download_url":"https://codeload.github.com/910JQK/add-jyutping/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/910JQK%2Fadd-jyutping/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":279001288,"owners_count":26083058,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-10-09T02:00:07.460Z","response_time":59,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["cantonese","jyutping","language-learning","ocr","opencv","python"],"created_at":"2024-11-14T12:48:03.069Z","updated_at":"2025-10-09T11:12:53.031Z","avatar_url":"https://github.com/910JQK.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# 粵拼字幕生成工具\n\n## 簡介\n\n此 Python Script 可用於給 TVB 的電視節目添加粵拼字幕，目前尚在 Experimental 階段。開源授權條款寫在程式檔案中。\n\n\u003cimg src=\"screenshot.png\" /\u003e\n\n現在的效果一般，時間軸稍有偏差，OCR 有不少錯字，多音字也沒辦法處理。而且二十分鐘的影片都需要處理八十分鐘（……）。不過到這種程度就已經很有用了，借用一句名言就是「不滿意但可以接受」。\n\n圖形處理和 OCR 的部分，是由 OpenCV 和 pyocr 完成的。由於本人對這些方面並不太了解，所以這部分借用的是 kerrickstaley 的程式，原程式見：[kerrickstaley/extracting-chinese-subs](https://github.com/kerrickstaley/extracting-chinese-subs)\n\n## 使用方法\n\n```\nusage: jyutping.py [-h] [--top TOP] [--bottom BOTTOM] [--left LEFT]\n                   [--right RIGHT]\n                   video_file\n```\n其中 top, bottom, left, right 是包住字幕的矩形的邊界，若搞錯則得不到任何文字。默認的數字是以 1280×720 大小的影片為準的。\n\n運行後，程式會向 stderr 輸出有關處理進度的資訊，並在 stdout 輸出 .srt 檔的內容，因此使用時請重定向 stdout 到檔案。\n\n例如：\n\n```\n$ ./jyutping.py foobar.mp4 \u003e foobar.srt\n```\n\n## 這麽粗製濫造真的大丈夫？\n\n講真，為了學個廣東話去深入學習 OpenCV / Image Processing 實在是主次顛倒，所以就先這樣吧。而且最近不大想寫程式了，心累。","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2F910jqk%2Fadd-jyutping","html_url":"https://awesome.ecosyste.ms/projects/github.com%2F910jqk%2Fadd-jyutping","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2F910jqk%2Fadd-jyutping/lists"}