{"id":13825406,"url":"https://github.com/moi15moi/VideoSubOCR","last_synced_at":"2025-07-08T21:32:30.887Z","repository":{"id":106447420,"uuid":"541345752","full_name":"moi15moi/VideoSubOCR","owner":"moi15moi","description":"OCR automation for VideoSubFinder","archived":false,"fork":false,"pushed_at":"2022-09-26T01:15:58.000Z","size":51,"stargazers_count":10,"open_issues_count":0,"forks_count":3,"subscribers_count":1,"default_branch":"main","last_synced_at":"2024-08-05T09:14:11.637Z","etag":null,"topics":["advanced-substation-alpha","ass","ocr","ocr-python","ocr-recognition","subtitle","videosubfinder"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"gpl-3.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/moi15moi.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2022-09-26T00:05:10.000Z","updated_at":"2024-07-31T02:13:47.000Z","dependencies_parsed_at":null,"dependency_job_id":"fa729b20-6687-4497-8076-f719bdddfe80","html_url":"https://github.com/moi15moi/VideoSubOCR","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/moi15moi%2FVideoSubOCR","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/moi15moi%2FVideoSubOCR/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/moi15moi%2FVideoSubOCR/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/moi15moi%2FVideoSubOCR/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/Gi
tHub/owners/moi15moi","download_url":"https://codeload.github.com/moi15moi/VideoSubOCR/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":225465291,"owners_count":17478518,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["advanced-substation-alpha","ass","ocr","ocr-python","ocr-recognition","subtitle","videosubfinder"],"created_at":"2024-08-04T09:01:20.306Z","updated_at":"2024-11-20T03:31:16.394Z","avatar_url":"https://github.com/moi15moi.png","language":"Python","readme":"# VideoSubOCR\nOCR automation for VideoSubFinder. \n\n## Installation and Update\n```\npip install git+https://github.com/moi15moi/VideoSubOCR.git\n```\n\n## Dependencies\n- [Python 3.7 or later](https://www.python.org/downloads)\n- [ABBYY FineReader](https://pdf.abbyy.com) (or Tesseract)\n- [Tesseract](https://digi.bib.uni-mannheim.de/tesseract/?C=M;O=D) (or ABBYY FineReader)\n- [VapourSynth](https://github.com/vapoursynth/vapoursynth/releases/latest)  (Optional, but required to use the cropbox arguments)\n- [VapourSynth-Editor](https://github.com/YomikoR/VapourSynth-Editor/releases/latest)  (Optional, but required to use VideoPreview.vpy)\n\n### Setup for [ABBYY FineReader](https://pdf.abbyy.com)\nYour parameters must match the ones shown in this image.\nTo change your parameters, open **ABBYY FineReader** and go to ``Tools -\u003e Options -\u003e Format Settings -\u003e TXT``\n![Alt text](https://github.com/moi15moi/VideoSubOCR/blob/main/ABBYY%20FineReader%20-%20OCR%20Parameters.png)\n\n### Setup for 
[Tesseract](https://digi.bib.uni-mannheim.de/tesseract/?C=M;O=D)\nTo select the language you want, you need to download the Tesseract tessdata.\n\n- [tessdata](https://github.com/tesseract-ocr/tessdata)\n- [tessdata_best](https://github.com/tesseract-ocr/tessdata_best)\n- [tessdata_fast](https://github.com/tesseract-ocr/tessdata_fast)\n\nI recommend using the **tessdata_best** traineddata.\n\nDownload the traineddata file for the language you need and move it into the tessdata folder.\nOn Windows, it should be: ``C:\\Users\\YOUR_USERNAME\\AppData\\Local\\Tesseract-OCR\\tessdata``\n\n## Usage\n```console\nusage: VideoSubOCR [-h] --input INPUT [--generalSettings GENERALSETTINGS]\n                   [--cropBoxDimension Width_or_Height Width_or_Height] [--cropBoxHeight Height]\n                   [--videosubfinderwxw VIDEOSUBFINDERWXW] [--tesseract [TESSERACT]] [--finereader [FINEREADER]]\n                   [--lang LANG]\n\nVideo OCR automation.\n\noptions:\n  -h, --help            show this help message and exit\n  --input INPUT, -i INPUT\n                        Video file needed to be OCR.\n  --generalSettings GENERALSETTINGS\n                        Path to general settings (*.cfg file). By default, VideoSubFinder use the file\n                        settings/general.cfg\n  --cropBoxDimension Width_or_Height Width_or_Height, -cpdh Width_or_Height Width_or_Height\n                        CropBox dimension Width x Height. Ex: --cropBoxDimension 1920 1080\n  --cropBoxHeight Height, -cph Height\n                        CropBox height. It is the height between the video bottom and the cropbox bottom.\n  --videosubfinderwxw VIDEOSUBFINDERWXW, -vsf VIDEOSUBFINDERWXW\n                        Path to VideoSubFinderWXW.exe if not in variable environments.\n  --tesseract [TESSERACT], -t [TESSERACT]\n                        If tesseract.exe is in your variable environments, simply use --tesseract. 
If not, path to\n                        FineCmd.exe\n  --finereader [FINEREADER], -f [FINEREADER]\n                        If FineCmd.exe is in your variable environments, simply use --finereader. If not, path to\n                        FineCmd.exe\n  --lang LANG, -l LANG  Language in which the hardsubbed subtitle is. It will be use by the ocr engine you choose.\n                        Warning, ABBYY FineReader and Tesseract doesn't have the exact same input for the same\n                        language. Tesseract support ISO 639-2 (t version):\n                        https://www.loc.gov/standards/iso639-2/php/code_list.php ABBYY FineReader support the\n                        fullname: https://help.abbyy.com/en-us/finereader/15/user_guide/commandline_lang/\n```\n\n### VideoSubFinder General Settings\nI recommend always creating a settings file.\nThis gives better OCR results.\nYou can save the settings with ``File --\u003e Save Settings As...``\n\nFor more information about VideoSubFinder, see its [website](https://sourceforge.net/projects/videosubfinder/)\n\n### Alternative to VideoSubFinder General Settings\n\nIf you don't want to create a settings file, I recommend using the file named **VideoPreview.vpy**. To use it, you will need to install [VapourSynth](https://github.com/vapoursynth/vapoursynth/releases/latest) and [VapourSynth-Editor](https://github.com/YomikoR/VapourSynth-Editor/releases/latest).\n\n1. Open VideoPreview.vpy with VapourSynth-Editor.\n2. Change ``inputVideo``, ``cropBoxDimension``, and ``cropBoxHeight`` to the values you need.\n3. Press **F5**\n4. Check whether the cropbox fits your needs. 
If not, repeat steps 2 and 3.","funding_links":[],"categories":["Python"],"sub_categories":[],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmoi15moi%2FVideoSubOCR","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fmoi15moi%2FVideoSubOCR","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmoi15moi%2FVideoSubOCR/lists"}