{"id":23536202,"url":"https://github.com/huuhuy227/hardcoded-subtitle-extraction","last_synced_at":"2026-05-02T03:08:44.535Z","repository":{"id":268742052,"uuid":"902933681","full_name":"HuuHuy227/Hardcoded-Subtitle-Extraction","owner":"HuuHuy227","description":"Leverage OCR for hardcoded subtitle extractor","archived":false,"fork":false,"pushed_at":"2024-12-27T14:41:20.000Z","size":62350,"stargazers_count":1,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-02-17T06:41:53.460Z","etag":null,"topics":["deep-learning","ocr-recognition","paddleocr","paddlepaddle","streamlit","subtitles","tkinter-gui"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/HuuHuy227.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null}},"created_at":"2024-12-13T15:13:54.000Z","updated_at":"2025-01-17T05:50:56.000Z","dependencies_parsed_at":"2024-12-18T16:55:12.471Z","dependency_job_id":"7b37afd6-1022-434d-b772-7dad7c00a4b8","html_url":"https://github.com/HuuHuy227/Hardcoded-Subtitle-Extraction","commit_stats":null,"previous_names":["huuhuy227/subtitle-extraction","huuhuy227/hardcoded-subtitle-extraction"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/HuuHuy227%2FHardcoded-Subtitle-Extraction","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/HuuHuy227%2FHardcoded-Subtitle-Extraction/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/
repositories/HuuHuy227%2FHardcoded-Subtitle-Extraction/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/HuuHuy227%2FHardcoded-Subtitle-Extraction/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/HuuHuy227","download_url":"https://codeload.github.com/HuuHuy227/Hardcoded-Subtitle-Extraction/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":254240217,"owners_count":22037841,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["deep-learning","ocr-recognition","paddleocr","paddlepaddle","streamlit","subtitles","tkinter-gui"],"created_at":"2024-12-26T02:17:27.882Z","updated_at":"2026-05-02T03:08:44.518Z","avatar_url":"https://github.com/HuuHuy227.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Video Hardcoded Subtitle Extractor\n\n[![Python 3.8+](https://img.shields.io/badge/python-3.8+-blue.svg)](https://www.python.org/downloads/)\nExtract hardcoded (burned-in) subtitles from videos using OCR. Available as both a desktop application and a web interface.\nThis implementation uses [PaddleOCR](https://github.com/PaddlePaddle/PaddleOCR) as its OCR backend.\n\n## 🎯 Features\n- GUI and web interface options\n- Support for MP4, AVI, and MOV video formats\n- Adjustable frame rate and confidence threshold\n- Multiple language support (English, Chinese, Japanese, Korean, Arabic)\n- SRT export format. 
Bilingual subtitles are also supported.\n- **Note:** For long videos, installing the GPU version is recommended for faster processing (about 1/5 of the video's length).\n\n\n## ⚙️ Requirements\n- Python 3.8+\n- NVIDIA GPU (optional)\n- CUDA Toolkit 11.8 or 12.0+ (for GPU acceleration)\n- 4GB RAM minimum (8GB recommended)\n\n## 📥 Installation\n\n### Option 1: Conda Environment\nFor the GPU version, install CUDA and cuDNN first, choosing versions according to the [PaddlePaddle installation guide](https://www.paddlepaddle.org.cn/en/install/quick?docurl=/documentation/docs/en/install/pip/windows-pip_en.html).\n```bash\n# Create conda environment\nconda create -n subtitle-env python=3.10\nconda activate subtitle-env\n# Install PaddlePaddle (CPU build)\npip install paddlepaddle\n# For GPU support (optional), install the GPU build instead:\n# pip install paddlepaddle-gpu==2.6.1\n# Install dependencies\npip install -r requirements.txt\n```\n\n### Option 2: Docker\n```bash\n# Install NVIDIA Container Toolkit first\n# Then build and run with GPU support\ndocker-compose -f docker-compose.yml build\ndocker-compose -f docker-compose.yml up\n```\n\n## 🚀 Usage\n### Desktop Application\n```bash\n# Launch GUI\npython gui.py\n```\n![Desktop application demo](https://i.giphy.com/media/v1.Y2lkPTc5MGI3NjExcGJ0ejlkbXY2OGxkOXY0azlwZ2ttOHMxbnB4eDVsdDRlbTBmbmk0bCZlcD12MV9pbnRlcm5hbF9naWZfYnlfaWQmY3Q9Zw/Mk4QxIiNl1V2lx9afi/giphy.gif)\n### Web Interface\n```bash\n# Launch web app\nstreamlit run app.py\n```\n![Web interface demo](https://i.giphy.com/media/v1.Y2lkPTc5MGI3NjExbTk2NWpnbXI5MWV6ZzVoYmIwODZpdzNtZnVybHF1N2JrempybjY1dCZlcD12MV9pbnRlcm5hbF9naWZfYnlfaWQmY3Q9Zw/3Y1bedk8LoZkPi18OK/giphy.gif)\n\n[Demo video](https://www.youtube.com/watch?v=2ZxI7lb3C2I)\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fhuuhuy227%2Fhardcoded-subtitle-extraction","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fhuuhuy227%2Fhardcoded-subtitle-extraction","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fhuuhuy227%2Fhardcoded-subtitle-extraction/lists"}