{"id":20825752,"url":"https://github.com/ioriens/whisper-video","last_synced_at":"2025-07-23T22:32:24.892Z","repository":{"id":181641322,"uuid":"667039477","full_name":"IOriens/whisper-video","owner":"IOriens","description":"Generate subtitles for all the videos in a folder with OpenAI's Whisper privately in your computer.","archived":false,"fork":false,"pushed_at":"2024-05-03T05:56:17.000Z","size":70,"stargazers_count":29,"open_issues_count":2,"forks_count":4,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-04-10T02:58:21.775Z","etag":null,"topics":["langchain","subtitle-generator","summary","video-to-audio","video-to-text","whisper"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/IOriens.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2023-07-16T12:48:44.000Z","updated_at":"2025-01-10T05:32:19.000Z","dependencies_parsed_at":null,"dependency_job_id":"f43d9f14-f919-439c-93eb-8d29b42a0326","html_url":"https://github.com/IOriens/whisper-video","commit_stats":null,"previous_names":["ioriens/whisper-video"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/IOriens/whisper-video","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/IOriens%2Fwhisper-video","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/IOriens%2Fwhisper-video/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/IOriens%2Fwhisper-video/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/IOriens%2Fwhisper-video/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/IOriens","download_url":"https://codeload.github.com/IOriens/whisper-video/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/IOriens%2Fwhisper-video/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":266761337,"owners_count":23980286,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-07-23T02:00:09.312Z","response_time":66,"last_error":null,"robots_txt_status":null,"robots_txt_updated_at":null,"robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["langchain","subtitle-generator","summary","video-to-audio","video-to-text","whisper"],"created_at":"2024-11-17T23:06:43.685Z","updated_at":"2025-07-23T22:32:24.869Z","avatar_url":"https://github.com/IOriens.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"## Video to SRT Subtitle Converter\n\nThis Python script converts video files to SRT subtitle files using the Faster-Whisper model for speech recognition. The script reads video files from an input folder, converts them to MP3 format using ffmpeg, transcribes the audio using the Faster-Whisper model, and generates an SRT subtitle file for each video file in the output folder.\n\n![](./Intro.png)\n\n### Requirements\n\nTo run this script, you will need:\n\n- Python 3.7 or higher\n- The dependencies listed in `requirements.txt`\n- ffmpeg\n\nTo install the required Python libraries, you can use pip:\n\n```\npip install -r requirements.txt\n```\n\nTo install ffmpeg, you can download a pre-built binary from the official website:\n\n- [FFmpeg Downloads ↗](https://www.ffmpeg.org/download.html)\n\nIf you plan to run the script on a GPU, you will also need the NVIDIA libraries cuBLAS 11.x and cuDNN 8.x installed on your system. Please refer to the CTranslate2 documentation for installation instructions.\n\n### Usage\n\n1. Configure the input and output folder paths in the `config.ini` file.\n2. Run the script using the following command:\n\n```\npython main.py\n```\n\nThe script will iterate over all video files in the input folder, convert them to MP3 format using ffmpeg, transcribe the audio using the Faster-Whisper model, and generate an SRT subtitle file for each video file in the output folder.\n\n### Config\n\nThe script uses a configuration file `config.ini` to set the input and output folder paths, as well as the model configuration parameters. The file contains two sections: `[PATHS]` and `[MODEL]`.\n\n#### PATHS\n\nThe `[PATHS]` section contains the following options:\n\n- `input_folder`: the path to the folder containing the video files to be converted. Default is `/path/to/input/folder`.\n- `output_folder`: the path to the folder where the SRT subtitle files will be saved. Default is `/path/to/output/folder`.\n\n#### MODEL\n\nThe `[MODEL]` section contains the following options:\n\n- `model_size`: the size of the Faster-Whisper model to use for speech recognition. Default is `large-v2`.\n- `device`: the device to run the model on. Default is `cuda`.\n- `compute_type`: the compute type to use for the model. Default is `float16`.\n\nTo modify the configuration, open `config.ini` in a text editor and change the values of the options as desired. Make sure to save the file before running the script.\n\nNote: The `[MODEL]` section determines the performance and accuracy of the speech recognition, and may affect the speed and memory usage of the script. If you experience performance issues, consider modifying the model size or compute type.\n\n### License\n\nThis script is released under the MIT License.","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fioriens%2Fwhisper-video","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fioriens%2Fwhisper-video","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fioriens%2Fwhisper-video/lists"}