{"id":13825937,"url":"https://github.com/JorianWoltjer/AutoCaptions","last_synced_at":"2025-07-08T22:32:49.152Z","repository":{"id":132523424,"uuid":"582311065","full_name":"JorianWoltjer/AutoCaptions","owner":"JorianWoltjer","description":"A GUI tool that uses OpenAIs Whisper to transcribe text from an audio/video file, into a Premiere Pro sequence to automate the creation of subtitles.","archived":false,"fork":false,"pushed_at":"2023-09-25T06:47:06.000Z","size":193,"stargazers_count":41,"open_issues_count":1,"forks_count":2,"subscribers_count":4,"default_branch":"master","last_synced_at":"2024-11-14T20:03:08.483Z","etag":null,"topics":["ai","premiere-pro","srt","subtitles","whisper","xml"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/JorianWoltjer.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null}},"created_at":"2022-12-26T12:22:43.000Z","updated_at":"2024-10-31T10:48:10.000Z","dependencies_parsed_at":"2023-09-25T09:33:39.413Z","dependency_job_id":null,"html_url":"https://github.com/JorianWoltjer/AutoCaptions","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/JorianWoltjer%2FAutoCaptions","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/JorianWoltjer%2FAutoCaptions/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/JorianWoltjer%2FAutoCaptions/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/JorianWoltjer%2FAutoCaptions/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/JorianWoltjer","download_url":"https://codeload.github.com/JorianWoltjer/AutoCaptions/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":225470631,"owners_count":17479366,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["ai","premiere-pro","srt","subtitles","whisper","xml"],"created_at":"2024-08-04T09:01:29.502Z","updated_at":"2024-11-20T04:31:01.166Z","avatar_url":"https://github.com/JorianWoltjer.png","language":"Python","funding_links":[],"categories":["Python"],"sub_categories":[],"readme":"# AutoCaptions\n\n**Transcribe an audio file to Premiere Pro layers**\n\nA GUI tool that uses [OpenAI's Whisper](https://github.com/openai/whisper) to transcribe text from an audio/video file, into a Premiere Pro sequence to automate the creation of subtitles. Mainly for adding quick subtitles to action-packed videos, by making segments of a small word count. \n\nOutputs a `.xml` file which is a sequence containing text layers (Essential Graphics) that can be imported into your Premiere Pro project. \n\nUses [`stable-ts`](https://github.com/jianfch/stable-ts) regrouping functions to split the result into small configurable segments. \n\n## Installation\n\n```cmd\ngit clone https://github.com/JorianWoltjer/AutoCaptions.git \u0026\u0026 cd AutoCaptions\npython -m pip install -r requirements.txt\n```\n\n### Torch\n\nMake sure to install the GPU enabled version of `torch` to make Whisper a lot faster:\n\n```shell\npython -m pip uninstall torch\npython -m pip cache purge\npython -m pip install torch -f https://download.pytorch.org/whl/torch_stable.html\n```\n\n### ffmpeg\n\nAn external dependency for Whisper that needs to be installed:\n\n###### Windows\n\nInstall [Chocolatey](https://docs.chocolatey.org/en-us/choco/setup), then run the following command:\n\n```cmd\nchoco install ffmpeg\n```\n\n###### Linux\n\n```Shell\nsudo apt update \u0026\u0026 sudo apt install ffmpeg\n```\n\n## Running\n\n###### Windows\n\nSimply create a shortcut to [`start.bat`](start.bat)\n\n###### Linux\n\n```shell\n$ python main.py\n```\n\n## Example\n\nStart the batch script, and select a file as input. Then some configuration is available and you can transcribe the audio:\n\n![A terminal showing Whisper output and some progress updates, with the simple GUI on Windows](img/terminal_example.png)\n\nThe resulting XML file can then be imported into a Premiere project, where you can use and edit the text layers it created:\n\n![A screenshot of the Premiere Pro timeline showing 3 text layers with the transcribed text](img/premiere_example.png)\n\n\u003e **Tip**: To apply a style to all the text layers, you can create an Essential Graphics preset. Just do your settings on one of the layers, and then save it as a preset. Then you can drag the preset from your Project window to all the layers you select.\n\u003e \n\u003e For animation keyframes you want to save an Animation Preset, which you can do by right-clicking on your created effect with keyframes and saving the Preset. Then you can drag it from your Effects window under Presets to all the layers you select. \n\n## Resources\n\n* https://github.com/jianfch/stable-ts\n* https://github.com/openai/whisper/discussions/3#discussioncomment-3730914\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2FJorianWoltjer%2FAutoCaptions","html_url":"https://awesome.ecosyste.ms/projects/github.com%2FJorianWoltjer%2FAutoCaptions","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2FJorianWoltjer%2FAutoCaptions/lists"}