Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/jorianwoltjer/autocaptions
A GUI tool that uses OpenAIs Whisper to transcribe text from an audio/video file, into a Premiere Pro sequence to automate the creation of subtitles.
https://github.com/jorianwoltjer/autocaptions
ai premiere-pro srt subtitles whisper xml
Last synced: 6 days ago
JSON representation
A GUI tool that uses OpenAIs Whisper to transcribe text from an audio/video file, into a Premiere Pro sequence to automate the creation of subtitles.
- Host: GitHub
- URL: https://github.com/jorianwoltjer/autocaptions
- Owner: JorianWoltjer
- Created: 2022-12-26T12:22:43.000Z (almost 2 years ago)
- Default Branch: master
- Last Pushed: 2023-09-25T06:47:06.000Z (about 1 year ago)
- Last Synced: 2024-08-05T09:14:40.881Z (3 months ago)
- Topics: ai, premiere-pro, srt, subtitles, whisper, xml
- Language: Python
- Homepage:
- Size: 188 KB
- Stars: 37
- Watchers: 4
- Forks: 2
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# AutoCaptions
**Transcribe an audio file to Premiere Pro layers**
A GUI tool that uses [OpenAI's Whisper](https://github.com/openai/whisper) to transcribe text from an audio/video file, into a Premiere Pro sequence to automate the creation of subtitles. Mainly for adding quick subtitles to action-packed videos, by making segments of a small word count.
Outputs a `.xml` file which is a sequence containing text layers (Essential Graphics) that can be imported into your Premiere Pro project.
Uses [`stable-ts`](https://github.com/jianfch/stable-ts) regrouping functions to split the result into small configurable segments.
## Installation
```cmd
git clone https://github.com/JorianWoltjer/AutoCaptions.git && cd AutoCaptions
python -m pip install -r requirements.txt
```### Torch
Make sure to install the GPU enabled version of `torch` to make Whisper a lot faster:
```shell
python -m pip uninstall torch
python -m pip cache purge
python -m pip install torch -f https://download.pytorch.org/whl/torch_stable.html
```### ffmpeg
An external dependency for Whisper that needs to be installed:
###### Windows
Install [Chocolatey](https://docs.chocolatey.org/en-us/choco/setup), then run the following command:
```cmd
choco install ffmpeg
```###### Linux
```Shell
sudo apt update && sudo apt install ffmpeg
```## Running
###### Windows
Simply create a shortcut to [`start.bat`](start.bat)
###### Linux
```shell
$ python main.py
```## Example
Start the batch script, and select a file as input. Then some configuration is available and you can transcribe the audio:
![A terminal showing Whisper output and some progress updates, with the simple GUI on Windows](img/terminal_example.png)
The resulting XML file can then be imported into a Premiere project, where you can use and edit the text layers it created:
![A screenshot of the Premiere Pro timeline showing 3 text layers with the transcribed text](img/premiere_example.png)
> **Tip**: To apply a style to all the text layers, you can create an Essential Graphics preset. Just do your settings on one of the layers, and then save it as a preset. Then you can drag the preset from your Project window to all the layers you select.
>
> For animation keyframes you want to save an Animation Preset, which you can do by right-clicking on your created effect with keyframes and saving the Preset. Then you can drag it from your Effects window under Presets to all the layers you select.## Resources
* https://github.com/jianfch/stable-ts
* https://github.com/openai/whisper/discussions/3#discussioncomment-3730914