Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/bbc-esq/whispers2t-transcriber
Uses the powerful WhisperS2T and Ctranslate2 libraries to batch transcribe multiple files
https://github.com/bbc-esq/whispers2t-transcriber
audio-recorder audio-recording audio-transcribing audio-transcription ctranslate2 flash-attention-2 transcr transcriber transcription whispers2t
Last synced: about 1 month ago
JSON representation
Uses the powerful WhisperS2T and Ctranslate2 libraries to batch transcribe multiple files
- Host: GitHub
- URL: https://github.com/bbc-esq/whispers2t-transcriber
- Owner: BBC-Esq
- Created: 2024-02-29T18:36:51.000Z (10 months ago)
- Default Branch: main
- Last Pushed: 2024-09-17T08:19:25.000Z (3 months ago)
- Last Synced: 2024-09-17T10:56:03.185Z (3 months ago)
- Topics: audio-recorder, audio-recording, audio-transcribing, audio-transcription, ctranslate2, flash-attention-2, transcr, transcriber, transcription, whispers2t
- Language: Python
- Homepage:
- Size: 72.3 KB
- Stars: 7
- Watchers: 1
- Forks: 0
- Open Issues: 3
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# ๐WhisperS2T-transcriber๐
* Uses the powerful WhisperS2T and Ctranslate2 libraries to batch transcribe multiple files## Requirements
1) ๐[Python 3.11](https://www.python.org/downloads/release/python-3117/)
2) ๐[Git](https://git-scm.com/downloads)
3) ๐[Git Large File Storage](https://git-lfs.com/)
8) ๐ช Windows (linux not yet supported)
> You can modify the installation instructions manually for Linux, I just don't have Linux in order to test them reliably. You can do this by analyzing the ```setup_gpu.py``` or ```setup_cpu_only.py``` file to see what libraries are required.# Installation
Download the latest release and extract the files your computer. Navigate to the respository folder, create a command prompt, and run the following commands:```
python -m venv .
```
```
.\Scripts\activate
```
> Run this again to activate the environment each time you restart the program.Select GPU (includes CPU) or CPU-only installation:
```
python setup_gpu.py
```
```
python setup_cpu_only.py
```
# Usage
```
python whispers2t_batch_gui.py
```
The program will process any and all of the following file types:
* ```.mp3```, ```.wav```, ```.flac```, ```.wma```, ```.aac```, ```.m4a```, ```.avi```, ```.mkv```, ```.mp4```, ```.asf```, ```.amr```.### Important
All transcriptions are output in the same folder of the file that was transcribed. If you'd like to change this behavior put an issue on Github requesting it.