https://github.com/i4ds/stt4sg-transcribe

Last synced: 11 months ago

Host: GitHub
URL: https://github.com/i4ds/stt4sg-transcribe
Owner: i4Ds
License: mit
Created: 2025-03-31T15:26:37.000Z (about 1 year ago)
Default Branch: main
Last Pushed: 2025-05-20T10:38:09.000Z (about 1 year ago)
Last Synced: 2025-06-18T01:51:29.161Z (about 1 year ago)
Language: Python
Size: 14.6 KB
Stars: 0
Watchers: 2
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

README

# WhisperX Transcription Tool

This tool uses WhisperX to transcribe all audio and video files in a folder. It also includes a guide to installation and to managing CUDA and cuDNN versions.

## Installation
1. Install Python 3.11.
2. Find your CUDA version with `nvidia-smi` and your cuDNN version with `python get_cudnn_version.py`.
3. Install CUDA and PyTorch: follow the instructions on the [PyTorch website](https://pytorch.org/get-started/locally/) to install the versions of CUDA and PyTorch that match your system.
4. With your CUDA and cuDNN versions at hand, go to [faster-whisper | Requirements](https://github.com/SYSTRAN/faster-whisper), look up the `ctranslate2` version that matches them, and install it.
5. Install the required Python packages:
```bash
pip install -r requirements.txt
```
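
As a quick sanity check after the steps above, the versions that PyTorch itself reports can be printed from Python. The following is a minimal sketch, not the repository's `get_cudnn_version.py`; the function name `report_gpu_stack` is hypothetical, and the script assumes only that PyTorch may or may not be installed yet.

```python
import shutil
import sys


def report_gpu_stack() -> dict:
    """Collect Python, PyTorch, CUDA, and cuDNN details; unknown values stay None."""
    info = {
        "python": f"{sys.version_info.major}.{sys.version_info.minor}",
        "torch": None,
        "cuda_built_with": None,
        "cudnn": None,
        "cuda_available": False,
        # True if the NVIDIA driver tool from step 2 is on the PATH.
        "nvidia_smi_found": shutil.which("nvidia-smi") is not None,
    }
    try:
        import torch
    except ImportError:
        # PyTorch is not installed yet, so only the basic fields are filled in.
        return info

    info["torch"] = torch.__version__
    # CUDA version the installed PyTorch build was compiled against (None on CPU builds).
    info["cuda_built_with"] = torch.version.cuda
    info["cuda_available"] = torch.cuda.is_available()
    if torch.backends.cudnn.is_available():
        # Integer such as 8902 or 90100, depending on the cuDNN release.
        info["cudnn"] = torch.backends.cudnn.version()
    return info


if __name__ == "__main__":
    for key, value in report_gpu_stack().items():
        print(f"{key}: {value}")
```

If `cuda_available` is `False` although `nvidia-smi` works, the installed PyTorch build is probably a CPU-only build or targets a different CUDA version than the driver supports.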

## Usage
In the `transcribe.py` file, set the `folder_path` variable to the path of the folder containing your audio and video files, and pass the model name or path to the `AudioTranscriber` class. Then, run the script:
```bash
python transcribe.py
```
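
Before starting a long run, it can help to preview which files in `folder_path` would be picked up. The sketch below is an illustration only: the helper `find_media_files` and the extension list `MEDIA_EXTENSIONS` are hypothetical and not taken from the repository, whose own file discovery and `AudioTranscriber` signature are not shown in this README.

```python
from pathlib import Path

# Assumed extension list, for illustration only; the repository may accept others.
MEDIA_EXTENSIONS = {".mp3", ".wav", ".m4a", ".flac", ".mp4", ".mkv", ".mov"}


def find_media_files(folder_path: str) -> list[Path]:
    """Return the audio and video files directly inside folder_path, sorted by name."""
    folder = Path(folder_path)
    if not folder.is_dir():
        raise NotADirectoryError(f"Not a folder: {folder}")
    return sorted(
        path
        for path in folder.iterdir()
        # Compare suffixes case-insensitively so that "TALK.MP4" also matches.
        if path.is_file() and path.suffix.lower() in MEDIA_EXTENSIONS
    )


if __name__ == "__main__":
    # Replace "." with the same value you assign to folder_path in transcribe.py.
    for media_file in find_media_files("."):
        print(media_file)
```

This only lists files at the top level of the folder; whether the tool itself also descends into subfolders is not stated in the README.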

## Maintainer
- [@kenfus](https://github.com/kenfus)
