Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/i4ds/whisper-prep
Data preparation utility for the finetuning of OpenAI's Whisper model.
https://github.com/i4ds/whisper-prep
fine-tuning nlp speech-to-text whisper
Last synced: 5 days ago
JSON representation
Data preparation utility for the finetuning of OpenAI's Whisper model.
- Host: GitHub
- URL: https://github.com/i4ds/whisper-prep
- Owner: i4Ds
- License: mit
- Created: 2024-05-14T11:35:20.000Z (6 months ago)
- Default Branch: main
- Last Pushed: 2024-08-16T13:45:26.000Z (3 months ago)
- Last Synced: 2024-08-16T14:54:14.305Z (3 months ago)
- Topics: fine-tuning, nlp, speech-to-text, whisper
- Language: Python
- Homepage:
- Size: 365 KB
- Stars: 2
- Watchers: 3
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
[![Issues][issues-shield]][issues-url]
[![MIT License][license-shield]][license-url]
whisper-prep
Data preparation utility for the finetuning of OpenAI's Whisper model.
Table of Contents
## About The Project
This package assists in generating training data for fine-tuning Whisper by synthesizing .srt files from sentences, mimicking real data through sentence concatenation.## Data Preparation Guide
1. **Data File (.tsv):**
- Create a `.tsv` file with two required columns:
- `path`: The relative path to the `.mp3` file.
- `sentence`: The text corresponding to the audio file.
- Optional: If a `client_id` is included, it can be used to increase the probability that following sentences are from the same speaker. Refer to `generate_fold` in `src/whisper_prep/generation/generate.py` for additional features.2. **Configuration File (.yaml):**
- Set up a `.yaml` configuration file. An example can be found at `example.yaml`.3. **Running the Generation Script:**
- Run `whisper_prep -c `.4. **Upload to Huggingface.com:**
- https://huggingface.co/docs/datasets/v1.16.0/upload_dataset.html## Contact
Vincenzo Timmel - [email protected]
## License
Distributed under the MIT License. See `LICENSE` for more information.
[issues-shield]: https://img.shields.io/github/issues/i4Ds/whisper-prep.svg?style=for-the-badge
[issues-url]: https://github.com/i4Ds/whisper-prep/issues
[license-shield]: https://img.shields.io/github/license/i4Ds/whisper-prep.svg?style=for-the-badge
[license-url]: https://github.com/i4Ds/whisper-prep/blob/main/LICENSE