Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/i4ds/whisper-prep

Data preparation utility for the finetuning of OpenAI's Whisper model.
https://github.com/i4ds/whisper-prep

fine-tuning nlp speech-to-text whisper

Last synced: 5 days ago
JSON representation

Data preparation utility for the finetuning of OpenAI's Whisper model.

Awesome Lists containing this project

README

        

[![Issues][issues-shield]][issues-url]
[![MIT License][license-shield]][license-url]




whisper-prep


Data preparation utility for the finetuning of OpenAI's Whisper model.


Table of Contents


  1. About The Project

  2. License

  3. Contact

## About The Project
This package assists in generating training data for fine-tuning Whisper by synthesizing .srt files from sentences, mimicking real data through sentence concatenation.

(back to top)

## Data Preparation Guide
1. **Data File (.tsv):**
- Create a `.tsv` file with two required columns:
- `path`: The relative path to the `.mp3` file.
- `sentence`: The text corresponding to the audio file.
- Optional: If a `client_id` is included, it can be used to increase the probability that following sentences are from the same speaker. Refer to `generate_fold` in `src/whisper_prep/generation/generate.py` for additional features.

2. **Configuration File (.yaml):**
- Set up a `.yaml` configuration file. An example can be found at `example.yaml`.

3. **Running the Generation Script:**
- Run `whisper_prep -c `.

4. **Upload to Huggingface.com:**
- https://huggingface.co/docs/datasets/v1.16.0/upload_dataset.html

(back to top)

## Contact

Vincenzo Timmel - [email protected]

(back to top)

## License

Distributed under the MIT License. See `LICENSE` for more information.

(back to top)

[issues-shield]: https://img.shields.io/github/issues/i4Ds/whisper-prep.svg?style=for-the-badge
[issues-url]: https://github.com/i4Ds/whisper-prep/issues
[license-shield]: https://img.shields.io/github/license/i4Ds/whisper-prep.svg?style=for-the-badge
[license-url]: https://github.com/i4Ds/whisper-prep/blob/main/LICENSE