Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/i4ds/whisper-prep

Data preparation utility for the finetuning of OpenAI's Whisper model.
https://github.com/i4ds/whisper-prep

fine-tuning nlp speech-to-text whisper

Last synced: 3 months ago
JSON representation

Data preparation utility for the finetuning of OpenAI's Whisper model.

Host: GitHub
URL: https://github.com/i4ds/whisper-prep
Owner: i4Ds
License: mit
Created: 2024-05-14T11:35:20.000Z (9 months ago)
Default Branch: main
Last Pushed: 2024-08-16T13:45:26.000Z (6 months ago)
Last Synced: 2024-08-16T14:54:14.305Z (6 months ago)
Topics: fine-tuning, nlp, speech-to-text, whisper
Language: Python
Homepage:
Size: 365 KB
Stars: 2
Watchers: 3
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

[![Issues][issues-shield]][issues-url]
[![MIT License][license-shield]][license-url]

whisper-prep

Data preparation utility for the finetuning of OpenAI's Whisper model.

Table of Contents

About The Project

License

Contact

## About The Project
This package assists in generating training data for fine-tuning Whisper by synthesizing .srt files from sentences, mimicking real data through sentence concatenation.

(back to top)

## Data Preparation Guide
1. **Data File (.tsv):**
- Create a `.tsv` file with two required columns:
- `path`: The relative path to the `.mp3` file.
- `sentence`: The text corresponding to the audio file.
- Optional: If a `client_id` is included, it can be used to increase the probability that following sentences are from the same speaker. Refer to `generate_fold` in `src/whisper_prep/generation/generate.py` for additional features.

2. **Configuration File (.yaml):**
- Set up a `.yaml` configuration file. An example can be found at `example.yaml`.

3. **Running the Generation Script:**
- Run `whisper_prep -c `.

4. **Upload to Huggingface.com:**
- https://huggingface.co/docs/datasets/v1.16.0/upload_dataset.html

(back to top)

## Contact

Vincenzo Timmel - [email protected]

(back to top)

## License

Distributed under the MIT License. See `LICENSE` for more information.

(back to top)

[issues-shield]: https://img.shields.io/github/issues/i4Ds/whisper-prep.svg?style=for-the-badge
[issues-url]: https://github.com/i4Ds/whisper-prep/issues
[license-shield]: https://img.shields.io/github/license/i4Ds/whisper-prep.svg?style=for-the-badge
[license-url]: https://github.com/i4Ds/whisper-prep/blob/main/LICENSE