https://github.com/makaveli10/whisper-finetuning-dataset
https://github.com/makaveli10/whisper-finetuning-dataset
Last synced: about 2 months ago
JSON representation
- Host: GitHub
- URL: https://github.com/makaveli10/whisper-finetuning-dataset
- Owner: makaveli10
- Created: 2023-06-19T11:23:54.000Z (almost 2 years ago)
- Default Branch: master
- Last Pushed: 2023-07-05T15:45:23.000Z (almost 2 years ago)
- Last Synced: 2025-02-12T10:21:25.593Z (3 months ago)
- Language: HTML
- Size: 145 MB
- Stars: 0
- Watchers: 1
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Transcription Verification
The purpose of this is to verify the transcript from ```large-v2``` whisper and prepare data to fine tune a smaller whisper model with the custom dataset.## Getting Started
### Local setup
- Clone this repo.
- Start simple http server
```bash
cd transcription_verification
python -m http.server
```
- Navigate to http://0.0.0.0:8000/
- Make your changes and click on ```save_changes``` to download the modified metadata.csv
- Commit your changes to this repo.### Github Pages
- Or simply navigate to [whisper-finetuning-dataset github page](https://makaveli10.github.io/whisper-finetuning-dataset/).
- Verify the dataset
- Save changes to download the modified metadata.csv