Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/bilalhameed248/whisper-fine-tuning-for-pronunciation-learning

Fine Tuning of Whisper Speech To Text Base Model For Pronunciation Learning
https://github.com/bilalhameed248/whisper-fine-tuning-for-pronunciation-learning

deep-learning deep-neural-networks dnn fine-tuning openai pronunciation python seq2seq speech speech-recognition speech-synthesis speech-to-text whisper whisper-ai

Last synced: about 2 hours ago
JSON representation

Fine Tuning of Whisper Speech To Text Base Model For Pronunciation Learning

Awesome Lists containing this project

README

        

Whisper Fine-tuning for Pronunciation Learning



In this project, I undertook the task of fine-tuning a whisper speech to text base model to enhance pronunciation learning, particularly focusing on broken words or fragmented speech segments. The primary objective was to develop a robust system capable of accurately transcribing whispered speech, especially in scenarios where words are partially uttered or fragmented. Leveraged advanced transfer learning techniques and deep learning architectures to achieve an impressive accuracy rate of nearly 95%. Collaborated with educators to integrate the model into language learning applications, demonstrating a commitment to leveraging technology for educational enhancement.