Projects in Awesome Lists tagged with whisper-model
A curated list of projects in awesome lists tagged with whisper-model .
https://github.com/shhossain/banglaspeech2text
BanglaSpeech2Text: An open-source offline speech-to-text package for Bangla language. Fine-tuned on the latest whisper speech to text model for optimal performance.
bangla bangla-asr bangla-automatic-speech-recognition bangla-speech-recognition bangla-speech-to-text bangla-voice-recognition deep-learning hacktoberfest machine-learning pytorch speech speech-recognition speech-to-text transformer voice-recognition whisper whisper-model
Last synced: 05 Apr 2025
https://github.com/jim-schwoebel/nala_assistant
🔊😊 A fastapi voice-assistant framework to quickly prototype LLM-powered voice assistants in <5 minutes.
chatbot chatgpt dolly2 fastapi fastapi-boilerplate fastapi-sqlalchemy fastapi-template large-language-models llm llms speech-recognition speech-to-text speecht5 tts voice voice-assistant voice-assistants wakeword whisper whisper-model
Last synced: 11 Apr 2025
https://github.com/hemangjoshi37a/french_audio_transcription_using_gradio
French audio transcription using gradio
audio-processing audio-to-text audio-transcription french-audio-transcription french-language gradio machine-learning speech-recognition transcription-tool whisper-model
Last synced: 15 Apr 2025
https://github.com/otonomee/youtube-to-transcript
Convert YouTube videos to text files. Why spend 30 minutes watching a video when you can skim the transcript in a couple minutes?
audio-transcription machine-learning openai python pytube speech-to-text transcription video-to-text whisper-model youtube-downloader
Last synced: 25 Feb 2025
https://github.com/seccanj/generate-subtitle-llm
Generates subtitles from a video speech (Whisper OpenAI LLM) or extracts existing subtitles, translates them into a different language using Mistral LLM and adds them to the video. Uses ffmpeg for extracting and encoding
ai ffmpeg llms machine-learning mistral-7b mistral-ai python3 subtitles-generator subtitles-translator video video-processing whisper-model
Last synced: 23 Feb 2025
https://github.com/ashwinsomi/messagingapp
A real time chat application using Next, Redis, Pub/Sub, Audio-To-Text LLM, Next-auth. I am still working on it
google-oauth2 huggingface next-auth nextjs15-typescript pusher redis rest tailwindcss whisper-model
Last synced: 31 Mar 2025
https://github.com/rishabhmathur06/fine-tuning-whisper-small-for-asr-
This repository contains notebook that shows how to fine-tune OpenAI's Whisper model on custom Hindi dataset.
artificial-intelligence asr automatic-speech-recognition fine-tuning openai python whisper whisper-model
Last synced: 06 Apr 2025
https://github.com/matheusfd3/transcriptions-and-translations
Projeto que transcreve e traduz em tempo real para português.
2024 deep-translator pulseaudio python whisper-model
Last synced: 13 Mar 2025
https://github.com/sushantdhumak/crewai-agents-minutesofmeeting-gmail
MinutesOfMeeting and Gmail is a collaborative crew of AI agents that autonomously understand audio, transcripts, summarizes, writes and drafts an email in Gmail account.
agent-ops agentic-workflow audio-segmentation chunking crewai crewai-flow gmail-api google-auth-library google-cloud-platform gpt-4o-mini llm-tools whisper-model
Last synced: 26 Mar 2025