Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Whisper
Whisper is an autoregressive language model developed by OpenAI. It is trained on a large corpus of text using a transformer architecture and is capable of generating high-quality natural language text. Whisper can be used for tasks such as language modeling, text completion, and text generation. It has shown impressive performance on various benchmarks and has been released by OpenAI to encourage research in the field of language modeling. Whisper is not yet available for public use, but it has the potential to transform the field of natural language processing and generate new opportunities for language-based applications.
- GitHub: https://github.com/topics/whisper
- Repo: https://github.com/openai/whisper
- Created by: OpenAI
- Released: August 2021
- Related Topics: machine-learning, artificial-intelligence, language-modeling,
- Last updated: 2024-11-14 00:26:52 UTC
- JSON Representation
https://github.com/daymade/tiktok-whisper
Batch convert video to text using openai's whisper or the local coreML via whisper.cpp on your MacBook
coreml openai pgvector podcast postgresql sqlite tiktok whisper whisper-cpp xiaoyuzhou
Last synced: 07 Nov 2024
https://github.com/shamspias/chatgpt-voice-chatbot-telegram
ChatGPT Voice Chatbot Telegram is a Python and Flask-based GitHub repository that enables users to communicate with an AI chatbot using voice-to-text and text-to-voice technologies powered by OpenAI. The repository provides a flexible and customizable solution for building advanced voice-enabled chatbots using natural language processing.
celery chatbot chatgpt dall-e flask gpt-3 openjourney python telegram-bot telegram-voice-chat text-to-speech text-to-speech-python3 tts voice-chat voice-conversion voice-recognition voice-to-text whisper
Last synced: 03 Aug 2024
https://github.com/dartvauder/neurosandboxwebui
(Windows/Linux) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) on python (Gradio interface). Translated on 3 languages
audioldm cogvideox demucs diffusers flux gradio llamacpp llm neural-network python rvc seamlessm4t stable-diffusion stableaudio stablefast3d transformers tts wav2lip webui whisper
Last synced: 29 Oct 2024
https://github.com/EtienneAb3d/karaok-AI
Karaoke Player / Editor with automatic clip creation from any song file using vocals and lyrics extraction (Speech-to-Text)
djing karaoke karaoke-maker lyrics mp3-player music party-apps sound-processing speech-to-text srt-subtitles subtitles vad whisper
Last synced: 08 Nov 2024
https://github.com/kurianbenoy/indic-subtitler
Open source subtitling platform 💻 for transcribing and translating videos/audios in Indic languages.
asr deep-learning fastapi faster-whisper inference nextjs openai quantization speech-recognition speech-to-text transformers vegam-whisper webapp whisper whisperx
Last synced: 01 Nov 2024
https://github.com/i5ucc/vrctextboxstt
A SpeechToText application that uses OpenAI's whisper via faster-whisper to transcribe audio and send that information to VRChats textbox system and/or KillFrenzyAvatarText over OSC. Also supports various other methods like OBS via Browsersource and a SteamVR overlay!
obs openai openai-whisper openvr osc speech-recognition speech-to-text vrchat vrchat-avatars vrchat-osc vrchat-sdk3 vrchat-tool whisper
Last synced: 09 Oct 2024
https://github.com/runpod/serverless-workers
⚙️ | REPLACED BY https://github.com/runpod-workers | Official set of serverless worker provided by RunPod as endpoints.
ai anything-v3 containers docker openjourney runpod serverless stable-diffusion whisper workers
Last synced: 07 Aug 2024
https://github.com/aronweiler/assistant
An intellligent AI assistant that can do anything!
ai database large-language-models llama2 llamacpp llm open-ai open-ai-api openai pgvector polly-voice postgres postgresql python streamlit transcription voice-assistant voice-recognition whisper
Last synced: 10 Oct 2024
https://github.com/QuantiusBenignus/blurt
Gnome shell extension for accurate speech to text input in Linux using whisper.cpp. Input text from speech anywhere.
ai asr bloat-free dictate dictation gnome gnome-extension gnome-shell-extension input input-method kiss linux machine-learning speech-recognition speech-to-text whisper whisper-cpp
Last synced: 08 Nov 2024
https://codeberg.org/pluja/web-whisper
New repo: https://codeberg.org/pluja/web-whisper-plus
ai audio go openai speech-to-text svelte transcription translation ui web whisper
Last synced: 14 Nov 2024
https://github.com/gewoonjaap/winwhisper
Create subtitles with ease, using Whisper AI for Windows
csharp openai subtitle subtitles subtitles-generator videos whisper whisper-ai
Last synced: 27 Oct 2024
https://github.com/kurianbenoy/whisper_normalizer
A python package for whisper normalizer
asr asr-benchmark jupyter-notebook nbdev normalizers openai whisper
Last synced: 01 Nov 2024
https://github.com/lucaluke13/talkybotty
Simply forward a video or voice message in any language to the bot, and it will reply with a translation.
ai osint telegram-bot text-to-speech translation voice whisper
Last synced: 12 Oct 2024
https://github.com/jorianwoltjer/autocaptions
A GUI tool that uses OpenAIs Whisper to transcribe text from an audio/video file, into a Premiere Pro sequence to automate the creation of subtitles.
ai premiere-pro srt subtitles whisper xml
Last synced: 07 Nov 2024
https://github.com/pinto0309/faster-whisper-env
An environment where you can try out faster-whisper immediately.
Last synced: 22 Oct 2024
https://github.com/divineux23/audio-to-audio-translation
Imagine translating your speech or anybody's speech to any language you want within minutes. check this out...
chatgpt elevenlabs flask language translator whisper
Last synced: 10 Nov 2024
https://github.com/jovanveljanoski/jupyter-voicepilot
A JupyterLab extension for generating code and interacting with JupyterLab Notebooks via voice commands
gpt-3 jupyterlab jupyterlab-extension voice whisper
Last synced: 09 Oct 2024
https://github.com/pas1ko/meeper
Meeper 📝 - is your secretary for any in-browser conference.
ai chatgpt extension langchain summary transcription whisper
Last synced: 27 Aug 2024
https://github.com/faker2048/youtube-faster-whisper
YTWS is a simple CLI tool that downloads YouTube videos and creates subtitles quickly. It uses yt-dlp for downloading and faster-whisper for transcribing, making it easy and efficient to use.
substitle tools transcript whisper youtube
Last synced: 11 Oct 2024
https://github.com/serg-plusplus/meeper
Meeper 📝 - is your secretary for any in-browser conference.
ai chatgpt extension langchain summary transcription whisper
Last synced: 02 Nov 2024
https://github.com/saharmor/anima
Turn text into video using Stable Diffusion and Google FILM
artificial-intelligence deep-learning generativeai generativeart stable-diffusion text-to-video whisper
Last synced: 13 Nov 2024
https://github.com/platisd/phonix
Generate captions for videos using the power of OpenAI's Whisper API
openai openai-api openai-whisper video-srt video-to-caption video-to-text whisper
Last synced: 27 Oct 2024
https://github.com/appleboy/go-whisper
Speech o Text using docker image with ggerganov/whisper.cpp
golang openai whisper whisper-ai whisper-cpp
Last synced: 15 Oct 2024
https://github.com/Maitreyapatel/speech-conversion-between-different-modalities
Generative Adversarial Networks for different impaired speech conversions
deep-learning generative-adversarial-networks pytorch speech-conversion voice-conversion whisper
Last synced: 07 Aug 2024
https://github.com/DivineUX23/Audio-to-Audio-translation
Imagine translating your speech or anybody's speech to any language you want within minutes. check this out...
chatgpt elevenlabs flask language translator whisper
Last synced: 24 Oct 2024
https://github.com/KostasEreksonas/Audio-transcriber
Simple Python audio transcriber using OpenAI's Whisper speech recognition model
audio audio-to-text openai openai-whisper pip python text transcription whisper youtube youtube-dl
Last synced: 07 Aug 2024
https://github.com/solygambas/python-openai-projects
13 projects using ChatGPT API, Whisper, Embeddings, and DALL-E with Python.
auto-gpt chatbot chatgpt dall-e embeddings gpt-4 langchain langchain-python machine-learning nlp nlp-machine-learning open-ai-api openai python reddit reddit-api spotify spotify-api stable-diffusion whisper
Last synced: 27 Oct 2024
https://github.com/eyevinn/auto-subtitles
Automatically generate subtitles from an input audio or video file using OpenAI Whisper
ffmpeg openai openai-whisper subtitle-generator subtitles subtitles-generator tools transcription video video-streaming whisper
Last synced: 09 Nov 2024
https://github.com/JorianWoltjer/AutoCaptions
A GUI tool that uses OpenAIs Whisper to transcribe text from an audio/video file, into a Premiere Pro sequence to automate the creation of subtitles.
ai premiere-pro srt subtitles whisper xml
Last synced: 04 Aug 2024
https://github.com/nooqta/kodyfire
AI-powered code generator and automation tool
ai automation boilerplate chatgpt cli codex generator low-code no-code openai openai-api scaffold template typescript whisper yeoman
Last synced: 07 Nov 2024
https://github.com/alxpez/alts
100% free, local & offline voice assistant with speech recognition
assistant chatbot llm local offline ollama speech-recognition stt tts voice voice-assistant whisper
Last synced: 20 Oct 2024
https://github.com/yj-20/auto-subtitle-translate
Automatically generate, translate, and overlay subtitles for any video.
ai ai-subtitle automatic-subtitle deep-learning ffmpeg llama llama2 python subtitle-generator subtitles subtitles-generator translates translator whisper
Last synced: 27 Sep 2024
https://github.com/yohasebe/whisper-stream
A bash script that uses the OpenAI Whisper API to transcribe continuous spoken audio into text
command-line dictation openai transcription voice-to-text whisper
Last synced: 08 Nov 2024
https://github.com/beingamanforever/tech-enhanced-ai-interview-learning-platform
Developed a sophisticated machine learning model capable of generating diverse interview questions aligned with specific topics, ensuring depth of conversation. Integrated advanced Natural Language Processing (NLP) algorithms to analyse spoken responses, identifying grammatical errors & offering accurate corrections after the interview.
ai-chatbot ai-chatbots api chatbot dataset fine flask huggingface huggingface-transformers inteview-test kaggle kaggle-notebooks large latex-document machine-learning mlops openai whisper
Last synced: 11 Oct 2024
https://github.com/aifsh/comfyui-whisperx
a comfyui cuatom node for audio subtitling based on whisperX and translators
srt-subtitles sutitles translation whisper
Last synced: 08 Nov 2024
https://github.com/smaranjitghose/AIAudioTranscriber
A minimalistic web app to generate transciption for audio built using Python
docker open-source openai python python3 speech-recognition speech-to-text streamlit streamlit-lottie streamlit-webapp whisper
Last synced: 07 Aug 2024
https://codeberg.org/pluja/web-whisper-plus
NEW VERSION AT: https://github.com/pluja/whishper. A transcription suite on your web browser: OpenAI's whisper and many other features. Formerly "web-whisper-plus"
ai audio docker go golang speech subtitles sveltekit text transcription whisper
Last synced: 24 Oct 2024
https://github.com/Eyevinn/auto-subtitles
Automatically generate subtitles from an input audio or video file using OpenAI Whisper
ffmpeg openai openai-whisper subtitle-generator subtitles subtitles-generator tools transcription video video-streaming whisper
Last synced: 07 Nov 2024
https://github.com/umer-sheikh/bird-whisperer
[InterSpeech 2024] Official code repository of paper titled "Bird Whisperer: Leveraging Large Pre-trained Acoustic Model for Bird Call Classification" accepted in InterSpeech 2024 conference.
bird-call-classification birdclef-2023 fine-tuning whisper
Last synced: 09 Oct 2024
https://github.com/jim-schwoebel/nala_assistant
🔊😊 A fastapi voice-assistant framework to quickly prototype LLM-powered voice assistants in <5 minutes.
chatbot chatgpt dolly2 fastapi fastapi-boilerplate fastapi-sqlalchemy fastapi-template large-language-models llm llms speech-recognition speech-to-text speecht5 tts voice voice-assistant voice-assistants wakeword whisper whisper-model
Last synced: 07 Nov 2024
https://github.com/abus-aikorea/kara-audio
Gradio WebUI for whisper, faster-whisper, whisper-timestamped. Supports YouTube Downloader, Vocal Remover and Transcription.
asr demucs faster-whisper gradio karaoke mdx-net music-source-separation openai-whisper speech-recognition speech-to-text stt subtitle uvr vocal-remover webui whisper
Last synced: 10 Nov 2024
https://github.com/ultmaster/whisper-movie
Generate subtitles for long movies / podcasts with OpenAI Whisper API.
audio-transcription speech-to-text subtitles translation whisper
Last synced: 08 Nov 2024
https://github.com/Nachimak28/LAI-voice-search-openai-whisper-demo
A ⚡️ Lightning.ai ⚡️ app demo for Voice based web search using OpenAI's Whisper and DuckDuckGo
openai speech-to-text websearch whisper
Last synced: 24 Oct 2024
https://github.com/nitaiaharoni1/whisper-speech-to-text
Whisper Speech-to-Text is a JavaScript library for recording and transcribing user audio into text via OpenAI's Whisper, intended for web applications.
javascript openai openai-whisper react speech speech-recognition speech-to-text text-recognition typescript webapp whisper whisper-ai
Last synced: 13 Nov 2024
https://github.com/codingforentrepreneurs/Smarter-Web-Scraping-with-Python
Leverage modern open-source tools to create better web scraping workflows.
apple-itunes-search-api brightdata gpt gpt3 hacker-news itunes-podcast-api llama2 llm ollama open-source openai podcast proxy-scraper python3 selenium whisper
Last synced: 15 Oct 2024
https://github.com/ireddragonicy/vixevia
An AI-powered Virtual YouTuber (Vtuber) utilizing Google's Gemini language model to create engaging, personalized, and context-aware interactions. This project explores the potential of AI in human-computer interaction and virtual content creation.
ai anime api artificial-intelligence chatbot collaborate gemini-api gemini-chatbot gemini-pro gemini-pro-vision gemini-vision-pro girl google javascript python vits vtuber waifu whisper youtuber
Last synced: 10 Nov 2024
https://github.com/tensorchord/inference-benchmark
Benchmark for machine learning model online serving (LLM, embedding, Stable-Diffusion, Whisper)
benchmark inference-server llm stable-diffusion whisper
Last synced: 12 Nov 2024
https://github.com/CrimeIsDown/trunk-transcribe
Transcription of calls from trunk-recorder using OpenAI Whisper
celery meilisearch openai-whisper telegram-bot trunk-recorder whisper
Last synced: 07 Aug 2024
https://github.com/lrq3000/futo-voiceinput-whisper
Mirror of FUTO's Voice Input, an Android Voice Keyboard for Speech-To-Text transcribing using Whisper, supporting large multilanguage models and with automatic language detection
android speech-to-text whisper
Last synced: 09 Nov 2024
https://github.com/rufuszhu/WhisperSRT
Generate subtitle for video using whisper and translate to other language using DeepL
openai python srt translation whisper
Last synced: 24 Oct 2024
https://github.com/botbahlul/whisper_autosrt
A python script COMMAND LINE utility to AUTO GENERATE SUBTITLE FILE (using faster_whisper module which is a reimplementation of OpenAI Whisper module) and TRANSLATED SUBTITLE FILE (using unofficial online Google Translate API) for any video or audio file
auto-caption auto-subtitle caption faster-whisper ffmpeg google-translate-api openai openai-whisper python speech-recognition speechrecognition subtitle voice-recognition voicerecognition whisper
Last synced: 09 Oct 2024
https://github.com/IRedDragonICY/vixevia
An AI-powered Virtual YouTuber (Vtuber) utilizing Google's Gemini language model to create engaging, personalized, and context-aware interactions. This project explores the potential of AI in human-computer interaction and virtual content creation.
ai anime api artificial-intelligence chatbot collaborate gemini-api gemini-chatbot gemini-pro gemini-pro-vision gemini-vision-pro girl google javascript python vits vtuber waifu whisper youtuber
Last synced: 24 Oct 2024
https://github.com/aviaryan/voice-writing-electron
A real-time, instant dictation desktop application built on Electron that uses Whisper and GROQ under the hood
electron groq groq-api svelte whisper whisper-cpp
Last synced: 10 Oct 2024
https://github.com/bzed/whisper-to-graphite
Read and send metrics from whisper files to graphite - Used to migrate to different graphite backends
golang graphite graphite-backends metrics migration whisper whisper-files
Last synced: 24 Oct 2024
https://github.com/markgoodhead/dictate-wizard
Dictate Wizard is an open source dictation tool powered by OpenAI's Whisper. The goal is to obsolete as much typing as possible and let you speak your emails, instant messages etc instead.
conjecture faster-whisper openai soniox whisper
Last synced: 24 Oct 2024
https://github.com/fengredrum/finetune-whisper-lora
Fine-Tune Whisper with Transformers and PEFT
Last synced: 24 Oct 2024
https://github.com/transcribejs/transcribe.js
Monorepo for Transcribe.js
javascript speech speech-recognition speech-to-text transcribe wasm whisper
Last synced: 27 Oct 2024
https://github.com/GodModed/ai-captions
This small project uses OpenAI's whisper AI to generate captions for videos.
ai captions collaborate communityexchange github gitlens learn python student-vscode whisper
Last synced: 24 Oct 2024
https://github.com/neonwatty/bleep_that_sht
Make someone sound naughty - bleep out words of your choice leveraging Whisper transcription
ai demo-app generative-ai machine-learning transcribe transcription whisper
Last synced: 09 Oct 2024
https://github.com/QuantiusBenignus/BlahST
Input text from speech in any Linux window, the lean, fast and accurate way, using whisper.cpp offline.
accessibility ai bloat-free bloatfree cli command-line command-line-tool desktop-integration gnome kiss llm machine-learning no-nonsense speech-recognition speech-to-text whisper whisper-cpp
Last synced: 24 Oct 2024
https://github.com/devemperor/dictate
A powerful Whisper AI keyboard for reliable speech transcription
android keyboard openai openai-api whisper whisper-ai
Last synced: 09 Oct 2024
https://github.com/thewh1teagle/pyannote-rs
pyannote audio diarization in rust
asr diarization onnxruntime rust speech-recognition whisper
Last synced: 09 Oct 2024
https://github.com/thinh-vu/ur_audio_sub
Generate text captions for audio files & youtube video using OpenAI Whisper on Google Colab. Multiple languages support.
audio-to-text audio-transcription caption-generator speech-recognition whisper
Last synced: 07 Nov 2024
https://github.com/mcdallas/whispersub
format whisper transcripts to .srt
openai srt subtitles transcription whisper whisper-cpp
Last synced: 27 Oct 2024
https://github.com/transcripts4all/tools4all
A curated collection of tools to aid transcriptionists and subtitlers.
deaf google-colab google-colab-notebook hard-of-hearing ipython-notebook jupyter-notebook openai subtitles transcription whisper whisper-ai
Last synced: 11 Oct 2024
https://github.com/CoHuK/gpt-telegram-bot
GPT/Whisper/DALL-E Telegram Bot with easy deployment using Chalice to AWS Lambda
aws-lambda dall-e gpt gpt-35-turbo telegram telegram-bot whisper
Last synced: 24 Oct 2024
https://github.com/sloganking/desk-talk
A desktop transcription software
desktop dictation transcription whisper
Last synced: 13 Nov 2024
https://github.com/neka-nat/mylangrobot
Language instructions to mycobot using GPT-4V
chatgpt gpt-4-vision gpt-4-vision-preview gpt4v mycobot segment-anything whisper
Last synced: 14 Oct 2024
https://github.com/santima10/resumico
🤖 A WhatsApp bot to transcribe and summarize audio messages.
google-cloud-platform gpt-3 openai speech-to-text whatsapp-api whatsapp-bot whisper
Last synced: 07 Nov 2024
https://github.com/pulijon/sttcast
Transcription from mp3 files to html with or without embedded player
ansible automation aws-ec2 aws-s3 g4dn gpu ia iac puppet python terraform transcription vagrant vosk-engine whisper
Last synced: 14 Oct 2024
https://github.com/ieasybooks/almufarrigh
الواجهة الرسومية الخاصة بأداة تفريغ على أنظمة التشغيل المختلفة
ai audio-processing desktop linux macos python qt subtitles video-processing whisper windows wit
Last synced: 08 Nov 2024
https://github.com/xuegao-tzx/whisper_flutter_new
A flutter library for offline speech-to-text conversion which use whisper.cpp models implementation for Android、iOS、macOS.
android flutter ios whisper whisper-cpp
Last synced: 09 Oct 2024
https://github.com/machinelearningzh/audio-transcription
Transcribe any audio or video file. Edit and view your transcripts in a standalone HTML editor.
audio-transcription machine-learning whisper
Last synced: 02 Nov 2024
https://github.com/XMuli/ThinkyMatePages
Simple and easy to use desktop application for ChatGPT & AI, will supporting Window, MacOS, Linux platforms. | 洁且易用的 ChatGPT/星火大模型 & AI 的跨平台客户端
chatgpt cross-platform linux macos openai qt whisper
Last synced: 08 Nov 2024
https://github.com/gumblex/whisper_vad
Whisper.cpp Speech-to-text with Voice Acticity Detection
speech-to-text whisper whisper-cpp
Last synced: 06 Nov 2024
https://github.com/ssciwr/vink
A stand-alone application with GUI for OpenAI's Whisper
gui hacktoberfest iwr-hacktoberfest openai pyinstaller speech-to-text transcription whisper whisper-ai
Last synced: 09 Nov 2024
https://github.com/eryk-mazus/sigh
Seamless Voice Interactions with LLMs
llm speech-recognition speech-to-text voice-recognition whisper
Last synced: 24 Oct 2024
https://github.com/miclast/FreePBX-Call-intrusion
Intrusion. Custom Asterisk dial plan for listen, whisper and barge in calls. For Asterisk FreePBX, Issabel, Asterisk based Elastix call centers.
asterisk barge call callcenter intrusion monitoring whisper
Last synced: 24 Oct 2024
https://github.com/jjwroeloffs/transcribe_align_textgrid
A small wrapper package around whisper-timestamped. Create force-aligned transcription TextGrids from raw audio!
force-alignment praat speech-recognition speech-to-text textgrid whisper
Last synced: 01 Nov 2024
https://github.com/t0mer/wassist
Wassist allows you to contact GPT3 directly from WhatsApp and not only that. Wassist also allows you to save your own personal data and later search and retrieve it using GPT3 to generate a response. In the examples folder, you can see several examples of how to use this bot so you don't have to remember anything ever again.
dall-e docker personal-assistant python weather whatsapp whisper
Last synced: 15 Oct 2024
https://github.com/RoyNkem/SwiftUI-AI-Voice-Assistant
A multi-platform app for voice-based interactions built using SwiftUI with advanced AI capabilities.
gpt-4 ios macos mvvm openai-api swiftui text-to-speech visionos whisper
Last synced: 23 Oct 2024
https://github.com/mribeirodantas/nf-whisper
Proof-of-concept Nextflow pipeline to interact with OpenAI Whisper
docker nextflow pipeline speech-to-text transcription whisper
Last synced: 15 Oct 2024
https://github.com/stayallive/whisper-subtitles
Generate subtitles (.srt and .vtt) from audio files using OpenAI's Whisper models.
Last synced: 24 Oct 2024
https://github.com/YvesCheung/Whisper
一套用于代码检阅的注解
android annotation inspect lint whisper
Last synced: 24 Oct 2024
https://github.com/abus-aikorea/studio-free
youtube download, vocal remover, vocal extraction, karaoke video production, STT, automatic speech recognition, transcription, automatic subtitle, AI, yt-dlp, demucs, whisper, webui, gradio, windows
ai automatic-speech-recognition automatic-subtitle demucs gradio karaoke openai stt transcription video-download vocal-remover webui whisper windows yt-dlp
Last synced: 10 Nov 2024
https://github.com/sanhacks/aigen
Multi Model Personal Assistant Wrapper in Go: Interact with ChatGPT, Claude or Ollama Cross Platform (Speech & Image generation supported)
chatbot gpt3-turbo openai speech-recognition speech-synthesis speech-to-text text-to-speech tts voice whisper
Last synced: 09 Oct 2024
https://github.com/status-im/status-js-api
Status Javascript Client (WIP)
ethereum javascript shh status-im web3 web3js whisper
Last synced: 01 Nov 2024
https://github.com/sepiropht/auto-subtitle
Automatic subtitles in your videos
ffmpeg openai subtitles subtitles-generator whisper
Last synced: 09 Oct 2024
https://github.com/redocrepus/ahk-whisper-paste
Allows dictating anywhere in Windows using AutoHotKey and OpenAI's Whisper speech-to-text engine.
dictation openai openai-api text-to-speech voice-typing whisper whisper-ai windows
Last synced: 24 Oct 2024
https://github.com/scalable-ml-deep-learning/fine_tune_whisper
Fine-Tune Whisper for Italian ASR with transformers
automatic-speech-recognition common-voice-dataset huggingface openai transformers whisper
Last synced: 24 Oct 2024
https://github.com/devanshu-17/transcriptiq
TranscriptIQ is a project that enables users to transcribe YouTube videos and perform various NLP (Natural Language Processing) tasks, chat with youtube video and many more on the transcribed text.
clarifai-python cohere streamlit whisper
Last synced: 11 Oct 2024
https://github.com/hay/audio2text
Python command line utility wrappers for Whispercpp and other speech-to-text utilities
speech-recognition speech-to-text stt whisper whisper-cpp
Last synced: 15 Oct 2024