Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Whisper
Whisper is an autoregressive language model developed by OpenAI. It is trained on a large corpus of text using a transformer architecture and is capable of generating high-quality natural language text. Whisper can be used for tasks such as language modeling, text completion, and text generation. It has shown impressive performance on various benchmarks and has been released by OpenAI to encourage research in the field of language modeling. Whisper is not yet available for public use, but it has the potential to transform the field of natural language processing and generate new opportunities for language-based applications.
- GitHub: https://github.com/topics/whisper
- Repo: https://github.com/openai/whisper
- Created by: OpenAI
- Released: August 2021
- Related Topics: machine-learning, artificial-intelligence, language-modeling,
- Last updated: 2025-02-07 00:32:40 UTC
- JSON Representation
https://github.com/aitor-alvarez/large-speech-models
Fine-tuning Multilingual Large Speech Recognition Models: Wav2vec and Whisper
arabic-speech-recognition asr asr-model finetuning-wav2vec finetuning-whisper large-speech-models speech-recognition-model wav2vec2 whisper
Last synced: 25 Jan 2025
https://github.com/jowadev/interview
Interview is an interactive application crafted to empower both students and professionals in honing their skills for job interviews.
interview-preparation job-interviews nextjs professional students whisper
Last synced: 07 Feb 2025
https://github.com/i4ds/whisper-finetune
This repository contains code for fine-tuning the Whisper speech-to-text model.
fine-tuning nlp speech-to-text whisper
Last synced: 09 Oct 2024
https://github.com/utrechtuniversity/transcription-d-lucea
python utrecht-university whisper
Last synced: 23 Jan 2025
https://github.com/cris-m/langgraph_examples
duckduckgo kokoro langgraph llama3-2 whisper
Last synced: 18 Jan 2025
https://github.com/tposcic/audio-to-srt-transcriber
Audio to srt transcriber in Python using whisper for transcription and Tcl/Tk for GUI
audio python3 srt transcription whisper
Last synced: 05 Jan 2025
https://github.com/jgw96/speech-to-text-web-toolkit
Making Speech-To-Text on the web easy, both local and in the cloud
ai lit transformersjs webcomponents whisper
Last synced: 01 Feb 2025
https://github.com/antoniosbarotsis/telegram-transcriber
A Telegram bot for transcribing voice messages
telegram transcribe voice whisper
Last synced: 26 Dec 2024
https://github.com/sugarcane-mk/speaker_classification
This repository provides a Python script for extracting speech embeddings using OpenAI's Whisper model. The embeddings are high-dimensional feature vectors that capture the acoustic properties of the input audio. These embeddings can be used for downstream tasks such as speech classification, clustering, and speaker recognition.
asr classification feature-extraction openai speech-processing speech-recognition speech-to-text svm-classifier whisper
Last synced: 09 Jan 2025
https://github.com/seanvelasco/ai
Cloudflare AI challenge submission: Slater - your virtual foreign language friend
ai artificial-intelligence language-learning llama2 llm m2m100 machine-learning whisper
Last synced: 03 Feb 2025
https://github.com/sbadulin/obsidian-dictation-plugin
Obsidian dictation plugin
dictation gpt-35-turbo obsidian obsidian-plugin openai speech-to-text whisper
Last synced: 02 Feb 2025
https://github.com/niqifan007/openai-tts-stt-streamlit
A gui interface for tts (text-to-speech) and stt (speech-to-text) interfaces using the openai api developed by Streamlit, with a history function一个使用Streamlit开发的openai的api接口的tts(文字转语音)和stt(语音转文字)接口的gui界面,带有历史记录功能
openai openai-api streamlit stt-gui tts tts-gui whisper whisper-api
Last synced: 09 Oct 2024
https://github.com/aeronjl/transcribe
Python package for accurate audio transcription with speaker diarisation
audio-transcription gpt speaker-diarization whisper
Last synced: 09 Oct 2024
https://github.com/mikeesto/subber
A small CLI tool for converting video & audio to a text transcription
audio cli ffmpeg golang transcribe video whisper
Last synced: 19 Dec 2024
https://github.com/bhattbhavesh91/openai-whisper-benchmarking
Comparing the performance of OpenAI's Whisper model on a GPU vs OpenAI's API
gpu openai speech-to-text whisper
Last synced: 16 Nov 2024
https://github.com/etienneab3d/srt-sync
Synchronize SRT timestamps over an existing accurate transcription
aligner asr nlp subtitles text-to-speech whisper
Last synced: 19 Dec 2024
https://github.com/abdnh/anki-asr
Anki add-on for speech recognition
anki anki-addon deepgram speech-recognition whisper
Last synced: 24 Nov 2024
https://github.com/jlcarveth/skreech
An HTTP API wrapper around Whisper for transcribing audio files.
speech-recognition speech-to-text whisper whisper-ai
Last synced: 19 Jan 2025
https://github.com/pawelzeja098/whisper-video-transcription
Testing whisper Open-AI to transcribe videos
audio mp3 mp4 transcription video whisper whisper-ai
Last synced: 27 Jan 2025
https://github.com/topdev0215/AudioMultifunctionChatbot
This app enabling users to either record or upload audio files. Then utilizing OpenAI API (Whisper, GPT4) generates transcriptions, summaries, fact checks, sentiment analysis, and text metrics. Users can also intelligently chat about their transcriptions with a GPT4 chatbot. Data is stored relationally in SQLite and also vectorized in Pinecone.
gpt4 langcha nltk openai python3 sqlite3 streamlit strean whisper
Last synced: 24 Oct 2024
https://github.com/wtlow003/auto-subtitles
CLI tool to transcribe (+ translate) videos and embed subtitles automatically.
faster-whisper nllb subtitles subtitles-generator translation whisper whisper-cpp
Last synced: 15 Nov 2024
https://github.com/h3yn3s/tl-dl
A selfhostable webapp which helps you read those uselessly long (by nature) voice messages with the power of AI.
Last synced: 24 Oct 2024
https://github.com/gabriellopesdesouza2002/funcspy
Functions to help you develop any program or script you want
automation chatbot dall-e email email-library ocr openai-api openai-chatgpt openai-whisper pdf pdf-tools python regex selenium selenium-webdriver whisper
Last synced: 30 Oct 2024
https://github.com/mickekring/top-of-mind-clara
Clara är en prototyp som möjliggör att anonymt kunna göra sin röst hörd. Medarbetaren kan prata eller skriva in det du vill säga och AI anonymiserar det. Medarbetaren har dessutom tillgång till en chatbot att rådfråga. Därefter analyseras och sammanställs alla medarbetares tankar i en dashboard.
ai chatbot feedback openai python streamlit transcription whisper
Last synced: 22 Dec 2024
https://github.com/voqal/browser
Natural speech browsing for the software developers of tomorrow
cef jcef openai realtime-api voice voice-assistant voice-browser voice-commands voice-control whisper
Last synced: 20 Oct 2024
https://github.com/nerdimite/meetsy-backend
AI Backend for the Workshop on Building an End-to-End AI Meeting Assistant
gpt-3 nextjs sentence-transformers tailwindcss whisper
Last synced: 24 Oct 2024
https://github.com/jesse-c/local-audio-toolkit
Some handy tools to do with audio locally.
large-language-models lm-studio macos side-project whisper
Last synced: 29 Jan 2025
https://github.com/szilvia-csernus/openai-audio-api-calls
Speech-to-text and text-to-speech API call examples, using OpenAI's whisper-1 and tts-1 models.
jupyter-notebook openai openai-api tts-1 whisper
Last synced: 09 Oct 2024
https://github.com/toLSC/tolsc-speech-to-text
Speech to text service for toLSC app implemented with OpenAI Whisper model
fastapi python speech-recognition speech-to-text tts whisper
Last synced: 24 Oct 2024
https://github.com/madh93/whisper
🎙️ My Whisper stuff
docker openai speech-recognition speech-to-text whisper whisper-cpp
Last synced: 29 Jan 2025
https://github.com/pkarpovich/kira-client
An AI-powered voice automation tool for IoT, integrating voice-triggered commands, OpenAI-driven intent recognition, and HTTP server management for seamless control of smart devices
ai-assistant intent-classification porcupine trigger-word-detection whisper
Last synced: 13 Jan 2025
https://github.com/toomore/whisper
🔐📦📜🔑🍞 Write some notes by using the GPG encrypts.
gpg notes pgp quickstart whisper
Last synced: 23 Jan 2025
https://github.com/platput/pysubs
api to get audio transcription for video files from youtube, aws s3 and such. using OpenAI Whisper
Last synced: 24 Oct 2024
https://github.com/mooerslab/bash-whisper-transcription
Bash function to ease the transcription of audio files with OpenAI's whisper.
asr audio audio-file-trancription audio-messages automate-the-boring-stuff automatic-speech-recognition automation bash bash-function beginner-friendly speech-to-text stt whisper
Last synced: 14 Dec 2024
https://github.com/sumitesh9/localizedwhisper
An initiative to make OpenAI Whisper more localized by adding support for more languages.
albanian albanian-language huggingface openai speech speech-to-text whisper
Last synced: 02 Jan 2025
https://github.com/gangula-karthik/memo-mate
🚀 Discord meetings redefined with Memo Mate: Transcribe, summarize, and automate minutes seamlessly! ✨
discord-bot huggingface mistral py-cord speech-to-text transcribe whisper
Last synced: 22 Dec 2024
https://github.com/xaionaro-go/speech
A Speech-To-Text (with translation) library for Go; currently uses Whisper (runs locally if needed; no need in any API keys)
ai converter go golang library module package speech speech-recognition speech-to-text text whisper
Last synced: 13 Jan 2025
https://github.com/team-mansumugang/mansumugang-backend
만수무강 서비스의 스프링 부트 어플리케이션입니다.
aws github-actions jpa jpa-hibernate spring-boot whisper
Last synced: 09 Oct 2024
https://github.com/becomingbabyman/eunoia-desktop
local desktop transcription and search for apple voice memos and videos
search second-brain transcription videos voice-memos whisper
Last synced: 25 Dec 2024
https://github.com/maawad/luna
Personal assistant
bot openai personal-assistant whisper
Last synced: 17 Dec 2024
https://github.com/huuquyet/phowhisper-next
Demo using PhoWhisper models of VinAI built with Transformers.js + Next.js
nextjs onnx-models phowhisper speech-recognition transformersjs vietnamese vinai whisper
Last synced: 19 Dec 2024
https://github.com/yc-w-cn/s-wave
S-WAVE is a browser-based podcast reading app with AI transcription. User data is stored locally. MIT License.
podcast pouchdb typescript wasm whisper whisper-cpp
Last synced: 28 Dec 2024
https://github.com/extrange/transcription-benchmarks
Speech to text model benchmarks
Last synced: 08 Dec 2024
https://github.com/roman01la/sub-deep
Transcribe and translate audio with AI
deepl transcribe translate whisper
Last synced: 30 Dec 2024
https://github.com/volkansah/text-to-speech-pygui-for-whisper
This is a simple Python-based GUI application that allows users to generate speech from text using the OpenAI API. The application provides a user-friendly interface for inputting text and selecting from different voices to create personalized audio output.
openai openai-api python-gui-tkinter python3 whisper whisper-ai
Last synced: 27 Jan 2025
https://github.com/pdcalado/waste
Whisper Audio Service for Transcription and Ergonomics
productivity rofi transcription tts whisper
Last synced: 21 Jan 2025
https://github.com/aspadax/subtitlegenerator
Automatically generate a subtitle for your video.
gpt machine-learning openai rust streamlit subtitles-generator whisper
Last synced: 09 Oct 2024
https://github.com/shtirmann/v2t
Telegram bot which automatically transcribes all voice and video messages to text.
ai aiogram faster-whisper python telegram-bot telegram-bot-python voice-to-text whisper
Last synced: 09 Oct 2024
https://github.com/whisper-666/TikTok-Login
TikTok Login With No Captcha No Proxy (unlimited requests)
api combo combo-checker proxyless tiktok tiktok-api tiktok-followers tiktok-followers-generator tiktok-followers-software tiktok-login tiktok-views whisper
Last synced: 24 Oct 2024
https://github.com/pjarbas/azure-ai
Examples using Azure AI services (DALLE3, Text to Speech, Whisper)
azure-openai dalle-3 image-generation-ai speech-synthesis text-to-speech whisper
Last synced: 21 Jan 2025
https://github.com/huuquyet/phowhisper-small
Converted clone of PhoWhisper: Automatic Speech Recognition for Vietnamese (2024)
onnx-models phowhisper speech-recognition transformersjs vietnamese vinai whisper
Last synced: 01 Feb 2025
https://github.com/tomdewildt/whisper-experiment
Experiments using the Whisper model from Open AI
colab jupyter python transcribe transformers translate whisper
Last synced: 27 Dec 2024
https://github.com/deshwalmahesh/whisper-fastapi-realtime
It is Front + Backend app that uses openai/whisper-large-v3-turbo in your consumer grade system to provide real live audio transcription
audio-transcription fastapi huggingface live pyaudio realtime transcription transformers whisper whisper-large
Last synced: 25 Oct 2024
https://github.com/evil0ctal/whisper-speech-to-text-api
An open source Speech-to-Text API. The project is based on OpenAI's Whisper model and uses the asynchronous features of FastAPI to efficiently wrap it and support more custom functions.
ai api fastapi openai-whisper speech-to-text speech-to-text-api whisper whisper-ai whisper-api
Last synced: 25 Oct 2024
https://github.com/thealphamerc/audio-to-text
Transcribe multi-lingual audio clips using whisper model
Last synced: 02 Feb 2025
https://github.com/bilelouahmed/vocal-assistant
Python voice assistant (based on SpeechRecognition, Whisper and XTTS models) designed to transcribe speech to text, translate across languages, engage in chat mode, and ultimately respond vocally.
chatbot llm mistral-7b neo4j python rag speech-recognition text-to-speech transcription whisper xtts
Last synced: 21 Dec 2024
https://github.com/soenneker/soenneker.libraries.whisper.ctranslate
Simply adds the Whisper_CTrantlate2 Windows executable, updated daily (if available)
ai csharp ctranslate ctranslate2 dotnet faster libraries library whisper whisperctranslate
Last synced: 29 Dec 2024
https://github.com/educa-ch/educa24-speech-to-summary
Demonstrator for an open-source speech-to-summary workflow
langchain ollama open-source open-weight speech-to-text summarization whisper
Last synced: 11 Oct 2024
https://github.com/neiltron/autocap
ALL CAPS
closedcaptions ml subtitles transcription whisper
Last synced: 19 Dec 2024
https://github.com/iamarunbrahma/smart-voice-assistant
A simple voice assistant to get your queries in speech format and generate answers using ChatGPT API in both text and audio format.
Last synced: 02 Feb 2025
https://github.com/vifill/audio-recorder-and-summarizer
This project is a Python script that records system audio on macOS using BlackHole, transcribes the audio using OpenAI's Whisper API, and summarizes the transcription using OpenAI's GPT models
ai audio blackhole gpt openai records summarize system whisper
Last synced: 20 Dec 2024
https://github.com/ekito-station/whisper-api-unity
UnityでOpenAI Whisper APIを使って文字起こしを行ったサンプル
Last synced: 20 Dec 2024
https://github.com/sudiptab2100/waku-user-chat
Waku Chat using Usernames
communication-protocol decentralised-application decentralized ethereum ipfs libp2p waku waku-connect web3 whisper zk-snarks zkp
Last synced: 20 Dec 2024
https://github.com/miosipof/asr_train
Fine-tuning OpenAI Whisper for ASR tasks on low-size datasets
asr machine-learning nlp whisper
Last synced: 07 Jan 2025
https://github.com/miosipof/whisper_inference
OpenAI Whisper ASR inference on CPU with OpenVino, PyTorch or Huggingface
asr inference machine-learning openvino pytorch whisper
Last synced: 07 Jan 2025
https://github.com/arkapravo-ghosh/speech-to-text
Speech to Text Transcription using OpenAI Whisper v3 and FastAPI
ai fastapi huggingface machine-learning openai python3 speech-to-text transformers whisper
Last synced: 21 Dec 2024
https://github.com/theaussiepom/wyoming-openai
OpenAI SST and TTS support for the Wyoming protocol
home-assistant home-assistant-assist openai sst tts whisper wyoming
Last synced: 21 Dec 2024
https://github.com/datvm/openaiwhisperclient
A HTML page for using OpenAI Whisper API for transcripting, including making subtitles. JSON is also supported.
client-side openai subtitle timestamp transcript transcription whisper whisper-ai
Last synced: 08 Feb 2025
https://github.com/teemow/mnote
Generates meeting notes and summaries from video recordings
ai chatgpt google-meet kubeai kubernetes meeting-minutes transcription video-transcription whisper
Last synced: 02 Feb 2025
https://github.com/luluw8071/whisper-tune
Finetuning Whisper on your own voice
Last synced: 07 Feb 2025
https://github.com/julrog/jokes-on-you
Storyteller
ggj2024 global-game-jam openai unity whisper
Last synced: 17 Dec 2024
https://github.com/kristofferv98/whisper_turboapi
An optimized FastAPI server for OpenAI's Whisper whisper-large-v3-turbo model using MLX turbo optimization
ai api asynchronous audio audio-processing fastapi huggingface machine-learning macos mlx model-serving nlp openai optimization python speech-to-text synchronous transcription whisper whisper-turbo
Last synced: 14 Dec 2024
https://github.com/kitschpatrol/ambient-novel
An interface for nonlinear interactive exploration of a novel.
ambient book fiction interactive novel svelte whisper
Last synced: 20 Jan 2025
https://github.com/bluebirdback/groq-subtitles
Batch video subtitle generation using Groq Whisper API
groq speech-to-text subtitles video whisper
Last synced: 21 Dec 2024
https://github.com/soenneker/soenneker.runners.whisper.ctranslate
Automatically updates the Soenneker.Whisper.CTranslate package
ai csharp ctranslate ctranslate2 dotnet faster library runner runners whisper whisperctranslate
Last synced: 28 Dec 2024
https://github.com/sivakumar-mahalingam/subtitle-generator
🎞️ Automatically generating subtitles for video files using Whisper ASR model in Python
ai audio-model audio-processing automatic-speech-recognition openai-whisper python speech-recognition speech-to-text subtitle-generator whisper
Last synced: 09 Oct 2024
https://github.com/felipecastrosales/scripts
List of useful scripts.
audio helper-functions helpers ia pip python python3 script scripts video whisper whisper-ai
Last synced: 22 Dec 2024
https://github.com/aixerum/faster-whisper
faster-whisper is a reimplementation of OpenAI's Whisper model using CTranslate2, which is a fast inference engine for Transformer models. This implementation is up to 4 times faster than openai/whisper for the same accuracy while using less memory. The efficiency can be further improved with 8-bit quantization on both CPU and GPU.
ctranslate2 gpu transcription whisper
Last synced: 07 Jan 2025
https://github.com/zdwolfe/transcription-tools
Docker video transcriber, wrapper around OpenAI
openai transcription whisper whisper-ai
Last synced: 02 Jan 2025
https://github.com/ashot72/answering-questions-about-images
You can upload images, ask questions about images using voice prompts, then listen to the responses in voice
answering-questions blip-2-ai-model gtts large-language-models llm replicate speech-to-text text-to-speech whisper
Last synced: 30 Dec 2024
https://github.com/chloelavrat/speech-to-text-app
Speech to text web app based on Streamlit and whisper that extract script for audio or youtube video.
audio-processing machine-learning machinelearning speech-to-text streamlit streamlit-webapp stt whisper whisper-ai
Last synced: 02 Jan 2025
https://github.com/man2dev/whisper-cpp
dev fork of https://src.fedoraproject.org/rpms/whisper-cpp
fedora fedora-repository linux whisper whisper-cpp whispercpp
Last synced: 09 Oct 2024
https://github.com/valkryst/whisper_automations
Various scripts for automating tasks using OpenAI's Whisper.
automation openai subtitle subtitle-generator transcription translation whisper
Last synced: 26 Dec 2024
https://github.com/yousofss/speechtotext
Speech-to-Text using OpenAI's Whisper model
audio-to-text openai openai-whisper speech-to-text transcription whisper whisper-ai
Last synced: 09 Oct 2024
https://github.com/sskorol/home-assistant-voice
Home Assistant Voice PE Setup Guide
docker home-assistant home-automation piper smart-home speech-recognition speech-synthesis voice-assistant whisper
Last synced: 04 Feb 2025
https://github.com/zahidhasann88/video-summarizer
A videos by extracting audio and generating summaries based on the audio content.
nodejs openai typescript whisper
Last synced: 07 Jan 2025
https://github.com/nexuslux/simultaneous-interpretation
Simultaneous-Interpretation is an advanced tool for real-time simultaneous interpretation. It transcribes and translates spoken language from a microphone input instantaneously, continually refining translations for accuracy. Ideal for business meetings, educational settings, and live events, it enhances multilingual communication effortlessly.
agents asr faster-whisper openai pyaudio simultaneous-intepreting simultaneous-translation speech-recognition speech-to-text transcription translation whisper
Last synced: 09 Oct 2024
https://github.com/cnseniorious000/dl-a2t
download, audio-to-text PyPI: https://pypi.org/p/dl-a2t
audio transcription whisper youtube
Last synced: 02 Jan 2025
https://github.com/cp3249/athena_project
Athena is an AI assistant project that utilizes voice recognition, text-to-speech, and tool-calling capabilities to provide a conversational and interactive experience. It uses LLMs available through Ollama and provides a basic framework for extending functionalities through a modular tool system.
Last synced: 15 Jan 2025
https://github.com/ubos-tech/node-red-contrib-speech-to-text-ubos
Learn how to turn audio into text.
ai low-code lowcode node-red node-red-contrib node-red-flow openai openai-api openai-whisper speech-to-text whisper whisper-ai whisper-api
Last synced: 20 Jan 2025