Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Whisper
Whisper is an autoregressive language model developed by OpenAI. It is trained on a large corpus of text using a transformer architecture and is capable of generating high-quality natural language text. Whisper can be used for tasks such as language modeling, text completion, and text generation. It has shown impressive performance on various benchmarks and has been released by OpenAI to encourage research in the field of language modeling. Whisper is not yet available for public use, but it has the potential to transform the field of natural language processing and generate new opportunities for language-based applications.
- GitHub: https://github.com/topics/whisper
- Repo: https://github.com/openai/whisper
- Created by: OpenAI
- Released: August 2021
- Related Topics: machine-learning, artificial-intelligence, language-modeling,
- Last updated: 2025-02-04 00:30:59 UTC
- JSON Representation
https://github.com/doctorpok42/pheere-app
Pheere is a simple virtual assistant
ai chatgpt desktop-app elevenlabs nextjs scss tauri ts virtual-assistant whisper
Last synced: 10 Jan 2025
https://github.com/saamerm/whisperkit-ios15
iOS 15 - On-device Inference of Whisper Speech Recognition Models for Apple Silicon
ios ios15 swiftui whisper whisper-ai
Last synced: 19 Jan 2025
https://github.com/madh93/whisper
🎙️ My Whisper stuff
docker openai speech-recognition speech-to-text whisper whisper-cpp
Last synced: 29 Jan 2025
https://github.com/jojasadventure/whisper-client
Very simple Python based client for Whisper compatible endpoint
desktop-app dictation faster-whisper macos productivity python speech-to-text stt whisper
Last synced: 09 Oct 2024
https://github.com/voqal/browser
Natural speech browsing for the software developers of tomorrow
cef jcef openai realtime-api voice voice-assistant voice-browser voice-commands voice-control whisper
Last synced: 20 Oct 2024
https://github.com/breadrock1/audio-to-text
There is simple backend project to use whisper-rs.
actix-web audio-to-text rust swagger-ui whisper
Last synced: 10 Jan 2025
https://github.com/winstxnhdw/capgen
A fast CPU-first video/audio transcriber for generating caption files with Whisper and CTranslate2, hosted on Hugging Face Spaces.
asr automatic-speech-recognition caddy ctranslate2 docker fastapi huggingface huggingface-spaces uvicorn-gunicorn whisper
Last synced: 23 Oct 2024
https://github.com/adisol07/sharpspeech
SharpSpeech is free, local and open source way to speech and wake word recognition.
audio speech speech-recognition speech-to-text wake-word-detection wakeword whisper whisper-ai
Last synced: 19 Dec 2024
https://github.com/canaxs/whisper-core
An application where users can make rumor-based news and earn money in return.
mysql panel spring spring-boot whisper
Last synced: 19 Dec 2024
https://github.com/toomore/whisper
🔐📦📜🔑🍞 Write some notes by using the GPG encrypts.
gpg notes pgp quickstart whisper
Last synced: 23 Jan 2025
https://github.com/oov/aviutl_subtitler
AviUtl+拡張編集の環境で Whisper による文字起こしをするためのプラグイン
Last synced: 19 Dec 2024
https://github.com/nri12/filter_voice
Dự án lọc và tắt tiếng video những từ khóa mong muốn
Last synced: 19 Dec 2024
https://github.com/brentwong-kiel1997/ai_language_school_based_on_django_and_openai
Django and OpenAI API example use case
django gpt-4 openai openai-api whisper
Last synced: 09 Oct 2024
https://github.com/egorsmkv/star-adapt-uk
Fork of https://github.com/YUCHEN005/STAR-Adapt with some modifications for Ukrainian.
asr speech-recognition ukrainian whisper
Last synced: 19 Dec 2024
https://github.com/TranBaVinhSon/eth-decentralized-chat
Decentralized chat app by Ethereum Whisper protocol + Vuejs
ethereum vue vuejs whisper whisper-protocol
Last synced: 24 Oct 2024
https://github.com/aws-samples/amazon-ivs-webgpu-captions-demo
This repository contains an experimental demo application that shows how you can add client-side auto-generated captions to Amazon IVS Real-time and Low-latency streams using transformers.js and WebGPU.
ai amazon-ivs aws captions experimental ivs-lowlatency ivs-realtime lambda lowlatency lvl-300 realtime serverless transformersjs web webgpu webrtc whisper
Last synced: 09 Oct 2024
https://github.com/chinese-soup/cbot-telegram-whisper
Simple bot that transcribes Telegram voice messages. Powered by go-telegram-bot-api & whisper.cpp Go bindings.
bot cpu-inference golang openai speech-recognition speech-to-text whisper whisper-cpp whispercpp
Last synced: 17 Jan 2025
https://github.com/jowadev/interview
Interview is an interactive application crafted to empower both students and professionals in honing their skills for job interviews.
interview-preparation job-interviews nextjs professional students whisper
Last synced: 14 Dec 2024
https://github.com/Op27/meeting_minutes_generator
This Python application automates the process of generating meeting minutes from an audio recording. It uses the Whisper library for transcription and the OpenAI GPT models for summarizing content, then outputs the result in a Word document.
ai audio-processing document-automation meeting-minutes openai python speech-recognition text-summarization transcription whisper
Last synced: 24 Oct 2024
https://github.com/natanielf/lecsum
Automatically transcribe and summarize lecture recordings completely on-device using AI.
ollama ollama-python whisper whisper-ai
Last synced: 18 Dec 2024
https://github.com/extrange/transcription-benchmarks
Speech to text model benchmarks
Last synced: 08 Dec 2024
https://github.com/gangula-karthik/memo-mate
🚀 Discord meetings redefined with Memo Mate: Transcribe, summarize, and automate minutes seamlessly! ✨
discord-bot huggingface mistral py-cord speech-to-text transcribe whisper
Last synced: 22 Dec 2024
https://github.com/baristikir/voice-typing
Simple Desktop Application with Voice Typing features. Runs locally, transcribes locally and works fully offline with support for real-time transcribing. Powered by OpenAI Whisper ASR-models and whisper.cpp inference engine
Last synced: 24 Dec 2024
https://github.com/tylim88/voicefu
Translate Speech Into Japanese
chatgpt speech-synthesis voicevox whisper
Last synced: 18 Dec 2024
https://github.com/shani-sinojiya/sandalquest
AI/ML project for recognizing colloquial Kannada speech and building a speech-based Q&A system focused on sandalwood cultivation.
ai audio-processing data-augmentation deep-learning machine-learning mongodb nlp python pytorch question-answering speech-based-question-answering-system speech-recognition whisper
Last synced: 10 Jan 2025
https://github.com/marquesafonso/multilang-asr-captioner
A multilingual automatic speech recognition and video captioning tool using faster whisper. Supports real-time translation to english. Runs on consumer grade cpu.
automatic-speech-recognition captioning-videos faster-whisper whisper
Last synced: 24 Oct 2024
https://github.com/mooerslab/bash-whisper-transcription
Bash function to ease the transcription of audio files with OpenAI's whisper.
asr audio audio-file-trancription audio-messages automate-the-boring-stuff automatic-speech-recognition automation bash bash-function beginner-friendly speech-to-text stt whisper
Last synced: 14 Dec 2024
https://github.com/platput/pysubs
api to get audio transcription for video files from youtube, aws s3 and such. using OpenAI Whisper
Last synced: 24 Oct 2024
https://github.com/szilvia-csernus/openai-audio-api-calls
Speech-to-text and text-to-speech API call examples, using OpenAI's whisper-1 and tts-1 models.
jupyter-notebook openai openai-api tts-1 whisper
Last synced: 09 Oct 2024
https://github.com/sugarcane-mk/speaker_classification
This repository provides a Python script for extracting speech embeddings using OpenAI's Whisper model. The embeddings are high-dimensional feature vectors that capture the acoustic properties of the input audio. These embeddings can be used for downstream tasks such as speech classification, clustering, and speaker recognition.
asr classification feature-extraction openai speech-processing speech-recognition speech-to-text svm-classifier whisper
Last synced: 09 Jan 2025
https://github.com/toLSC/tolsc-speech-to-text
Speech to text service for toLSC app implemented with OpenAI Whisper model
fastapi python speech-recognition speech-to-text tts whisper
Last synced: 24 Oct 2024
https://github.com/bhattbhavesh91/openai-whisper-benchmarking
Comparing the performance of OpenAI's Whisper model on a GPU vs OpenAI's API
gpu openai speech-to-text whisper
Last synced: 16 Nov 2024
https://github.com/schnoddelbotz/whisper-ui
Transcribe audio/video to text, locally on macOS, Linux and Windows. A simple whisper.cpp wrapper/UI built with Go/Fyne.
ffmpeg ffmpeg-wrapper fyne gui local privacy speech-to-text transcription whisper whisper-cpp
Last synced: 27 Jan 2025
https://github.com/rokbenko/arctic-meet
ArcticMeet is an AI meeting assistant using Streamlit for the GUI and the Snowflake Arctic LLM via the Snowflake Cortex for the AI features
ffmpeg pandas plotly python pytorch snowflake snowflake-arctic snowflake-cortex snowpark streamlit transformers whisper
Last synced: 11 Jan 2025
https://github.com/kunesj/holo-subs-search
Tool for searching transcriptions of vtuber videos.
holodex pyannote transcription vtuber whisper youtube
Last synced: 19 Jan 2025
https://github.com/ahmetoner/master-whisper
Master Whisper transcription with CTranslate2
deep-learning inference openai quantization speech-recognition speech-to-text transformer whisper
Last synced: 08 Jan 2025
https://github.com/mikeesto/whispercpp-android
An Android app using whisper.cpp to do voice-to-text transcriptions
android kotlin speech-to-text whisper whisper-cpp
Last synced: 17 Dec 2024
https://github.com/bbc-esq/batch-openai-whisper-ctranslate2
Batch process multiple files using the fasted ctranslate2 implementation of Open AI's Whisper
batch-processing batch-script openai openai-whisper pyside6 transcription translation whisper whisperx
Last synced: 11 Jan 2025
https://github.com/abdnh/anki-asr
Anki add-on for speech recognition
anki anki-addon deepgram speech-recognition whisper
Last synced: 24 Nov 2024
https://github.com/carlosulisesochoa/whisper-ai-transcription-audio-to-text-file
A Python tool that uses OpenAI's Whisper model to batch transcribe audio files with GPU acceleration. Features include multi-language support, timestamp-based output, automatic file status checking, and CUDA support for faster processing. Perfect for transcribing lectures, interviews, or any audio content with high accuracy.
ai audio-to-text transcription whisper
Last synced: 28 Jan 2025
https://github.com/chaoticbyte/audio-summarize
An audio summarizer (faster-whisper and BART glued together)
ai ai-summarizer audio bart ctranslate2 faster-whisper nlp speech-to-text summarization whisper
Last synced: 09 Oct 2024
https://github.com/jpzinn654/speaker-diarization-portuguese
This project implements speaker diarization for Portuguese audio using WhisperX for transcription and PyAnotAudio's Speaker-Diarization 3.1 for speaker separation. It includes a Flask UI for easy file upload, transcription, and speaker identification.
flask gender-detection portuguese-language speaker-diarization speaker-recognition speech-recognition transcription whisper
Last synced: 28 Jan 2025
https://github.com/yc-w-cn/s-wave
S-WAVE is a browser-based podcast reading app with AI transcription. User data is stored locally. MIT License.
podcast pouchdb typescript wasm whisper whisper-cpp
Last synced: 28 Dec 2024
https://github.com/nexuslux/simultaneous-interpretation
Simultaneous-Interpretation is an advanced tool for real-time simultaneous interpretation. It transcribes and translates spoken language from a microphone input instantaneously, continually refining translations for accuracy. Ideal for business meetings, educational settings, and live events, it enhances multilingual communication effortlessly.
agents asr faster-whisper openai pyaudio simultaneous-intepreting simultaneous-translation speech-recognition speech-to-text transcription translation whisper
Last synced: 09 Oct 2024
https://github.com/yousofss/speechtotext
Speech-to-Text using OpenAI's Whisper model
audio-to-text openai openai-whisper speech-to-text transcription whisper whisper-ai
Last synced: 09 Oct 2024
https://github.com/man2dev/whisper-cpp
dev fork of https://src.fedoraproject.org/rpms/whisper-cpp
fedora fedora-repository linux whisper whisper-cpp whispercpp
Last synced: 09 Oct 2024
https://github.com/sivakumar-mahalingam/subtitle-generator
🎞️ Automatically generating subtitles for video files using Whisper ASR model in Python
ai audio-model audio-processing automatic-speech-recognition openai-whisper python speech-recognition speech-to-text subtitle-generator whisper
Last synced: 09 Oct 2024
https://github.com/soenneker/soenneker.runners.whisper.ctranslate
Automatically updates the Soenneker.Whisper.CTranslate package
ai csharp ctranslate ctranslate2 dotnet faster library runner runners whisper whisperctranslate
Last synced: 28 Dec 2024
https://github.com/kitschpatrol/ambient-novel
An interface for nonlinear interactive exploration of a novel.
ambient book fiction interactive novel svelte whisper
Last synced: 20 Jan 2025
https://github.com/kristofferv98/whisper_turboapi
An optimized FastAPI server for OpenAI's Whisper whisper-large-v3-turbo model using MLX turbo optimization
ai api asynchronous audio audio-processing fastapi huggingface machine-learning macos mlx model-serving nlp openai optimization python speech-to-text synchronous transcription whisper whisper-turbo
Last synced: 14 Dec 2024
https://github.com/luluw8071/whisper-tune
Finetuning Whisper on your own voice
Last synced: 14 Dec 2024
https://github.com/teemow/mnote
Generates meeting notes and summaries from video recordings
ai chatgpt google-meet kubeai kubernetes meeting-minutes transcription video-transcription whisper
Last synced: 02 Feb 2025
https://github.com/sudiptab2100/waku-user-chat
Waku Chat using Usernames
communication-protocol decentralised-application decentralized ethereum ipfs libp2p waku waku-connect web3 whisper zk-snarks zkp
Last synced: 20 Dec 2024
https://github.com/iamarunbrahma/smart-voice-assistant
A simple voice assistant to get your queries in speech format and generate answers using ChatGPT API in both text and audio format.
Last synced: 02 Feb 2025
https://github.com/neiltron/autocap
ALL CAPS
closedcaptions ml subtitles transcription whisper
Last synced: 19 Dec 2024
https://github.com/educa-ch/educa24-speech-to-summary
Demonstrator for an open-source speech-to-summary workflow
langchain ollama open-source open-weight speech-to-text summarization whisper
Last synced: 11 Oct 2024
https://github.com/flo-bit/youtube-speaker-separation
simple python script that outputs separate audio files for each speaker in a youtube video, using whisper on replicate
speaker-diarization speech-to-text text-to-speech voice-cloning whisper youtube
Last synced: 19 Dec 2024
https://github.com/thealphamerc/audio-to-text
Transcribe multi-lingual audio clips using whisper model
Last synced: 02 Feb 2025
https://github.com/khushijtrivedi/speech
The Assistive Speech Technology System is designed to enhance communication by analyzing and processing various speech and audio inputs.
ajax bigru-crf bootstrap flask flask-server html-css-javascript librosa python restapi-framework voice-recognition whisper
Last synced: 09 Oct 2024
https://github.com/tomdewildt/whisper-experiment
Experiments using the Whisper model from Open AI
colab jupyter python transcribe transformers translate whisper
Last synced: 27 Dec 2024
https://github.com/fkiller/whispertranscript
Transcribe voice from mic input using OpenAI Whisper API.
llm openai transcribe transcript transcription webaudio whisper
Last synced: 06 Jan 2025
https://github.com/barrylee111/voicechat-llm
A chatbot with both prompt and voicechat capabilities leveraging LangChain, Elasticsearch, and FastAPI. When using voicechat, the user can immerse themselves in the experience by selecting a narrator, like a pirate for instance.
elasticsearch fastapi langchain largelanguagemodel python react speech-to-text tailwind text-to-speech typescript websocket whisper
Last synced: 19 Dec 2024
https://github.com/brunogaliati/speech2text-investments
This project automates the download, transcription, and summarization of audio from YouTube videos. Using OpenAI's Whisper model, it converts video content into concise text summaries with an investment analyst's perspective, ideal for professionals needing quick insights.
chatgpt investment openai politics python speech-recognition speech-to-text whisper
Last synced: 19 Dec 2024
https://github.com/pjarbas/azure-ai
Examples using Azure AI services (DALLE3, Text to Speech, Whisper)
azure-openai dalle-3 image-generation-ai speech-synthesis text-to-speech whisper
Last synced: 21 Jan 2025
https://github.com/huuquyet/phowhisper-small
Converted clone of PhoWhisper: Automatic Speech Recognition for Vietnamese (2024)
onnx-models phowhisper speech-recognition transformersjs vietnamese vinai whisper
Last synced: 01 Feb 2025
https://github.com/danibcorr/university-helper
🧑🎓 University Helper streamlines academic and administrative tasks for students, educators, and researchers. It provides tools for managing document metadata, converting PDFs to Markdown, transcribing audio, analyzing grade statistics, and more.
deep-learning documentation-tool metadata ocr open-source pdf python statistics university whisper
Last synced: 19 Dec 2024
https://github.com/flaviodelgrosso/whisper-transcriber
Use OpenAI's Whisper to transcribe audio files and diariaze speakers of the transcribed text
ai audio-to-text diarization openai torch whisper
Last synced: 19 Dec 2024
https://github.com/deshwalmahesh/whisper-fastapi-realtime
It is Front + Backend app that uses openai/whisper-large-v3-turbo in your consumer grade system to provide real live audio transcription
audio-transcription fastapi huggingface live pyaudio realtime transcription transformers whisper whisper-large
Last synced: 25 Oct 2024
https://github.com/evil0ctal/whisper-speech-to-text-api
An open source Speech-to-Text API. The project is based on OpenAI's Whisper model and uses the asynchronous features of FastAPI to efficiently wrap it and support more custom functions.
ai api fastapi openai-whisper speech-to-text speech-to-text-api whisper whisper-ai whisper-api
Last synced: 25 Oct 2024
https://github.com/egorsmkv/optimized-whisper-intel
Run quantized Whisper models only on CPU with Intel hardware
intel onnx onnxruntime quantized-neural-networks whisper
Last synced: 19 Dec 2024
https://github.com/devgeekm/chat-it-up
Chat It Up! elevates conversations by transforming YouTube URLs, documents, and audio into text, enabling interactive Q&A and summaries. With one click, turn media into time-saving, knowledge-rich dialogues.
ai azure azure-functions azureservices blob-storage fastapi python rag whisper youtube-dl
Last synced: 20 Dec 2024
https://github.com/bilelouahmed/vocal-assistant
Python voice assistant (based on SpeechRecognition, Whisper and XTTS models) designed to transcribe speech to text, translate across languages, engage in chat mode, and ultimately respond vocally.
chatbot llm mistral-7b neo4j python rag speech-recognition text-to-speech transcription whisper xtts
Last synced: 21 Dec 2024
https://github.com/soenneker/soenneker.libraries.whisper.ctranslate
Simply adds the Whisper_CTrantlate2 Windows executable, updated daily (if available)
ai csharp ctranslate ctranslate2 dotnet faster libraries library whisper whisperctranslate
Last synced: 29 Dec 2024
https://github.com/vifill/audio-recorder-and-summarizer
This project is a Python script that records system audio on macOS using BlackHole, transcribes the audio using OpenAI's Whisper API, and summarizes the transcription using OpenAI's GPT models
ai audio blackhole gpt openai records summarize system whisper
Last synced: 20 Dec 2024
https://github.com/ekito-station/whisper-api-unity
UnityでOpenAI Whisper APIを使って文字起こしを行ったサンプル
Last synced: 20 Dec 2024
https://github.com/miosipof/asr_train
Fine-tuning OpenAI Whisper for ASR tasks on low-size datasets
asr machine-learning nlp whisper
Last synced: 07 Jan 2025
https://github.com/miosipof/whisper_inference
OpenAI Whisper ASR inference on CPU with OpenVino, PyTorch or Huggingface
asr inference machine-learning openvino pytorch whisper
Last synced: 07 Jan 2025
https://github.com/arkapravo-ghosh/speech-to-text
Speech to Text Transcription using OpenAI Whisper v3 and FastAPI
ai fastapi huggingface machine-learning openai python3 speech-to-text transformers whisper
Last synced: 21 Dec 2024
https://github.com/theaussiepom/wyoming-openai
OpenAI SST and TTS support for the Wyoming protocol
home-assistant home-assistant-assist openai sst tts whisper wyoming
Last synced: 21 Dec 2024
https://github.com/a-iceberg/whisper_model_evaluator
WER, MER, WIL of Whisper vs Vosk vs Google transcribators comparator
asr audio-to-text automatic-speech-recognition data-analysis evaluation google-speech-recognition python tuning-parameters visualization vosk whisper
Last synced: 24 Oct 2024
https://github.com/julrog/jokes-on-you
Storyteller
ggj2024 global-game-jam openai unity whisper
Last synced: 17 Dec 2024
https://github.com/bluebirdback/groq-subtitles
Batch video subtitle generation using Groq Whisper API
groq speech-to-text subtitles video whisper
Last synced: 21 Dec 2024
https://github.com/josemarcosrf/Lexicap-QA
QA retrieval for Lex Fridman's podcast transcriptions
Last synced: 24 Oct 2024
https://github.com/felipecastrosales/scripts
List of useful scripts.
audio helper-functions helpers ia pip python python3 script scripts video whisper whisper-ai
Last synced: 22 Dec 2024
https://github.com/tylim88/Voicefu-back-end
Translate Speech Into Japanese
chatgpt speech-synthesis voicevox whisper
Last synced: 24 Oct 2024
https://gitlab.com/ifrz/asr-multi-lite
Testing of the main ASR frameworks with reduced models for low-resource languages speech recognition
Last synced: 24 Oct 2024
https://github.com/aixerum/faster-whisper
faster-whisper is a reimplementation of OpenAI's Whisper model using CTranslate2, which is a fast inference engine for Transformer models. This implementation is up to 4 times faster than openai/whisper for the same accuracy while using less memory. The efficiency can be further improved with 8-bit quantization on both CPU and GPU.
ctranslate2 gpu transcription whisper
Last synced: 07 Jan 2025
https://github.com/zdwolfe/transcription-tools
Docker video transcriber, wrapper around OpenAI
openai transcription whisper whisper-ai
Last synced: 02 Jan 2025
https://github.com/userpjm/whisper-youtube
Generate a SubRip subtitle file (srt) using Whisper for the audio of a YouTube video.
faster-whisper openai speech-to-text whisper
Last synced: 24 Oct 2024
https://github.com/ashot72/answering-questions-about-images
You can upload images, ask questions about images using voice prompts, then listen to the responses in voice
answering-questions blip-2-ai-model gtts large-language-models llm replicate speech-to-text text-to-speech whisper
Last synced: 30 Dec 2024