Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Whisper
Whisper is an autoregressive language model developed by OpenAI. It is trained on a large corpus of text using a transformer architecture and is capable of generating high-quality natural language text. Whisper can be used for tasks such as language modeling, text completion, and text generation. It has shown impressive performance on various benchmarks and has been released by OpenAI to encourage research in the field of language modeling. Whisper is not yet available for public use, but it has the potential to transform the field of natural language processing and generate new opportunities for language-based applications.
- GitHub: https://github.com/topics/whisper
- Repo: https://github.com/openai/whisper
- Created by: OpenAI
- Released: August 2021
- Related Topics: machine-learning, artificial-intelligence, language-modeling,
- Last updated: 2024-12-24 00:28:14 UTC
- JSON Representation
https://github.com/nicknaskida/insanely-fast-whisper
Incredibly fast Whisper-large-v3 with speaker diarization
diarization speaker-diarization transfromers whisper whisper-ai whisper-faster whisper-large
Last synced: 26 Sep 2024
https://github.com/canaxs/whisper-core
An application where users can make rumor-based news and earn money in return.
mysql panel spring spring-boot whisper
Last synced: 19 Dec 2024
https://github.com/troyanovsky/llm_summarizer
Use LLM and Whisper to summarize long text and audio/video
Last synced: 13 Nov 2024
https://github.com/bhattbhavesh91/openai-whisper-benchmarking
Comparing the performance of OpenAI's Whisper model on a GPU vs OpenAI's API
gpu openai speech-to-text whisper
Last synced: 16 Nov 2024
https://github.com/juanestban/whisper-tnode
cli ts typescript whisper whisper-cpp whisper-ia whisper-node whisper-node-ts
Last synced: 21 Dec 2024
https://github.com/brentwong-kiel1997/ai_language_school_based_on_django_and_openai
Django and OpenAI API example use case
django gpt-4 openai openai-api whisper
Last synced: 09 Oct 2024
https://github.com/abdnh/anki-asr
Anki add-on for speech recognition
anki anki-addon deepgram speech-recognition whisper
Last synced: 24 Nov 2024
https://github.com/fukuro-kun/wortweber
Wortweber ist ein sich in der Entwicklung befindendes Open-Source-Projekt, das Echtzeit-Sprachtranskription mit KI-Technologie erforscht. Es dient als Lern- und Experimentierplattform für Spracherkennung in Deutsch und Englisch.
Last synced: 17 Nov 2024
https://github.com/valiantlynx/custom-whisper-api
This project provides a custom API wrapper for the open-source Whisper model using FastAPI. It allows you to integrate Whisper into your applications for automatic speech recognition (ASR) tasks.
ai docker-compose fastapi python whisper
Last synced: 22 Dec 2024
https://github.com/i4ds/whisper-finetune
This repository contains code for fine-tuning the Whisper speech-to-text model.
fine-tuning nlp speech-to-text whisper
Last synced: 09 Oct 2024
https://github.com/wtlow003/auto-subtitles
CLI tool to transcribe (+ translate) videos and embed subtitles automatically.
faster-whisper nllb subtitles subtitles-generator translation whisper whisper-cpp
Last synced: 15 Nov 2024
https://github.com/shtirmann/v2t
Telegram bot which automatically transcribes all voice and video messages to text.
ai aiogram faster-whisper python telegram-bot telegram-bot-python voice-to-text whisper
Last synced: 09 Oct 2024
https://github.com/voqal/browser
Natural speech browsing for the software developers of tomorrow
cef jcef openai realtime-api voice voice-assistant voice-browser voice-commands voice-control whisper
Last synced: 20 Oct 2024
https://github.com/aspadax/subtitlegenerator
Automatically generate a subtitle for your video.
gpt machine-learning openai rust streamlit subtitles-generator whisper
Last synced: 09 Oct 2024
https://github.com/marquesafonso/multilang-asr-captioner
A multilingual automatic speech recognition and video captioning tool using faster whisper. Supports real-time translation to english. Runs on consumer grade cpu.
automatic-speech-recognition captioning-videos faster-whisper whisper
Last synced: 24 Oct 2024
https://github.com/ioriens/whisper-video
Generate subtitles for all the videos in a folder with OpenAI's Whisper privately in your computer.
subtitle-generator video-to-audio video-to-text whisper
Last synced: 17 Nov 2024
https://github.com/toLSC/tolsc-speech-to-text
Speech to text service for toLSC app implemented with OpenAI Whisper model
fastapi python speech-recognition speech-to-text tts whisper
Last synced: 24 Oct 2024
https://github.com/antoniosbarotsis/telegram-transcriber
A Telegram bot for transcribing voice messages
telegram transcribe voice whisper
Last synced: 31 Oct 2024
https://github.com/jowadev/interview
Interview is an interactive application crafted to empower both students and professionals in honing their skills for job interviews.
interview-preparation job-interviews nextjs professional students whisper
Last synced: 14 Dec 2024
https://github.com/elmiraghorbani/gpt-speaker-diarization
Conversational Speaker Diarization using OpenAI AI Language Models(gpt-4) and OpenAI Whisper.
asr diarization gpt-4 openai speaker-diarization speech-recognition speech-to-text voice-activity-detection whisper youtube-dl
Last synced: 29 Nov 2024
https://github.com/Op27/meeting_minutes_generator
This Python application automates the process of generating meeting minutes from an audio recording. It uses the Whisper library for transcription and the OpenAI GPT models for summarizing content, then outputs the result in a Word document.
ai audio-processing document-automation meeting-minutes openai python speech-recognition text-summarization transcription whisper
Last synced: 24 Oct 2024
https://github.com/TranBaVinhSon/eth-decentralized-chat
Decentralized chat app by Ethereum Whisper protocol + Vuejs
ethereum vue vuejs whisper whisper-protocol
Last synced: 24 Oct 2024
https://github.com/nerdimite/meetsy-app
Frontend for the Workshop on Building an End-to-End AI Meeting Assistant
gpt-3 nextjs sentence-transformers tailwindcss whisper
Last synced: 24 Oct 2024
https://github.com/gangula-karthik/memo-mate
🚀 Discord meetings redefined with Memo Mate: Transcribe, summarize, and automate minutes seamlessly! ✨
discord-bot huggingface mistral py-cord speech-to-text transcribe whisper
Last synced: 22 Dec 2024
https://github.com/doctorpok42/pheere-app
Pheere is a simple virtual assistant
ai chatgpt desktop-app elevenlabs nextjs scss tauri ts virtual-assistant whisper
Last synced: 11 Nov 2024
https://github.com/pdcalado/waste
Whisper Audio Service for Transcription and Ergonomics
productivity rofi transcription tts whisper
Last synced: 20 Nov 2024
https://github.com/notyusheng/transcribe-translate
Local web app for transcription and translation services for audio and video using Whisper models
docker full-stack nodejs react reactjs self-hosted speech-to-text transcribe translate whisper
Last synced: 11 Oct 2024
https://github.com/aitor-alvarez/large-speech-models
Fine-tuning Multilingual Large Speech Recognition Models: Wav2vec and Whisper
arabic-speech-recognition asr asr-model finetuning-wav2vec finetuning-whisper large-speech-models speech-recognition-model wav2vec2 whisper
Last synced: 25 Nov 2024
https://github.com/julienvincent/whalker
Whisper talker
whisper whisper-ai whisper-cpp
Last synced: 07 Nov 2024
https://github.com/extrange/transcription-benchmarks
Speech to text model benchmarks
Last synced: 08 Dec 2024
https://github.com/huuquyet/phowhisper-next
Demo using PhoWhisper models of VinAI built with Transformers.js + Next.js
nextjs onnx-models phowhisper speech-recognition transformersjs vietnamese vinai whisper
Last synced: 19 Dec 2024
https://github.com/shani-sinojiya/sandalquest
AI/ML project for recognizing colloquial Kannada speech and building a speech-based Q&A system focused on sandalwood cultivation.
ai audio-processing data-augmentation deep-learning machine-learning mongodb nlp python pytorch question-answering speech-based-question-answering-system speech-recognition whisper
Last synced: 02 Dec 2024
https://github.com/volkansah/text-to-speech-pygui-for-whisper
This is a simple Python-based GUI application that allows users to generate speech from text using the OpenAI API. The application provides a user-friendly interface for inputting text and selecting from different voices to create personalized audio output.
openai openai-api python-gui-tkinter python3 whisper whisper-ai
Last synced: 28 Nov 2024
https://github.com/maawad/luna
Personal assistant
bot openai personal-assistant whisper
Last synced: 17 Dec 2024
https://github.com/Shtirmann/V2T
Telegram bot which automatically transcribes all voice and video messages to text.
ai aiogram faster-whisper python telegram-bot telegram-bot-python voice-to-text whisper
Last synced: 24 Oct 2024
https://github.com/szilvia-csernus/openai-audio-api-calls
Speech-to-text and text-to-speech API call examples, using OpenAI's whisper-1 and tts-1 models.
jupyter-notebook openai openai-api tts-1 whisper
Last synced: 09 Oct 2024
https://github.com/team-mansumugang/mansumugang-backend
만수무강 서비스의 스프링 부트 어플리케이션입니다.
aws github-actions jpa jpa-hibernate spring-boot whisper
Last synced: 09 Oct 2024
https://github.com/jojasadventure/whisper-client
Very simple Python based client for Whisper compatible endpoint
desktop-app dictation faster-whisper macos productivity python speech-to-text stt whisper
Last synced: 09 Oct 2024
https://github.com/adamelkholyy/whisper-yt
Toolkit for using Whisper to transcribe YouTube videos. Includes Whisper transcription of YouTube videos, conversion of YouTube video into HuggingFace dataset (using audio and subtitles) and evaluation of Whisper transcription against YouTube subtitles
asr diarization huggingface-datasets pyannote transcription whisper word-error-rate youtube
Last synced: 10 Dec 2024
https://github.com/aws-samples/amazon-ivs-webgpu-captions-demo
This repository contains an experimental demo application that shows how you can add client-side auto-generated captions to Amazon IVS Real-time and Low-latency streams using transformers.js and WebGPU.
ai amazon-ivs aws captions experimental ivs-lowlatency ivs-realtime lambda lowlatency lvl-300 realtime serverless transformersjs web webgpu webrtc whisper
Last synced: 09 Oct 2024
https://github.com/chaoticbyte/audio-summarize
An audio summarizer (faster-whisper and BART glued together)
ai ai-summarizer audio bart ctranslate2 faster-whisper nlp speech-to-text summarization whisper
Last synced: 09 Oct 2024
https://github.com/adisol07/sharpspeech
SharpSpeech is free, local and open source way to speech and wake word recognition.
audio speech speech-recognition speech-to-text wake-word-detection wakeword whisper whisper-ai
Last synced: 19 Dec 2024
https://github.com/yc-w-cn/s-wave
S-WAVE is a browser-based podcast reading app with AI transcription. User data is stored locally. MIT License.
podcast pouchdb typescript wasm whisper whisper-cpp
Last synced: 07 Nov 2024
https://github.com/bigyaa/transcription-system
This versatile tool is designed for anyone in need of a robust solution for transcribing and diarizing large volumes of audio files. Whether you are dealing with terabytes or even larger quantities, our tool ensures efficient and accurate processing. Ideal for researchers, content creators, and businesses.
accessibility diarization speech-to-text storytelling-with-data transcription whisper
Last synced: 19 Dec 2024
https://github.com/gamut73/quizinator
Generating quizzes, on Android, from YouTube videos.
kotlin-android llm python whisper
Last synced: 19 Dec 2024
https://github.com/slinusc/speaker_identification_evaluation
Evaluating the Effectiveness of Transformer Layers in Wav2Vec 2.0, XLS-R, and Whisper for Speaker Identification Tasks
Last synced: 09 Oct 2024
https://github.com/marty1885/useful-whisper-server
Whisper server based on useful-transformers for the RK3588
npu rk3588 rockchip useful-transformers whisper
Last synced: 05 Dec 2024
https://github.com/platput/pysubs
api to get audio transcription for video files from youtube, aws s3 and such. using OpenAI Whisper
Last synced: 24 Oct 2024
https://github.com/mikeesto/whispercpp-android
An Android app using whisper.cpp to do voice-to-text transcriptions
android kotlin speech-to-text whisper whisper-cpp
Last synced: 17 Dec 2024
https://github.com/niqifan007/openai-tts-stt-streamlit
A gui interface for tts (text-to-speech) and stt (speech-to-text) interfaces using the openai api developed by Streamlit, with a history function一个使用Streamlit开发的openai的api接口的tts(文字转语音)和stt(语音转文字)接口的gui界面,带有历史记录功能
openai openai-api streamlit stt-gui tts tts-gui whisper whisper-api
Last synced: 09 Oct 2024
https://github.com/aeronjl/transcribe
Python package for accurate audio transcription with speaker diarisation
audio-transcription gpt speaker-diarization whisper
Last synced: 09 Oct 2024
https://github.com/mikeesto/subber
A small CLI tool for converting video & audio to a text transcription
audio cli ffmpeg golang transcribe video whisper
Last synced: 19 Dec 2024
https://github.com/etienneab3d/srt-sync
Synchronize SRT timestamps over an existing accurate transcription
aligner asr nlp subtitles text-to-speech whisper
Last synced: 19 Dec 2024
https://github.com/tylim88/voicefu
Translate Speech Into Japanese
chatgpt speech-synthesis voicevox whisper
Last synced: 18 Dec 2024
https://github.com/baristikir/voice-typing
Simple Desktop Application with Voice Typing features. Runs locally, transcribes locally and works fully offline with support for real-time transcribing. Powered by OpenAI Whisper ASR-models and whisper.cpp inference engine
Last synced: 24 Dec 2024
https://github.com/becomingbabyman/eunoia-desktop
local desktop transcription and search for apple voice memos and videos
search second-brain transcription videos voice-memos whisper
Last synced: 25 Dec 2024
https://github.com/topdev0215/AudioMultifunctionChatbot
This app enabling users to either record or upload audio files. Then utilizing OpenAI API (Whisper, GPT4) generates transcriptions, summaries, fact checks, sentiment analysis, and text metrics. Users can also intelligently chat about their transcriptions with a GPT4 chatbot. Data is stored relationally in SQLite and also vectorized in Pinecone.
gpt4 langcha nltk openai python3 sqlite3 streamlit strean whisper
Last synced: 24 Oct 2024
https://github.com/brentwong-kiel1997/brents_ai_language_school
Use AI such as ChatGPT and Whisper to learn foreign languages from YouTube videos
ai chatgpt foreign-language openai openai-api whisper whisper-ai youtube
Last synced: 08 Nov 2024
https://github.com/natanielf/lecsum
Automatically transcribe and summarize lecture recordings completely on-device using AI.
ollama ollama-python whisper whisper-ai
Last synced: 18 Dec 2024
https://github.com/utrechtuniversity/transcription-d-lucea
python utrecht-university whisper
Last synced: 22 Nov 2024
https://github.com/h3yn3s/tl-dl
A selfhostable webapp which helps you read those uselessly long (by nature) voice messages with the power of AI.
Last synced: 24 Oct 2024
https://github.com/mooerslab/bash-whisper-transcription
Bash function to ease the transcription of audio files with OpenAI's whisper.
asr audio audio-file-trancription audio-messages automate-the-boring-stuff automatic-speech-recognition automation bash bash-function beginner-friendly speech-to-text stt whisper
Last synced: 14 Dec 2024
https://github.com/nerdimite/meetsy-backend
AI Backend for the Workshop on Building an End-to-End AI Meeting Assistant
gpt-3 nextjs sentence-transformers tailwindcss whisper
Last synced: 24 Oct 2024
https://github.com/egorsmkv/star-adapt-uk
Fork of https://github.com/YUCHEN005/STAR-Adapt with some modifications for Ukrainian.
asr speech-recognition ukrainian whisper
Last synced: 19 Dec 2024
https://github.com/alancunningham/chatgpt-assistant
A ChatGPT assistant with voice activation and image generation, connected to a Raspberry Pi display.
chatgpt chatgpt-api dall-e dall-e-api porcupine python raspberry-pi whisper
Last synced: 10 Nov 2024
https://github.com/tranbavinhson/eth-decentralized-chat
Decentralized chat app by Ethereum Whisper protocol + Vuejs
ethereum vue vuejs whisper whisper-protocol
Last synced: 06 Nov 2024
https://github.com/roman01la/sub-deep
Transcribe and translate audio with AI
deepl transcribe translate whisper
Last synced: 08 Nov 2024
https://github.com/bluebirdback/groq-subtitles
Batch video subtitle generation using Groq Whisper API
groq speech-to-text subtitles video whisper
Last synced: 21 Dec 2024
https://github.com/sbadulin/obsidian-dictation-plugin
Obsidian dictation plugin
dictation gpt-35-turbo obsidian obsidian-plugin openai speech-to-text whisper
Last synced: 07 Dec 2024
https://github.com/mariatepei/vt_thesis_mtepei
This repository accompanies my MSc Thesis for the degree Voice Technology, storing all referenced data and other relevant resources.
data-augmentation fastspeech2 speech-recognition whisper
Last synced: 09 Oct 2024
https://github.com/huuquyet/phowhisper-small
Converted clone of PhoWhisper: Automatic Speech Recognition for Vietnamese (2024)
onnx-models phowhisper speech-recognition transformersjs vietnamese vinai whisper
Last synced: 06 Dec 2024
https://github.com/jgw96/speech-to-text-web-toolkit
Making Speech-To-Text on the web easy, both local and in the cloud
ai lit transformersjs webcomponents whisper
Last synced: 06 Dec 2024
https://github.com/ashot72/speech-to-text-to-image
Generating texts from your voice then images form the texts
chatgpt large-language-models llm replicate speech-to-text speechtotext stability-ai text-to-image texttoimage whisper whisper-ai
Last synced: 08 Nov 2024
https://github.com/darienmt/radio-listener
Speech Recognition applied to transcribe amateur radio traffic experiments
python3 radio-amateurs speach-to-text speech-recognition whisper
Last synced: 21 Nov 2024
https://github.com/flyingfathead/youwhisper-cli
A streamlined CLI tool combining `yt-dlp` and `whisperx` (or `openai-whisper`) for quick and efficient audio transcription from various video platforms.
cli cli-app python transcribe transcriber transcription whisper whisper-ai whisperx youtube-downloader yt-dlp yt-dlp-wrapper
Last synced: 12 Nov 2024
https://github.com/mottla/speech-to-text
Local and fast speech to text (STT) with speaker recognition. Transcibe your meetings confidentially.
huggingface speech-recognition stt teams transcription translation whisper zoom
Last synced: 21 Nov 2024
https://github.com/xi-rick/captains-log
Captain's Log is your personal AI-powered voice transcription logbook. This innovative web application allows you to transcribe spoken words into text, organize your thoughts, and manage important notes. Built with cutting-edge technology and creative design, Captain's Log sets sail to revolutionize how you capture and manage ideas.
audio-recorder audio-visualizer javascript mongodb mongodb-atlas nextjs once-ui openai react reactjs shadcn-ui tailwindcss typescript voice whisper
Last synced: 21 Nov 2024
https://github.com/mdbecker/whisper_cpp_macos_utils
Automated transcription workflow for macOS: Shell scripts to streamline audio recording, conversion, and transcription using whisper.cpp with macOS utilities like QuickTime Player and BlackHole-2ch.
audio-processing openai shell-scripts speech-to-text transcription whisper whisper-cpp
Last synced: 01 Dec 2024
https://github.com/samliebl/ai-whisper
Simple Node.js app: speech-to-text via whisper by OpenAI with file download.
nodejs openai speect-to-text transcription whisper whisper-ai
Last synced: 19 Dec 2024
https://github.com/madh93/whisper
🎙️ My Whisper stuff
docker openai speech-recognition speech-to-text whisper whisper-cpp
Last synced: 01 Dec 2024
https://github.com/kolger/forty-two-transcribe
A Telegram bot that transcribes videos and audio messages to text via OpenAI Whisper API
openai self-hosted telegram whisper
Last synced: 25 Nov 2024
https://github.com/tobybenjaminclark/intermew
👨💻 Realistic, generative simulated interviews for Durhack 2024. Built using Webscraping, OpenCV, Deepface, Whisper, OpenAI and Gamemaker.
computer-vision openai-api whisper
Last synced: 25 Nov 2024
https://github.com/teemow/mnote
Generates meeting notes and summaries from video recordings
ai chatgpt google-meet kubeai kubernetes meeting-minutes transcription video-transcription whisper
Last synced: 07 Dec 2024
https://github.com/armaggheddon/whisper2me
whisper2me is a telegram bot written with pyTelegramBotAPI that uses OpenAI's whisper to perform speech2text so you no longer have listen to voice messages 🤫🔇
docker openia pytelegrambotapi python whisper
Last synced: 25 Nov 2024
https://github.com/heng30/vtbox
It is an offline voice to text tool. Using whisper model to transcribe.
rust slint-ui voice2text whisper
Last synced: 21 Nov 2024
https://github.com/electroneum/electroneum-web3.js
Electroneum SmartChain JavaScript API
api electroneum ethereum etn-sc javascript swarm typescript whisper
Last synced: 26 Sep 2024
https://github.com/iamarunbrahma/smart-voice-assistant
A simple voice assistant to get your queries in speech format and generate answers using ChatGPT API in both text and audio format.
Last synced: 07 Dec 2024
https://github.com/tomdewildt/whisper-experiment
Experiments using the Whisper model from Open AI
colab jupyter python transcribe transformers translate whisper
Last synced: 07 Nov 2024
https://github.com/chloelavrat/speech-to-text-app
Speech to text web app based on Streamlit and whisper that extract script for audio or youtube video.
audio-processing machine-learning machinelearning speech-to-text streamlit streamlit-webapp stt whisper whisper-ai
Last synced: 09 Nov 2024
https://github.com/breadrock1/audio-to-text
There is simple backend project to use whisper-rs.
actix-web audio-to-text rust swagger-ui whisper
Last synced: 11 Nov 2024
https://github.com/notyusheng/transcribe-translate_kubernetes
Local web app for transcription and translation services for audio and video using Whisper models
docker full-stack k8s kubernetes nodejs react reactjs self-hosted speech-to-text transcribe translate whisper
Last synced: 22 Nov 2024