Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Whisper
Whisper is an automatic speech recognition (ASR) model developed by OpenAI. It is an encoder-decoder transformer trained on a large corpus of multilingual audio paired with transcripts, and it can transcribe speech in many languages, translate speech into English, and identify the spoken language. OpenAI has released the code and model weights as open source to encourage research on robust speech processing, and the model is widely used for transcription, subtitling, and voice-driven applications.
- GitHub: https://github.com/topics/whisper
- Repo: https://github.com/openai/whisper
- Created by: OpenAI
- Released: September 2022
- Related Topics: machine-learning, artificial-intelligence, language-modeling
- Last updated: 2024-07-29 14:04:56 UTC
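The openai/whisper repository linked above ships a Python package. As a rough sketch (not taken from this page), the snippet below assumes that package is installed (`pip install openai-whisper`), that ffmpeg is available, and that the audio path is a placeholder. The helper names `format_timestamp`, `segments_to_srt`, and `transcribe_to_srt` are hypothetical; they only illustrate turning the segments returned by `transcribe` into SRT subtitles, a task many of the projects listed here perform.

```python
def format_timestamp(seconds: float) -> str:
    """Render a time offset in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments) -> str:
    """Convert Whisper-style segments (dicts with start, end, text) to SRT text."""
    blocks = []
    for index, seg in enumerate(segments, start=1):
        start = format_timestamp(seg["start"])
        end = format_timestamp(seg["end"])
        blocks.append(f"{index}\n{start} --> {end}\n{seg['text'].strip()}\n")
    return "\n".join(blocks)


def transcribe_to_srt(audio_path: str, model_name: str = "base") -> str:
    """Transcribe an audio file with Whisper and return SRT-formatted subtitles."""
    # Imported lazily so the pure helpers above work without the package installed.
    import whisper

    model = whisper.load_model(model_name)   # downloads the weights on first use
    result = model.transcribe(audio_path)    # dict with "text" and "segments"
    return segments_to_srt(result["segments"])
```

Calling `transcribe_to_srt("audio.mp3")` with a real file would return the subtitle text; larger model names such as `"small"` or `"medium"` trade speed for accuracy.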
https://github.com/noco-ai/spellbook-docker
AI stack for interacting with LLMs, Stable Diffusion, Whisper, xTTS and many other AI models
automatic-speech-recognition bark llama2 llm-inference mixtral musicgeneration stable-diffusion text-to-speech whisper xttsv2
Last synced: 04 Aug 2024
https://github.com/Cledev-Limited/Cledev.OpenAI
.NET 7 SDK for OpenAI with a Blazor Server playground
azureopenai blazor blazor-server chat-gpt chatgpt chatgpt-4 chatgpt-api dall-e dontnet-core dotnet gpt-3 gpt3 net7 openai openai-api sdk sdk-dotnet tokenizer whisper whisper-ai
Last synced: 04 Aug 2024
https://github.com/Illyism/openai-whisper-api
OpenAI Whisper API based on Node.js / Bun.sh in a Docker Container + Google Cloud Run Example
chatgpt openai openai-whisper whisper
Last synced: 03 Aug 2024
https://github.com/johniwasz/whetstone.chatgpt
A simple lightweight library that wraps the OpenAI API.
chatgpt dotnet dotnet-standard2 dotnet-standard2-1 gpt-3 gpt-35-turbo gpt-4 openai whisper whisper-ai
Last synced: 02 Aug 2024
https://github.com/bits-by-brandon/whisper-ui
A GUI for OpenAI Whisper based on Tauri and SvelteKit
rust speech-to-text svelte tauri whisper
Last synced: 01 Aug 2024
https://github.com/Woolverine94/biniou
a self-hosted webui for 30+ generative ai
audiogen bark controlnet diffusers generative-ai gfpgan gradio huggingface insightface ip-adapter kandinsky llama-cpp-python musicgen photomaker pix2pix real-esrgan stable-diffusion stable-video-diffusion webui whisper
Last synced: 31 Jul 2024
https://github.com/askrella/speech-rest-api
Transcription and TTS Rest API (OpenAI Whisper, Speechbrain)
artificial-intelligence openai python3 speech-recognition speech-to-text text-to-speech whisper whisper-ai
Last synced: 01 Aug 2024
https://github.com/supershaneski/openai-whisper
A sample web app using OpenAI Whisper to transcribe audio built on Next.js. It records audio continuously for some time interval then uploads the audio data to the server for transcribing/translating.
nextjs openai openai-whisper reactjs whisper
Last synced: 29 Jul 2024
https://github.com/nalbion/whisper-server
streaming speech to text server using Whisper
Last synced: 02 Aug 2024
https://github.com/JonathanFly/faster-whisper-livestream-translator
faster-whisper livestream translation, OBS noise reduction, dual language subtitles
faster-whisper speech-to-text subtitles whisper
Last synced: 05 Aug 2024
https://github.com/RayFernando1337/MLX-Auto-Subtitled-Video-Generator
Generate accurate transcripts using Apple's MLX framework
apple mlx pinokio transcribe translate whisper
Last synced: 03 Sep 2024
https://github.com/piotrkawa/deepfake-whisper-features
Implementation of the paper "Improved DeepFake Detection Using Whisper Features"
audio-deepfake-detection deep-learning deepfake-detection paper-implementations whisper
Last synced: 29 Jul 2024
https://github.com/mingkuan/voice-assistant-chatgpt
Voice Assistant based on Whisper ASR and ChatGPT API
ai-web-app asr chatbot-application chatgpt chatgpt-bot multilingual speech-recognition speech-synthesis streamlit streamlit-webapp voice-assistant whisper
Last synced: 01 Aug 2024
https://github.com/shamspias/chatgpt-voice-chatbot-telegram
ChatGPT Voice Chatbot Telegram is a Python and Flask-based GitHub repository that enables users to communicate with an AI chatbot using voice-to-text and text-to-voice technologies powered by OpenAI. The repository provides a flexible and customizable solution for building advanced voice-enabled chatbots using natural language processing.
celery chatbot chatgpt dall-e flask gpt-3 openjourney python telegram-bot telegram-voice-chat text-to-speech text-to-speech-python3 tts voice-chat voice-conversion voice-recognition voice-to-text whisper
Last synced: 03 Aug 2024
https://github.com/daymade/tiktok-whisper
Batch convert video to text using OpenAI's Whisper or local Core ML via whisper.cpp on your MacBook
coreml openai pgvector podcast postgresql sqlite tiktok whisper whisper-cpp xiaoyuzhou
Last synced: 01 Aug 2024
https://github.com/runpod/serverless-workers
⚙️ | REPLACED BY https://github.com/runpod-workers | Official set of serverless workers provided by RunPod as endpoints.
ai anything-v3 containers docker openjourney runpod serverless stable-diffusion whisper workers
Last synced: 07 Aug 2024
https://codeberg.org/pluja/web-whisper
New repo: https://codeberg.org/pluja/web-whisper-plus
ai audio go openai speech-to-text svelte transcription translation ui web whisper
Last synced: 03 Aug 2024
https://github.com/QuantiusBenignus/blurt
Gnome shell extension for accurate speech to text input in Linux using whisper.cpp. Input text from speech anywhere.
ai asr bloat-free dictate dictation gnome gnome-extension gnome-shell-extension input input-method kiss linux machine-learning speech-recognition speech-to-text whisper whisper-cpp
Last synced: 01 Aug 2024
https://github.com/EtienneAb3d/karaok-AI
Karaoke Player / Editor with automatic clip creation from any song file using vocals and lyrics extraction (Speech-to-Text)
djing karaoke karaoke-maker lyrics mp3-player music party-apps sound-processing speech-to-text srt-subtitles subtitles vad whisper
Last synced: 01 Aug 2024
https://github.com/serg-plusplus/meeper
Meeper 📝 - is your secretary for any in-browser conference.
ai chatgpt extension langchain summary transcription whisper
Last synced: 01 Aug 2024
https://github.com/pas1ko/meeper
Meeper 📝 - is your secretary for any in-browser conference.
ai chatgpt extension langchain summary transcription whisper
Last synced: 27 Aug 2024
https://github.com/Maitreyapatel/speech-conversion-between-different-modalities
Generative Adversarial Networks for different impaired speech conversions
deep-learning generative-adversarial-networks pytorch speech-conversion voice-conversion whisper
Last synced: 07 Aug 2024
https://github.com/KostasEreksonas/Audio-transcriber
Simple Python audio transcriber using OpenAI's Whisper speech recognition model
audio audio-to-text openai openai-whisper pip python text transcription whisper youtube youtube-dl
Last synced: 07 Aug 2024
https://github.com/JorianWoltjer/AutoCaptions
A GUI tool that uses OpenAI's Whisper to transcribe text from an audio/video file into a Premiere Pro sequence to automate the creation of subtitles.
ai premiere-pro srt subtitles whisper xml
Last synced: 04 Aug 2024
https://github.com/nooqta/kodyfire
AI-powered code generator and automation tool
ai automation boilerplate chatgpt cli codex generator low-code no-code openai openai-api scaffold template typescript whisper yeoman
Last synced: 10 Aug 2024
https://github.com/smaranjitghose/AIAudioTranscriber
A minimalistic web app to generate transcriptions for audio, built using Python
docker open-source openai python python3 speech-recognition speech-to-text streamlit streamlit-lottie streamlit-webapp whisper
Last synced: 07 Aug 2024
https://codeberg.org/pluja/web-whisper-plus
NEW VERSION AT: https://github.com/pluja/whishper. A transcription suite on your web browser: OpenAI's whisper and many other features. Formerly "web-whisper-plus"
ai audio docker go golang speech subtitles sveltekit text transcription whisper
Last synced: 29 Jul 2024
https://github.com/Eyevinn/auto-subtitles
Automatically generate subtitles from an input audio or video file using OpenAI Whisper
ffmpeg openai openai-whisper subtitle-generator subtitles subtitles-generator tools transcription video video-streaming whisper
Last synced: 01 Aug 2024
https://github.com/nitaiaharoni1/whisper-speech-to-text
Whisper Speech-to-Text is a JavaScript library for recording and transcribing user audio into text via OpenAI's Whisper, intended for web applications.
javascript openai openai-whisper react speech speech-recognition speech-to-text text-recognition typescript webapp whisper whisper-ai
Last synced: 29 Jul 2024
https://github.com/jim-schwoebel/nala_assistant
🔊😊 A fastapi voice-assistant framework to quickly prototype LLM-powered voice assistants in <5 minutes.
chatbot chatgpt dolly2 fastapi fastapi-boilerplate fastapi-sqlalchemy fastapi-template large-language-models llm llms speech-recognition speech-to-text speecht5 tts voice voice-assistant voice-assistants wakeword whisper whisper-model
Last synced: 01 Aug 2024
https://github.com/platisd/phonix
Generate captions for videos using the power of OpenAI's Whisper API
openai openai-api openai-whisper video-srt video-to-caption video-to-text whisper
Last synced: 01 Aug 2024
https://github.com/CrimeIsDown/trunk-transcribe
Transcription of calls from trunk-recorder using OpenAI Whisper
celery meilisearch openai-whisper telegram-bot trunk-recorder whisper
Last synced: 07 Aug 2024
https://github.com/Nachimak28/LAI-voice-search-openai-whisper-demo
A ⚡️ Lightning.ai ⚡️ app demo for Voice based web search using OpenAI's Whisper and DuckDuckGo
openai speech-to-text websearch whisper
Last synced: 29 Jul 2024
https://github.com/bzed/whisper-to-graphite
Read and send metrics from whisper files to graphite - Used to migrate to different graphite backends
golang graphite graphite-backends metrics migration whisper whisper-files
Last synced: 29 Jul 2024
https://github.com/IRedDragonICY/vixevia
An AI-powered Virtual YouTuber (Vtuber) utilizing Google's Gemini language model to create engaging, personalized, and context-aware interactions. This project explores the potential of AI in human-computer interaction and virtual content creation.
ai anime api artificial-intelligence chatbot collaborate gemini-api gemini-chatbot gemini-pro gemini-pro-vision gemini-vision-pro girl google javascript python vits vtuber waifu whisper youtuber
Last synced: 29 Jul 2024
https://github.com/GodModed/ai-captions
This small project uses OpenAI's whisper AI to generate captions for videos.
ai captions collaborate communityexchange github gitlens learn python student-vscode whisper
Last synced: 29 Jul 2024
https://github.com/fengredrum/finetune-whisper-lora
Fine-Tune Whisper with Transformers and PEFT
Last synced: 29 Jul 2024
https://github.com/CoHuK/gpt-telegram-bot
GPT/Whisper/DALL-E Telegram Bot with easy deployment using Chalice to AWS Lambda
aws-lambda dall-e gpt gpt-35-turbo telegram telegram-bot whisper
Last synced: 29 Jul 2024
https://github.com/miclast/FreePBX-Call-intrusion
Intrusion. Custom Asterisk dial plan for listen, whisper and barge in calls. For Asterisk FreePBX, Issabel, Asterisk based Elastix call centers.
asterisk barge call callcenter intrusion monitoring whisper
Last synced: 29 Jul 2024
https://github.com/neonwatty/bleep_that_sht
Make someone sound naughty - bleep out words of your choice leveraging Whisper transcription
ai demo-app generative-ai machine-learning transcribe transcription whisper
Last synced: 29 Jul 2024
https://github.com/QuantiusBenignus/BlahST
Input text from speech in any Linux window, the lean, fast and accurate way, using whisper.cpp offline.
accessibility ai bloat-free bloatfree cli command-line command-line-tool desktop-integration gnome kiss machine-learning no-nonsense speech-recognition speech-to-text whisper whisper-cpp
Last synced: 29 Jul 2024
https://github.com/eryk-mazus/sigh
background voice detection program that listens for a wake word and activates transcription mode
speech-recognition speech-to-text voice-recognition whisper
Last synced: 29 Jul 2024
https://github.com/hay/audio2text
Python command line utility wrappers for Whispercpp and other speech-to-text utilities
speech-recognition speech-to-text stt whisper whisper-cpp
Last synced: 04 Aug 2024
https://github.com/YvesCheung/Whisper
A set of annotations for code inspection
android annotation inspect lint whisper
Last synced: 29 Jul 2024
https://github.com/redocrepus/ahk-whisper-paste
Allows dictating anywhere in Windows using AutoHotKey and OpenAI's Whisper speech-to-text engine.
dictation openai openai-api text-to-speech voice-typing whisper whisper-ai windows
Last synced: 29 Jul 2024
https://github.com/lrq3000/futo-voiceinput-whisper
Mirror of FUTO's Voice Input, an Android Voice Keyboard for Speech-To-Text transcribing using Whisper, supporting large multilanguage models and with automatic language detection
android speech-to-text whisper
Last synced: 29 Jul 2024
https://github.com/XMuli/ThinkyMatePages
A clean and easy-to-use cross-platform desktop client for ChatGPT, the Spark (星火) large model, and other AI services; will support Windows, macOS, and Linux.
chatgpt cross-platform linux macos openai qt whisper
Last synced: 01 Aug 2024
https://github.com/legendsort/openAISpeechToDatabase
AI automation to save formatted text with proper title from speech
automation chatgpt dropbox notion openai whisper zapier
Last synced: 05 Aug 2024
https://github.com/hoangv97/ai-chatbot
Integrate ChatGPT, Dall-E, Whisper and other AI models in Replicate into Messenger and Telegram bot
bottender chatbot chatgpt dall-e2 messenger-bot replicate telegram-bot typescript whisper
Last synced: 29 Jul 2024
https://github.com/mharrvic/redhorse-ai-transcriber
Audio transcriber using OpenAI Whisper ML, deployed to Banana.dev
Last synced: 07 Aug 2024
https://github.com/voidful/whisper-live-asr-demo
run whisper on CPU/GPU server
Last synced: 29 Jul 2024
https://github.com/stayallive/whisper-subtitles
Generate subtitles (.srt and .vtt) from audio files using OpenAI's Whisper models.
Last synced: 29 Jul 2024
https://github.com/SrinadhVura/OpenAI-Stack-Hack
Medifix is an AI-powered assistant built on gpt-3.5 turbo (ChatGPT). It is designed to help people by suggesting preventive measures based on the symptoms they mention.
chatgpt gtts streamlit whisper
Last synced: 29 Jul 2024
https://github.com/erkara/Rise-of-Transfer-Learning
Brief code implementations of some of the latest developments in AI, including Stable Diffusion, Whisper, YOLO, and Hugging Face Transformers
gpt-3 huggingface openai stable-diffusion transfer-learning whisper yolov5
Last synced: 29 Jul 2024
https://github.com/VoXera/VoXera
An Open-Source Persian Language Techs Toolkit with Python
deep-learning deep-neural-networks keyword-extraction machine-learning natural-language-processing nlp openai persian persian-language speech-recognition speech-to-text text-processing vosk vosk-api whisper
Last synced: 04 Aug 2024
https://github.com/jxxe/murmur
A proof-of-concept transcription app
journalism mac macos transcribe transcription whisper
Last synced: 29 Jul 2024
https://github.com/detektor777/colab_list
colab list for video
ai colab-notebook colorization dain deblur enhance instcolorization nafnet real-esrgan transcribe upscaling video whisper
Last synced: 29 Jul 2024
https://github.com/xuegao-tzx/whisper_flutter_new
A Flutter library for offline speech-to-text conversion that uses whisper.cpp models, implemented for Android, iOS, and macOS.
android flutter ios whisper whisper-cpp
Last synced: 29 Jul 2024
https://github.com/Pmking27/AutoTalker
The project focuses on leveraging technology to create new courses, personalize existing ones, and enhance the assessment process, ultimately contributing to the development of 21st-century skills in students.
ai bark gdsc gdsc-dypsn gemini-api gemini-pro gen-ai ngo python solution-challenge-2024 stt subtitles tts video-creation whisper
Last synced: 29 Jul 2024
https://github.com/botisan-ai/whisper-aws-stack
Deploy Whisper on AWS Scalably
aws cdk ecs fargate fastapi openai silero-vad whisper
Last synced: 29 Jul 2024
https://github.com/redocrepus/arkode
Code in VS Code, using your voice, fmedia, WhisperAI and ChatGPT
accessibility chatgpt chatgpt-api code-assistant coding-assistant coding-by-voice developer-tools openai openai-api programming-assistant programming-by-voice visual-studio-code visual-studio-code-extension visualstudiocode voice-coding voicecode voicecoding vscode-extension whisper whisper-api
Last synced: 29 Jul 2024
https://github.com/redocrepus/Whisper-Paste
Chrome extension that allows dictating anywhere using OpenAI Whisper
chrome-extension dictation openai openai-api text-to-speech voice-recognition voice-typing whisper whisper-ai
Last synced: 29 Jul 2024
https://github.com/scalable-ml-deep-learning/fine_tune_whisper
Fine-Tune Whisper for Italian ASR with transformers
automatic-speech-recognition common-voice-dataset huggingface openai transformers whisper
Last synced: 29 Jul 2024
https://github.com/CrabAss/dCollab
Decentralized e-Learning Collaboration Platform as a Capstone Project (COMP4913, PolyU)
comp4913 dapp ethereum javascript react whisper
Last synced: 29 Jul 2024
https://github.com/BatuhanYilmaz26/Youtube-Transcriber
Input a YouTube video link and get a transcription as a .txt, .vtt or .srt file.
automatic-speech-recognition huggingface openai python speech-recognition streamlit whisper
Last synced: 29 Jul 2024
https://github.com/limdongjin/ignkafasr
Real-Time In-memory Speaker Verification and Speech Recognition Project using apache ignite, apache kafka, speechbrain, whisper, stomp, spring webflux, kubernetes(k8s)
apache-ignite apache-kafka asr audio-recorder google-kubernetes-engine k8s kubernetes speaker-recognition speaker-verification speech-recognition speechbrain springframework stomp stompwebsocket webflux whisper
Last synced: 29 Jul 2024
https://github.com/gustavz/audio-to-text
Streamlit app to transcribe audio to text using OpenAI's whisper library
audio-to-text streamlit whisper
Last synced: 29 Jul 2024
https://github.com/Shtirmann/V2T
Telegram bot which automatically transcribes all voice and video messages to text.
ai aiogram faster-whisper python telegram-bot telegram-bot-python voice-to-text whisper
Last synced: 29 Jul 2024
https://github.com/JoSuru/speeka
Speeaka is an open-source project that uses the Whisper model of OpenAI to transcribe audio into text. Its intuitive web interface makes it easy to use. Contributions are welcome.
open-source python python3 speech-to-text streamlit whisper
Last synced: 29 Jul 2024
https://github.com/chriamue/whisper-example
Docker compose environment and example for whisper.
docker-compose geth p2p-network shh web3js whisper
Last synced: 29 Jul 2024
https://github.com/fly-apps/cog-whisper
Run OpenAI Whisper as a Cog model on Fly GPUs
Last synced: 29 Jul 2024
https://github.com/seitzquest/RavenWhisperer
Listens to your voice and queries a language model for answers when a question is detected
Last synced: 05 Aug 2024
https://github.com/carloscdias/whisper-cpp-python
whisper.cpp bindings for python
python python3 whisper whisper-api whisper-cpp
Last synced: 29 Jul 2024
https://github.com/DivineUX23/Audio-to-Audio-translation
Imagine translating your speech or anybody's speech to any language you want within minutes. check this out...
chatgpt elevenlabs flask language translator whisper
Last synced: 29 Jul 2024
https://github.com/topdev0215/AudioMultifunctionChatbot
This app enables users to either record or upload audio files, then uses the OpenAI API (Whisper, GPT-4) to generate transcriptions, summaries, fact checks, sentiment analysis, and text metrics. Users can also chat about their transcriptions with a GPT-4 chatbot. Data is stored relationally in SQLite and also vectorized in Pinecone.
gpt4 langcha nltk openai python3 sqlite3 streamlit strean whisper
Last synced: 29 Jul 2024
https://github.com/toLSC/tolsc-speech-to-text
Speech to text service for toLSC app implemented with OpenAI Whisper model
fastapi python speech-recognition speech-to-text tts whisper
Last synced: 29 Jul 2024
https://github.com/marquesafonso/multilang-asr-captioner
A multilingual automatic speech recognition and video captioning tool using faster-whisper. Supports real-time translation to English. Runs on consumer-grade CPUs.
automatic-speech-recognition captioning-videos faster-whisper whisper
Last synced: 29 Jul 2024
https://github.com/TranBaVinhSon/eth-decentralized-chat
Decentralized chat app built on the Ethereum Whisper protocol + Vue.js
ethereum vue vuejs whisper whisper-protocol
Last synced: 29 Jul 2024
https://github.com/my-north-ai/semantic_audio_filtering
Synthetic data augmentation technique via LLM for Automatic Speech Recognition fine tuning.
automatic-speech-recognition fine-tuning synthetic-dataset-generation text-to-speech whisper
Last synced: 29 Jul 2024
https://github.com/Op27/meeting_minutes_generator
This Python application automates the process of generating meeting minutes from an audio recording. It uses the Whisper library for transcription and the OpenAI GPT models for summarizing content, then outputs the result in a Word document.
ai audio-processing document-automation meeting-minutes openai python speech-recognition text-summarization transcription whisper
Last synced: 29 Jul 2024
https://github.com/water25234/ChatREP
Summarize YouTube videos with ChatGPT and Whisper
chatgpt-api openai python python3 video whisper youtube
Last synced: 29 Jul 2024
https://github.com/rufuszhu/WhisperSRT
Generate subtitles for videos using Whisper and translate them into other languages using DeepL
openai python srt translation whisper
Last synced: 29 Jul 2024
https://github.com/platput/pysubs
API to get audio transcriptions for video files from YouTube, AWS S3, and similar sources using OpenAI Whisper
Last synced: 29 Jul 2024
https://github.com/umerarif01/ai-translator
AI Translator: Fast and Accurate Translations with Next.js and OpenAI's Whisper and GPT-3 APIs
Last synced: 29 Jul 2024
https://github.com/RoyNkem/SwiftUI-AI-Voice-Assistant
A multi-platform app for voice-based interactions built using SwiftUI with advanced AI capabilities.
gpt-4 ios macos mvvm openai-api swiftui text-to-speech visionos whisper
Last synced: 29 Jul 2024
https://github.com/nerdimite/meetsy-backend
AI Backend for the Workshop on Building an End-to-End AI Meeting Assistant
gpt-3 nextjs sentence-transformers tailwindcss whisper
Last synced: 29 Jul 2024
https://github.com/nerdimite/meetsy-app
Frontend for the Workshop on Building an End-to-End AI Meeting Assistant
gpt-3 nextjs sentence-transformers tailwindcss whisper
Last synced: 29 Jul 2024
https://github.com/marty1885/useful-whisper-server
Whisper server based on useful-transformers for the RK3588
npu rk3588 rockchip useful-transformers whisper
Last synced: 29 Jul 2024
https://github.com/lissettecarlr/AutomaticSpeechRecognition
Python wrappers for various speech-to-text implementations (paraformer, whisper_online, whisper_offline, funasr), built to serve the kuon repository
ai asr audio audio-processing deepl paraformer python speech-to-text text whisper
Last synced: 29 Jul 2024
https://github.com/h3yn3s/tl-dl
A selfhostable webapp which helps you read those uselessly long (by nature) voice messages with the power of AI.
Last synced: 29 Jul 2024
https://github.com/paddy41601/faster-whisper-cli
A command-line interface wrapper for Faster Whisper
faster-whisper openai quantization speech-recognition speech-to-text transformer whisper
Last synced: 29 Jul 2024
https://github.com/lifeosm/whisper
🐳 Docker image with OpenAI Whisper.
docker octolab speech-to-text whisper
Last synced: 29 Jul 2024