Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Whisper
Whisper is an autoregressive language model developed by OpenAI. It is trained on a large corpus of text using a transformer architecture and is capable of generating high-quality natural language text. Whisper can be used for tasks such as language modeling, text completion, and text generation. It has shown impressive performance on various benchmarks and has been released by OpenAI to encourage research in the field of language modeling. Whisper is not yet available for public use, but it has the potential to transform the field of natural language processing and generate new opportunities for language-based applications.
- GitHub: https://github.com/topics/whisper
- Repo: https://github.com/openai/whisper
- Created by: OpenAI
- Released: August 2021
- Related Topics: machine-learning, artificial-intelligence, language-modeling,
- Last updated: 2024-12-23 00:26:54 UTC
- JSON Representation
https://github.com/thewh1teagle/pyannote-rs
pyannote audio diarization in rust
asr diarization onnxruntime rust speech-recognition whisper
Last synced: 09 Oct 2024
https://github.com/thinh-vu/ur_audio_sub
Generate text captions for audio files & youtube video using OpenAI Whisper on Google Colab. Multiple languages support.
audio-to-text audio-transcription caption-generator speech-recognition whisper
Last synced: 07 Nov 2024
https://github.com/gha3mi/foropenai
ForOpenAI - A Fortran library for OpenAI API.
api chatgpt dall-e fortran fortran-package-manager gpt openai openai-api whisper
Last synced: 12 Dec 2024
https://github.com/transcripts4all/tools4all
A curated collection of tools to aid transcriptionists and subtitlers.
deaf google-colab google-colab-notebook hard-of-hearing ipython-notebook jupyter-notebook openai subtitles transcription whisper whisper-ai
Last synced: 11 Oct 2024
https://github.com/CoHuK/gpt-telegram-bot
GPT/Whisper/DALL-E Telegram Bot with easy deployment using Chalice to AWS Lambda
aws-lambda dall-e gpt gpt-35-turbo telegram telegram-bot whisper
Last synced: 24 Oct 2024
https://github.com/neka-nat/mylangrobot
Language instructions to mycobot using GPT-4V
chatgpt gpt-4-vision gpt-4-vision-preview gpt4v mycobot segment-anything whisper
Last synced: 14 Oct 2024
https://github.com/sloganking/desk-talk
A desktop transcription software
desktop dictation transcription whisper
Last synced: 13 Nov 2024
https://github.com/santima10/resumico
🤖 A WhatsApp bot to transcribe and summarize audio messages.
google-cloud-platform gpt-3 openai speech-to-text whatsapp-api whatsapp-bot whisper
Last synced: 07 Nov 2024
https://github.com/XMuli/ThinkyMatePages
Simple and easy to use desktop application for ChatGPT & AI, will supporting Window, MacOS, Linux platforms. | 洁且易用的 ChatGPT/星火大模型 & AI 的跨平台客户端
chatgpt cross-platform linux macos openai qt whisper
Last synced: 08 Nov 2024
https://github.com/xuegao-tzx/whisper_flutter_new
A flutter library for offline speech-to-text conversion which use whisper.cpp models implementation for Android、iOS、macOS.
android flutter ios whisper whisper-cpp
Last synced: 09 Oct 2024
https://github.com/pulijon/sttcast
Transcription from mp3 files to html with or without embedded player
ansible automation aws-ec2 aws-s3 g4dn gpu ia iac puppet python terraform transcription vagrant vosk-engine whisper
Last synced: 14 Oct 2024
https://github.com/m0rf30/shisper
A quick & dirty script to generate and view subtitles and transcriptions for your multimedia files using ggerganov/whisper.cpp
asr bash shisper whisper whispercpp
Last synced: 06 Dec 2024
https://github.com/machinelearningzh/audio-transcription
Transcribe any audio or video file. Edit and view your transcripts in a standalone HTML editor.
audio-transcription machine-learning whisper
Last synced: 02 Nov 2024
https://github.com/ieasybooks/almufarrigh
الواجهة الرسومية الخاصة بأداة تفريغ على أنظمة التشغيل المختلفة
ai audio-processing desktop linux macos python qt subtitles video-processing whisper windows wit
Last synced: 08 Nov 2024
https://github.com/xmuli/thinkymatepages
Simple and easy to use desktop application for ChatGPT & AI, will supporting Window, MacOS, Linux platforms. | 洁且易用的 ChatGPT/星火大模型 & AI 的跨平台客户端
chatgpt cross-platform linux macos openai qt whisper
Last synced: 25 Nov 2024
https://github.com/miclast/FreePBX-Call-intrusion
Intrusion. Custom Asterisk dial plan for listen, whisper and barge in calls. For Asterisk FreePBX, Issabel, Asterisk based Elastix call centers.
asterisk barge call callcenter intrusion monitoring whisper
Last synced: 24 Oct 2024
https://github.com/ssciwr/vink
A stand-alone application with GUI for OpenAI's Whisper
gui hacktoberfest iwr-hacktoberfest openai pyinstaller speech-to-text transcription whisper whisper-ai
Last synced: 09 Nov 2024
https://github.com/eryk-mazus/sigh
Seamless Voice Interactions with LLMs
llm speech-recognition speech-to-text voice-recognition whisper
Last synced: 24 Oct 2024
https://github.com/gumblex/whisper_vad
Whisper.cpp Speech-to-text with Voice Acticity Detection
speech-to-text whisper whisper-cpp
Last synced: 06 Nov 2024
https://github.com/nicolodiamante/notefy
Streamline your note-taking with ChatGPT's AI expertise and Whisper's precise transcription, enabling fast and efficient summarising.
ai-powered apple-notes apple-shortcuts chatgpt chatgpt-api gpt-4 gpt-4-turbo gpt-4o gpt-4o-mini gpt35turbo notes openai openai-api openai-chatgpt openai-whisper siri summarization summary whisper whisper-ai
Last synced: 20 Nov 2024
https://github.com/jjwroeloffs/transcribe_align_textgrid
A small wrapper package around whisper-timestamped. Create force-aligned transcription TextGrids from raw audio!
force-alignment praat speech-recognition speech-to-text textgrid whisper
Last synced: 01 Nov 2024
https://github.com/RoyNkem/SwiftUI-AI-Voice-Assistant
A multi-platform app for voice-based interactions built using SwiftUI with advanced AI capabilities.
gpt-4 ios macos mvvm openai-api swiftui text-to-speech visionos whisper
Last synced: 23 Oct 2024
https://github.com/SanHacks/AiGen
Multi Model Personal Assistant Wrapper in Go: Interact with ChatGPT, Claude or Ollama Cross Platform (Speech & Image generation supported)
chatbot gpt3-turbo openai speech-recognition speech-synthesis speech-to-text text-to-speech tts voice whisper
Last synced: 15 Nov 2024
https://github.com/stayallive/whisper-subtitles
Generate subtitles (.srt and .vtt) from audio files using OpenAI's Whisper models.
Last synced: 24 Oct 2024
https://github.com/mribeirodantas/nf-whisper
Proof-of-concept Nextflow pipeline to interact with OpenAI Whisper
docker nextflow pipeline speech-to-text transcription whisper
Last synced: 15 Oct 2024
https://github.com/t0mer/wassist
Wassist allows you to contact GPT3 directly from WhatsApp and not only that. Wassist also allows you to save your own personal data and later search and retrieve it using GPT3 to generate a response. In the examples folder, you can see several examples of how to use this bot so you don't have to remember anything ever again.
dall-e docker personal-assistant python weather whatsapp whisper
Last synced: 15 Oct 2024
https://github.com/YvesCheung/Whisper
一套用于代码检阅的注解
android annotation inspect lint whisper
Last synced: 24 Oct 2024
https://github.com/abus-aikorea/studio-free
youtube download, vocal remover, vocal extraction, karaoke video production, STT, automatic speech recognition, transcription, automatic subtitle, AI, yt-dlp, demucs, whisper, webui, gradio, windows
ai automatic-speech-recognition automatic-subtitle demucs gradio karaoke openai stt transcription video-download vocal-remover webui whisper windows yt-dlp
Last synced: 10 Nov 2024
https://github.com/decryptu/decryptgpt
A multifaceted ChatGPT Discord bot that harnesses discord.js, OpenAI's GPT-4o model, Whisper to understand voice messages, and Dall-E for image generation — engage in smart conversations, get voice messages transcribed, and have images analyzed directly within your Discord community.
chatgpt dall-e dalle discord discord-bot discord-js discordjs gpt gpt-3 gpt-4 nodejs openai whisper
Last synced: 12 Dec 2024
https://github.com/scalable-ml-deep-learning/fine_tune_whisper
Fine-Tune Whisper for Italian ASR with transformers
automatic-speech-recognition common-voice-dataset huggingface openai transformers whisper
Last synced: 24 Oct 2024
https://github.com/devanshu-17/transcriptiq
TranscriptIQ is a project that enables users to transcribe YouTube videos and perform various NLP (Natural Language Processing) tasks, chat with youtube video and many more on the transcribed text.
clarifai-python cohere streamlit whisper
Last synced: 22 Dec 2024
https://github.com/mj23978/openserver
Open Server is an OpenAI API Compatible Server for generating text, images, embeddings, and storing them in vector databases. It also includes a chat functionality.
autogen g4f image-generation langchain litellm llamacpp llm llmops openai stable vector-database whisper
Last synced: 14 Dec 2024
https://github.com/redocrepus/ahk-whisper-paste
Allows dictating anywhere in Windows using AutoHotKey and OpenAI's Whisper speech-to-text engine.
dictation openai openai-api text-to-speech voice-typing whisper whisper-ai windows
Last synced: 24 Oct 2024
https://github.com/status-im/status-js-api
Status Javascript Client (WIP)
ethereum javascript shh status-im web3 web3js whisper
Last synced: 01 Nov 2024
https://github.com/sanhacks/aigen
Multi Model Personal Assistant Wrapper in Go: Interact with ChatGPT, Claude or Ollama Cross Platform (Speech & Image generation supported)
chatbot gpt3-turbo openai speech-recognition speech-synthesis speech-to-text text-to-speech tts voice whisper
Last synced: 09 Oct 2024
https://github.com/sepiropht/auto-subtitle
Automatic subtitles in your videos
ffmpeg openai subtitles subtitles-generator whisper
Last synced: 09 Oct 2024
https://github.com/flyingfathead/telegrambot-openai-api
A feature-rich Python-based Telegram bot for OpenAI API & Perplexity API
bot bot-framework chatbot gpt-3-5-turbo gpt-35-turbo gpt-4 gpt-4-api gpt4-api openai openai-api openai-api-chatbot perplexity-api telegram telegram-bot telegram-bot-api telegram-bot-app whisper whisper-ai whisper-api
Last synced: 12 Nov 2024
https://github.com/cp3249/splaa
SPLAA is an AI assistant framework that utilizes voice recognition, text-to-speech, and tool-calling capabilities to provide a conversational and interactive experience. It uses LLMs available through Ollama and has capabilities for extending functionalities through a modular tool system.
Last synced: 03 Dec 2024
https://github.com/hay/audio2text
Python command line utility wrappers for Whispercpp and other speech-to-text utilities
speech-recognition speech-to-text stt whisper whisper-cpp
Last synced: 15 Oct 2024
https://github.com/bhattbhavesh91/whisper-youtube
This repository will guide you to create automatically generate YouTube Transcription using Using OpenAI's Whisper
automatic-speech-recognition ffmpeg openai openai-gym python pytube subtitles whisper youtube youtube-dl
Last synced: 16 Nov 2024
https://github.com/hisano/openai-whisper-on-docker
OpenAI Whisper on Docker
Last synced: 10 Nov 2024
https://github.com/natehouk/flow-ai-hackathon-2023
YASS.ai - Team Orange's entry to the Flow AI Hackathon 2023
ai chatgpt chatgpt-api django gpt-3-5-turbo gpt-4 marketaux-api newsapi openai openai-api python3 whisper whisper-ai whisper-api
Last synced: 14 Oct 2024
https://github.com/luquedaniel/whisper2subs
A CLI tool that transcribes audio using openai-whisper and translates it using DeepL.
audio cli deepl subtitle transcribe translate video weekend-project whisper
Last synced: 11 Oct 2024
https://github.com/kurianbenoy/malayalam_asr_benchmarking
A study to benchmark whisper based ASRs in Malayalam
asr benchmarking speech transformers-library whisper
Last synced: 14 Oct 2024
https://github.com/jim60105/aichatassistant
Stream YouTube live to OpenAI, get AI-generated summaries and real-time reply options. (Chrome Extension)
chrome-extension openai typescript whisper youtube
Last synced: 23 Oct 2024
https://github.com/evilfreelancer/docker-whisper-server
whisper.cpp HTTP transcription server with OpenAI-like API in Docker
api api-server asr cuda docker docker-compose dockerfile nvidia openai openai-api whisper whisper-cpp
Last synced: 09 Oct 2024
https://github.com/nssharmaofficial/reddit-hole
Automated reddit scraper and video creator
amazon-polly amazon-polly-api automation aws captioning openai openai-whisper reddit reddit-bot reddit-crawler reddit-scraper tts whisper
Last synced: 09 Oct 2024
https://github.com/voidful/whisper-live-asr-demo
run whisper on CPU/GPU server
Last synced: 24 Oct 2024
https://github.com/byigitt/transcriptor
create transcripts with youtube links on google colab using whisper ai
python python3 transcript transcription whisper whisper-ai
Last synced: 22 Dec 2024
https://github.com/redocrepus/Whisper-Paste
Chrome extension that allows dictating anywhere using OpenAI Whisper
chrome-extension dictation openai openai-api text-to-speech voice-recognition voice-typing whisper whisper-ai
Last synced: 24 Oct 2024
https://github.com/kardbord/gopenai
Unofficial Go (Golang) bindings for the OpenAI API.
chatgpt-api chatgpt3 dalle2 dalle3 go golang gpt-3 gpt-35-turbo gpt-4 image-generation nlp openai openai-api text-to-image whisper whisper-ai
Last synced: 25 Nov 2024
https://github.com/nik-kras/live_asr_whisper_gradio
Real Time Speech To Text with corrections powered by Gradio
asr faster-whisper gradio speech-to-text whisper
Last synced: 23 Nov 2024
https://github.com/neka-nat/stenocaptioner
CLI tool for automatic subtitling using whisper.
python subtitles subtitles-generator whisper
Last synced: 14 Oct 2024
https://github.com/legendsort/openAISpeechToDatabase
AI automation to save formatted text with proper title from speech
automation chatgpt dropbox notion openai whisper zapier
Last synced: 22 Nov 2024
https://github.com/egorsmkv/optimized-whisper
Use quantized versions of Whisper to speed up inference
faster-whisper hqq quantization whisper
Last synced: 18 Oct 2024
https://github.com/qqxufo/whisper-nodejs
whisper-nodejs is an npm package for using OpenAI's Whisper API to transcribe and translate audio. With whisper-nodejs, you can easily convert audio files into text and translate them into English or other supported languages.
nodejs openai whisper whisper-nodejs
Last synced: 13 Nov 2024
https://github.com/hiradary/simplewhisper
A simple speech-to-text transcription interface using OpenAI's Whisper API.
openai speech-to-text whisper whisper-ai
Last synced: 09 Oct 2024
https://github.com/mbotsu/mlx_speech2text
Audio transcription using mlx whisper and vad silence processing
Last synced: 09 Oct 2024
https://github.com/oddlama/whisper-overlay
A wayland overlay providing speech-to-text functionality for any application via a global push-to-talk hotkey
faster-whisper hyprland realtime speech-recognition speech-to-text wayland whisper wlroots
Last synced: 09 Oct 2024
https://github.com/niawjunior/vision-speak
CameraVision: Capture, Analyze - Seamlessly integrate image analysis using GPT-4 Vision API and convert text to speech with Whisper AI
Last synced: 02 Dec 2024
https://github.com/SrinadhVura/OpenAI-Stack-Hack
Our Medifix is an AI powered assistant powered on gpt-3.5 turbo (chatGPT). Medifix is designed to help people by providing preventive measures based on the symptoms mentioned.
chatgpt gtts streamlit whisper
Last synced: 24 Oct 2024
https://github.com/hoangv97/ai-chatbot
Integrate ChatGPT, Dall-E, Whisper and other AI models in Replicate into Messenger and Telegram bot
bottender chatbot chatgpt dall-e2 messenger-bot replicate telegram-bot typescript whisper
Last synced: 24 Oct 2024
https://github.com/kennethleungty/chatpod
ChatPod - Q&A over your Podcasts
chatgpt data-science deep-learning gen-ai generative-ai machine-learning natural-language-processing nlp openai transformers whisper
Last synced: 22 Nov 2024
https://github.com/chetanxpro/autosub
Automatically generate and overlay subtitles for any video.
ai ffmpeg nodejs-whisper openai-whisper subtitles subtitles-generator whisper
Last synced: 15 Nov 2024
https://github.com/mharrvic/redhorse-ai-transcriber
Audio transcriber using Openai whisper ML deployed to Banana.dev
Last synced: 15 Nov 2024
https://github.com/openvoiceos/ovos-docker-stt
Open Voice OS Speech-to-Text (STT) container images and docker-compose.yml file for x86_64 CPU architecture.
fasterwhisper openvoiceos ovos speech-to-text stt whisper
Last synced: 19 Nov 2024
https://github.com/moebiussurfing/ofxsurfingtextsubtitle
Draws subtitles from an .SRT (or plain text) into a formatted styled paragraph with fading opacity and more.
openframeworks openframeworks-addon whisper whisper-cpp
Last synced: 27 Oct 2024
https://github.com/olololoe110399/mikasa_gpt
🚀 MiksaGPT, part of the 'Miksa' project, is a groundbreaking voice assistant utilizing Claude 3 and APIs from 'anthropic' and 'elevenlabs'. It enables real-time Opus two-way voice chat with seamless interruptibility, built with Flutter and available for free on GitHub.
aivoice artificialintelligence claude claudeai elevenlabs flutterai flutterprogramming flutterprojects openai opensource opensourceai opus speechtotext whisper
Last synced: 22 Dec 2024
https://github.com/gabrielrf/voice2text
Descrição automática de mensagens de voz em conversas privadas no Telegram
automation openai openai-whisper pyrogram telegram transcription whisper
Last synced: 13 Dec 2024
https://github.com/jxxe/murmur
A proof-of-concept transcription app
journalism mac macos transcribe transcription whisper
Last synced: 24 Oct 2024
https://github.com/princejoogie/chunktube
It's YouTube.. but text!
gpt-3 openai react typescript whisper
Last synced: 09 Nov 2024
https://github.com/navalnica/whisper-finetuning-be
Finetuning Whisper ASR model for Belarusian language
asr belarus belarusian belarusian-language speech-recognition speech-to-text stt wfte whisper whisper-event
Last synced: 13 Nov 2024
https://github.com/ebowwa/llm_telecenter
A fastapi wrapper of babca / python-gsmmodem for a waveshare sim7600x. Not an exact copy of the 'python-gsmmodem' so be sure to uninstall that lib or venv to run | Open-source Twilio with LLM batteries
agentgpt deepgram elevenlabs elevenlabs-api gsm gsm-modem gsm-module langchain langchain-python llama2 llamacpp mistral-7b mistralai oai openai openai-api pyserial raspberry-pi salesgpt whisper
Last synced: 29 Nov 2024
https://github.com/BatuhanYilmaz26/Youtube-Transcriber
Input a YouTube video link and get a transcription as a .txt, .vtt or .srt file.
automatic-speech-recognition huggingface openai python speech-recognition streamlit whisper
Last synced: 24 Oct 2024
https://github.com/paddy41601/faster-whisper-cli
A command-line interface wrapper for Faster Whisper
faster-whisper openai quantization speech-recognition speech-to-text transformer whisper
Last synced: 24 Oct 2024
https://github.com/royceschultz/ComfyUI-TranscriptionTools
ComfyUI nodes for transcription on audio or video input.
comfyui comfyui-nodes openai-whisper transcription whisper
Last synced: 19 Dec 2024
https://github.com/detektor777/colab_list
colab list for video
ai colab-notebook colorization dain deblur enhance instcolorization nafnet real-esrgan transcribe upscaling video whisper
Last synced: 24 Oct 2024
https://github.com/ignabelitzky/easy-subber
A Python-based tool that that takes video files and generates .srt subtitle files using Whisper for speech recognition, FFmpeg for audio processing, and a simple Tkinter GUI
ffmpeg gui python speech-recognition srt subtitles tkinter transcription video-processing whisper
Last synced: 22 Oct 2024
https://github.com/gorkemkaramolla/whisper-run
Faster Whisper with Speaker Diarization
distil-whisper faster-whisper openai pyannote speaker-diarization speech-recognition transcription whisper whisper-large
Last synced: 09 Oct 2024
https://github.com/avencores/python-openai-cli
🤖 A simple utility for interfacing with the OpenAI API 🤖
api chatgpt chatgpt-api chatgpt-app chatgpt-python chatgpt3 chatgpt4 dalle-2 dalle-e dalle2 open-source openai openai-api openai-cli python python-3 python-script python-scripts python3 whisper
Last synced: 18 Nov 2024
https://github.com/lissettecarlr/AutomaticSpeechRecognition
语音转文本的各类python封装实现(paraformer、whisper_online、whisper_offline、funasr),用于服务kuon仓库
ai asr audio audio-processing deepl paraformer python speech-to-text text whisper
Last synced: 24 Oct 2024
https://github.com/shonharsh/horizonforbiddenwest-shardmacro-logitechghub
A macro to get free metal shards in Horizon Forbidden West using Logitech G Hub
automation bow commands config forbidden free game ghub hack horizon horizon-forbidden-west hunter macro pc script sell shards west whisper windows
Last synced: 11 Oct 2024
https://github.com/lissettecarlr/automaticspeechrecognition
语音转文本的各类python封装实现(paraformer、whisper_online、whisper_offline、funasr),用于服务kuon仓库
ai asr audio audio-processing deepl paraformer python speech-to-text text whisper
Last synced: 19 Nov 2024
https://github.com/pablocerdeira/whatsapp-bot
This project is an advanced WhatsApp bot that leverages artificial intelligence for automated audio transcription, document summarization, and scheduling of future messages. It uses Whisper for transcription and offers a choice between OpenAI's API and the Ollama local model for document summarization.
api api-rest artificial-intelligence automation bot ollama openai whatsapp whatsapp-bot whisper
Last synced: 03 Dec 2024
https://github.com/gcoter/extract-keywords-from-youtube-videos
This project combines youtube-dl, whisper, LangChain and ChatGPT to extract keywords from YouTube videos. It was intented as a tool for Lyon Data Science to better reference its videos.
chatgpt langchain whisper youtube-dl
Last synced: 24 Oct 2024
https://github.com/cansik/speech-to-text-osc
Speech to text with OSC output.
Last synced: 13 Dec 2024
https://github.com/coderscreative/faster-whisper-rs
a rust crate for easily implementing faster-whisper stt into your rust programs.
ai faster-whisper rust speech-recognition speech-to-text stt whisper
Last synced: 09 Oct 2024
https://github.com/flyingfathead/whisper-transcriber-telegram-bot
Python-based Whisper transcriber bot for Telegram
openai-whisper python telegram telegram-bot telegram-bot-api transcribe transcriber transcription whisper yt-dlp yt-dlp-wrapper
Last synced: 12 Nov 2024
https://github.com/VoXera/VoXera
An Open-Source Persian Language Techs Toolkit with Python
deep-learning deep-neural-networks keyword-extraction machine-learning natural-language-processing nlp openai persian persian-language speech-recognition speech-to-text text-processing vosk vosk-api whisper
Last synced: 20 Nov 2024
https://github.com/drakerossman/state-of-art-ai
State of Art AI models you can run locally.
ai chatgpt deep-learning gpt large-language-models llm machine-learning stable-diffusion transformers whisper
Last synced: 06 Nov 2024
https://github.com/yjg30737/whisper_transcribe_youtube_video_example_gui
GUI Showcase of using Whisper to transcribe and analyze Youtube video
audio-to-text pyqt pyqt5 pyqt5-desktop-application python pytube qt whisper
Last synced: 06 Dec 2024
https://github.com/kristofferv98/voiceprocessingtoolkit
The VoiceProcessingToolkit is an all-encompassing suite designed for sophisticated voice detection, wake word recognition, text-to-speech synthesis, and advanced audio processing. It offers intuitive interfaces to streamline the integration of voice processing capabilities into your applications
api audio automation elevenlabs gpt-4 multithreading openai picovoice python speech text-to-speech transcription utility voice voice-processing wake-word whisper whisper-api
Last synced: 02 Nov 2024
https://github.com/semyon-dev/whissage
the backend of blockchain-based messenger
blockchain blockchain-messenger ethereum geth messenger whisper whisper-protocol
Last synced: 30 Oct 2024