Projects in Awesome Lists tagged with whisper
A curated list of projects in awesome lists tagged with whisper.
https://github.com/ggml-org/whisper.cpp
Port of OpenAI's Whisper model in C/C++
inference openai speech-recognition speech-to-text transformer whisper
Last synced: 12 May 2025
https://github.com/ggerganov/whisper.cpp
Port of OpenAI's Whisper model in C/C++
inference openai speech-recognition speech-to-text transformer whisper
Last synced: 01 Apr 2025
https://github.com/systran/faster-whisper
Faster Whisper transcription with CTranslate2
deep-learning inference openai quantization speech-recognition speech-to-text transformer whisper
Last synced: 12 May 2025
https://github.com/m-bain/whisperx
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
asr speech speech-recognition speech-to-text whisper
Last synced: 12 May 2025
https://github.com/SYSTRAN/faster-whisper
Faster Whisper transcription with CTranslate2
deep-learning inference openai quantization speech-recognition speech-to-text transformer whisper
Last synced: 24 Mar 2025
https://github.com/chidiwilliams/buzz
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
Last synced: 12 May 2025
https://github.com/chidiwilliams/Buzz
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
Last synced: 01 Apr 2025
https://github.com/guillaumekln/faster-whisper
Faster Whisper transcription with CTranslate2
deep-learning inference openai quantization speech-recognition speech-to-text transformer whisper
Last synced: 14 Dec 2024
https://github.com/paddlepaddle/paddlespeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
asr code-switch conformer kws punctuation-restoration self-supervised-learning sound-classification speech-alignment speech-recognition speech-synthesis speech-translation streaming-asr streaming-tts transformer tts vocoder voice-cloning voice-recognition wav2vec2 whisper
Last synced: 12 May 2025
https://github.com/PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
asr code-switch conformer kws punctuation-restoration self-supervised-learning sound-classification speech-alignment speech-recognition speech-synthesis speech-translation streaming-asr streaming-tts transformer tts vocoder voice-cloning voice-recognition wav2vec2 whisper
Last synced: 24 Mar 2025
https://github.com/modelscope/funasr
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
audio-visual-speech-recognition conformer dfsmn paraformer pretrained-model punctuation pytorch rnnt speaker-diarization speech-recognition speechgpt speechllm vad voice-activity-detection whisper
Last synced: 16 May 2025
https://github.com/m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
asr speech speech-recognition speech-to-text whisper
Last synced: 14 Mar 2025
https://github.com/niedev/rtranslator
Open source real-time translation app for Android that runs locally
android android-app bluetooth-le mobile-app nllb offline onnx onnxruntime realtime-translator sentencepiece transformers translation translator whisper
Last synced: 13 May 2025
https://github.com/xorbitsai/inference
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
artificial-intelligence chatglm deployment flan-t5 gemma ggml glm4 inference llama llama3 llamacpp llm machine-learning mistral openai-api pytorch qwen vllm whisper wizardlm
Last synced: 13 May 2025
https://github.com/niedev/RTranslator
Open source real-time translation app for Android that runs locally
android android-app bluetooth-le mobile-app nllb offline onnx onnxruntime realtime-translator sentencepiece transformers translation translator whisper
Last synced: 24 Mar 2025
https://github.com/modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
audio-visual-speech-recognition conformer dfsmn paraformer pretrained-model punctuation pytorch rnnt speaker-diarization speech-recognition speechgpt speechllm vad voice-activity-detection whisper
Last synced: 24 Mar 2025
https://github.com/zackriya-solutions/meeting-minutes
A free and open-source, self-hosted, AI-based live meeting note taker and minutes summary generator that can run entirely on your local device (macOS and Windows support added; Linux support is planned). https://meetily.zackriya.com/
ai automation cross-platform linux live llm mac macos-app meeting-minutes meeting-notes recorder rust transcript transcription whisper whisper-cpp windows
Last synced: 14 May 2025
https://github.com/sanchit-gandhi/whisper-jax
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
deep-learning jax speech-recognition speech-to-text whisper
Last synced: 14 May 2025
https://github.com/argmaxinc/whisperkit
On-device Speech Recognition for Apple Silicon
inference ios macos speech-recognition swift transformers visionos watchos whisper
Last synced: 13 May 2025
https://github.com/nexaai/nexa-sdk
Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language models (VLM), Audio Language Model, auto-speech-recognition (ASR), and text-to-speech (TTS) capabilities.
asr audio edge-computing language-model llm on-device-ai on-device-ml sdk sdk-python stable-diffusion transformers tts vlm whisper
Last synced: 11 May 2025
https://github.com/mahmoudashraf97/whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
asr speaker-diarization speech speech-recognition speech-to-text whisper
Last synced: 13 May 2025
https://github.com/argmaxinc/WhisperKit
On-device Speech Recognition for Apple Silicon
inference ios macos speech-recognition swift transformers visionos watchos whisper
Last synced: 28 Mar 2025
https://github.com/NexaAI/nexa-sdk
Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language models (VLM), Audio Language Model, auto-speech-recognition (ASR), and text-to-speech (TTS) capabilities.
asr audio edge-computing language-model llm on-device-ai on-device-ml sdk sdk-python stable-diffusion transformers tts vlm whisper
Last synced: 07 Feb 2025
https://github.com/MahmoudAshraf97/whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
asr speaker-diarization speech speech-recognition speech-to-text whisper
Last synced: 28 Mar 2025
https://github.com/leetcode-mafia/cheetah
Mac app for crushing tech interviews with AI
ai chatgpt gpt gpt-4 openai swift swiftui whisper whisper-cpp
Last synced: 14 May 2025
https://github.com/wenet-e2e/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
asr automatic-speech-recognition conformer e2e-models production-ready pytorch speech-recognition transformer whisper
Last synced: 13 May 2025
https://github.com/huggingface/distil-whisper
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
audio speech-recognition whisper
Last synced: 29 Apr 2025
https://github.com/embarklabs/embark
Framework for serverless Decentralized Applications using Ethereum, IPFS and other platforms
blockchain dapp decentralized ethereum framework ipfs serverless smart-contracts swarm whisper
Last synced: 28 Apr 2025
https://github.com/embark-framework/embark
Framework for serverless Decentralized Applications using Ethereum, IPFS and other platforms
blockchain dapp decentralized ethereum framework ipfs serverless smart-contracts swarm whisper
Last synced: 03 Mar 2025
https://iurimatias.github.io/embark-framework
Framework for serverless Decentralized Applications using Ethereum, IPFS and other platforms
blockchain dapp decentralized ethereum framework ipfs serverless smart-contracts swarm whisper
Last synced: 18 Feb 2025
https://github.com/iurimatias/embark-framework
Framework for serverless Decentralized Applications using Ethereum, IPFS and other platforms
blockchain dapp decentralized ethereum framework ipfs serverless smart-contracts swarm whisper
Last synced: 10 Feb 2025
https://github.com/abus-aikorea/voice-pro
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.
audiobook faster-whisper gradio karaoke podcasts speech-recognition speech-synthesis speech-to-text subtitles text-to-speech transcription translator tts voice-cloning voice-conversion webui whisper whisperx yt-dlp
Last synced: 14 May 2025
https://github.com/grt1228/chatgpt-java
ChatGPT Java SDK with support for streaming output, GPT plugins, and web access. Supports all official OpenAI APIs. A Java client for ChatGPT. OpenAI GPT-3.5-Turbo and GPT-4 API client for Java.
chatgpt chatgpt-java gpt-35-turbo gpt-4 gpt-plugins java openai-api openai-chatgpt openai-images openai-whisper tiktoken-java whisper
Last synced: 10 Apr 2025
https://github.com/Grt1228/chatgpt-java
ChatGPT Java SDK with support for streaming output, GPT plugins, and web access. Supports all official OpenAI APIs. A Java client for ChatGPT. OpenAI GPT-3.5-Turbo and GPT-4 API client for Java.
chatgpt chatgpt-java gpt-35-turbo gpt-4 gpt-plugins java openai-api openai-chatgpt openai-images openai-whisper tiktoken-java whisper
Last synced: 02 Apr 2025
https://github.com/n3d1117/chatgpt-telegram-bot
🤖 A Telegram bot that integrates with OpenAI's official ChatGPT APIs to provide answers, written in Python
chatgpt dall-e openai python telegram-bot whisper
Last synced: 13 May 2025
https://github.com/betalgo/openai
.NET library for the OpenAI service API by Betalgo Ranul
azure-openai chatgpt csharp dall-e dotnet gpt-3 gpt-4 openai openai-api ranul sdk tinga whisper whisper-ai
Last synced: 29 Apr 2025
https://github.com/xenova/whisper-web
ML-powered speech recognition directly in your browser
javascript transformers whisper
Last synced: 15 May 2025
https://github.com/samuraigpt/embedai
An app to interact privately with your documents using the power of GPT, 100% privately, no data leaks
chatbot chatgpt embedai embeddings generative gpt gpt4 gpt4all langchain models openai privategpt vectorstore whisper
Last synced: 15 May 2025
https://github.com/SamurAIGPT/EmbedAI
An app to interact privately with your documents using the power of GPT, 100% privately, no data leaks
chatbot chatgpt embedai embeddings generative gpt gpt4 gpt4all langchain models openai privategpt vectorstore whisper
Last synced: 14 Mar 2025
https://github.com/heywillow/willow
Open source, local, and self-hosted Amazon Echo/Google Home competitive Voice Assistant alternative
alexa deep-learning echo esp-adf esp-idf esp32 google-home home-assistant home-automation privacy speech-recognition speech-to-text whisper
Last synced: 14 May 2025
https://github.com/HeyWillow/willow
Open source, local, and self-hosted Amazon Echo/Google Home competitive Voice Assistant alternative
alexa deep-learning echo esp-adf esp-idf esp32 google-home home-assistant home-automation privacy speech-recognition speech-to-text whisper
Last synced: 04 Apr 2025
https://github.com/toverainc/willow
Open source, local, and self-hosted Amazon Echo/Google Home competitive Voice Assistant alternative
alexa deep-learning echo esp-adf esp-idf esp32 google-home home-assistant home-automation privacy speech-recognition speech-to-text whisper
Last synced: 27 Mar 2025
https://github.com/thewh1teagle/vibe
Transcribe on your own!
ai cross-platform desktop openai rust transcribe whisper
Last synced: 14 May 2025
https://github.com/linto-ai/whisper-timestamped
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
asr attention-is-all-you-need attention-mechanism attention-model attention-network attention-seq2seq attention-visualization deep-learning machine-learning multilingual-models python python3 pytorch speaker-diarization speech speech-processing speech-recognition speech-to-text transformers whisper
Last synced: 13 May 2025
https://github.com/chenyme/chenyme-aavt
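A fully automatic (audio) video translation project. It uses Whisper to recognize speech, AI large language models to translate the subtitles, and then merges the subtitles into the video to produce the translated video.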
faster-whisper gpt-4 gpt-4o speech-recognition video-translation whisper
Last synced: 14 May 2025
https://github.com/buxuku/smartsub
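「妙幕」 (SmartSub) is a cross-platform client tool that batch-generates subtitle files for video or audio and supports translating the subtitles through multiple translation providers, including Baidu, Volcengine, OpenAI, Ollama, and DeepSeek.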
deepseek electron nodejs ollama openai subtitle translate whisper whisper-cpp
Last synced: 14 May 2025
https://github.com/cheshirecc/faster-whisper-gui
faster_whisper GUI with PySide6
asr faster-whisper openai transcribe vad voice-transcription whisper whisperx
Last synced: 14 May 2025
https://github.com/chenyme/Chenyme-AAVT
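A fully automatic (audio) video translation project. It uses Whisper to recognize speech, AI large language models to translate the subtitles, and then merges the subtitles into the video to produce the translated video.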
faster-whisper gpt-4 gpt-4o speech-recognition video-translation whisper
Last synced: 16 Mar 2025
https://github.com/pluja/whishper
Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!
ai audio-to-text golang speech-recognition speech-to-text stt subtitles sveltekit transcription ui web web-whisper webapp whisper
Last synced: 14 May 2025
https://github.com/collabora/whisperlive
A nearly-live implementation of OpenAI's Whisper.
dictation obs openai tensorrt tensorrt-llm text-to-speech translation voice-recognition whisper whisper-tensorrt
Last synced: 09 Apr 2025
https://github.com/collabora/WhisperLive
A nearly-live implementation of OpenAI's Whisper.
dictation obs openai tensorrt tensorrt-llm text-to-speech translation voice-recognition whisper whisper-tensorrt
Last synced: 07 Apr 2025
https://github.com/jhj0517/whisper-webui
A web UI for easy subtitle generation using the Whisper model.
ai gradio open-source python pytorch web-ui whisper
Last synced: 14 May 2025
https://github.com/CheshireCC/faster-whisper-GUI
faster_whisper GUI with PySide6
asr faster-whisper openai transcribe vad voice-transcription whisper whisperx
Last synced: 17 Jan 2025
https://github.com/purfview/whisper-standalone-win
Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
asr ctranslate2 diarization faster-whisper openai speaker-diarization speech-recognition speech-to-text subtitles transcriber uvr vocal-extractor whisper whisper-faster whisperx
Last synced: 14 May 2025
https://github.com/floneum/floneum
Instant, controllable, local pre-trained AI models in Rust
ai candle constrained-generation dioxus floneum-v3 kalosm llama llamacpp llm mistral rust transcription whisper
Last synced: 13 May 2025
https://github.com/m1guelpf/auto-subtitle
Automatically generate and overlay subtitles for any video.
ffmpeg openai-whisper subtitle-generator subtitles subtitles-generator whisper
Last synced: 14 May 2025
https://github.com/Purfview/whisper-standalone-win
Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
asr ctranslate2 diarization faster-whisper openai speaker-diarization speech-recognition speech-to-text subtitles transcriber uvr vocal-extractor whisper whisper-faster whisperx
Last synced: 28 Mar 2025
https://github.com/fl33tw00d/whisper-turbo
Cross-Platform, GPU Accelerated Whisper 🏎️
audio machine-learning rust speech-recognition webgpu whisper windows
Last synced: 15 May 2025
https://github.com/FL33TW00D/whisper-turbo
Cross-Platform, GPU Accelerated Whisper 🏎️
audio machine-learning rust speech-recognition webgpu whisper windows
Last synced: 04 Apr 2025
https://github.com/jhj0517/Whisper-WebUI
A web UI for easy subtitle generation using the Whisper model.
ai gradio open-source python pytorch web-ui whisper
Last synced: 06 Mar 2025
https://github.com/Aallam/openai-kotlin
OpenAI API client for Kotlin with multiplatform and coroutines capabilities.
api chatgpt client coroutines dall-e gpt kotlin llm multiplatform openai whisper
Last synced: 24 Apr 2025
https://github.com/aallam/openai-kotlin
OpenAI API client for Kotlin with multiplatform and coroutines capabilities.
api chatgpt client coroutines dall-e gpt kotlin llm multiplatform openai whisper
Last synced: 14 May 2025
https://github.com/Chenyme/Chenyme-AAVT
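A fully automatic (audio) video translation project. It uses Whisper to recognize speech, AI large language models to translate the subtitles, and then merges the subtitles into the video to produce the translated video.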
faster-whisper gpt-4 gpt-4o speech-recognition video-translation whisper
Last synced: 11 Apr 2025
https://github.com/absadiki/subsai
🎞️ Subtitles generation tool (Web-UI + CLI + Python package) powered by OpenAI's Whisper and its variants 🎞️
cli subtitles subtitles-generator webui whisper whisper-ai
Last synced: 14 May 2025
https://github.com/m1guelpf/yt-whisper
Using OpenAI's Whisper to automatically generate YouTube subtitles
ffmpeg openai openai-whisper subtitles subtitles-generated transcribe whisper youtube youtube-dl
Last synced: 16 May 2025
https://github.com/umlx5h/LLPlayer
The media player for language learning, with dual subtitles, AI-generated subtitles, real-time translation, and more!
asr csharp faster-whisper flyleaf language-learning llm media-player ocr ollama player video video-player whisper wpf yt-dlp
Last synced: 21 Apr 2025
https://github.com/abdeladim-s/subsai
🎞️ Subtitles generation tool (Web-UI + CLI + Python package) powered by OpenAI's Whisper and its variants 🎞️
cli subtitles subtitles-generator webui whisper whisper-ai
Last synced: 12 Dec 2024
https://github.com/vercel/modelfusion
The TypeScript library for building AI applications.
ai artificial-intelligence chatbot claude dall-e embedding gpt-3 huggingface javascript js llamacpp llm mistral multi-modal ollama openai stable-diffusion ts typescript whisper
Last synced: 15 May 2025
https://github.com/graphite-project/whisper
Whisper is a file-based time-series database format for Graphite.
graphite graphite-components library metrics python time-series whisper
Last synced: 14 May 2025
https://github.com/lenml/speech-ai-forge
🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.
agent asr chattts chattts-forge chinese colab cosy-voice cosyvoice english firered fireredtts fish-speech gpt llama llm ssml stt text-to-speech tts whisper
Last synced: 15 May 2025
https://github.com/ntegrals/aura-voice
Aura is like Siri, but in your browser. An AI voice assistant optimized for low latency responses.
artificial-intelligence elevenlabs gpt-3 gpt-4 langchain nextjs openai vercel whisper whisper-cpp
Last synced: 14 May 2025
https://github.com/robitx/gp.nvim
Gp.nvim (GPT prompt) Neovim AI plugin: ChatGPT sessions & Instructable text/code operations & Speech to text [OpenAI, Ollama, Anthropic, ..]
claude codeium copilot gemini gpt-4o gpt4o llm lua mistral neovim nvim ollama parrot perplexity sonnet speech-to-text stt vim voice whisper
Last synced: 14 May 2025
https://github.com/lgrammel/ai-utils.js
The TypeScript library for building AI applications.
ai artificial-intelligence chatbot claude dall-e embedding gpt-3 huggingface javascript js llamacpp llm mistral multi-modal ollama openai stable-diffusion ts typescript whisper
Last synced: 03 Mar 2025
https://github.com/yeyupiaoling/whisper-finetune
Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deployment, Windows desktop deployment, and Android deployment
android asr chinese ctranslate2 huggingface lora pytorch speech-recognition transformers web whisper
Last synced: 14 May 2025
https://github.com/microsoft/ai-dev-gallery
An open-source project for Windows developers to learn how to add AI with local models and APIs to Windows apps.
ai csharp developer-tools directml dotnet genai mistral npu onnx onnxruntime onnxruntime-genai phi3 qnn stable-diffusion visual-studio whisper winappsdk windows winui3 wpf
Last synced: 14 May 2025
https://github.com/tmoroney/auto-subs
Generate Subtitles & Diarize Speakers in Davinci Resolve using AI.
ai davinci davinci-19 davinci-resolve diarize openai pyannote resolve speaker speech-to-text subtitles subtitles-generator transcribe whisper
Last synced: 13 Apr 2025
https://github.com/softcatala/whisper-ctranslate2
Whisper command line client compatible with original OpenAI client based on CTranslate2.
openai- openai-whisper speech-recognition speech-to-text whisper
Last synced: 14 May 2025
https://github.com/yaofanguk/video-subtitle-generator
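Generates subtitles for video and audio as SRT files. No third-party API key is required; audio-to-text conversion runs locally. A Transformer-based video subtitle generation framework. A GUI tool for generating subtitles from videos and producing SRT files.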
audio2text generation srt subtitle transcription whisper
Last synced: 16 May 2025
https://github.com/basetenlabs/truss
The simplest way to serve AI/ML models in production
artificial-intelligence easy-to-use falcon inference-api inference-server machine-learning model-serving open-source packaging stable-diffusion whisper wizardlm
Last synced: 13 May 2025
https://github.com/yeyupiaoling/Whisper-Finetune
Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deployment, Windows desktop deployment, and Android deployment
android asr chinese ctranslate2 huggingface lora pytorch speech-recognition transformers web whisper
Last synced: 08 Feb 2025
https://github.com/ardha27/ai-waifu-vtuber
AI Vtuber for Streaming on Youtube/Twitch
ai-vtuber ai-waifu deepl openai speech-recognition speech-synthesis speech-to-text tts voicevox vtuber whisper
Last synced: 12 Apr 2025
https://github.com/substratusai/kubeai
AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports VLMs, LLMs, embeddings, and speech-to-text.
ai autoscaler faster-whisper inference-operator k8s kubernetes llm ollama ollama-operator openai-api vllm vllm-operator whisper
Last synced: 15 May 2025
https://github.com/Softcatala/whisper-ctranslate2
Whisper command line client compatible with original OpenAI client based on CTranslate2.
openai- openai-whisper speech-recognition speech-to-text whisper
Last synced: 01 Apr 2025
https://github.com/innovatorved/whisper.api
This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR model.
asr hacktoberfest innovatorved transcribe whisper
Last synced: 04 Apr 2025
https://github.com/twitchlib/twitchlib
C# Twitch Chat, Whisper, API and PubSub Library. Allows for chatting, whispering, stream event subscription and channel/account modification. Supports everything that supports .NETStandard 2.0
api bot chat client csharp events pubsub twitch whisper
Last synced: 14 May 2025
https://github.com/TwitchLib/TwitchLib
C# Twitch Chat, Whisper, API and PubSub Library. Allows for chatting, whispering, stream event subscription and channel/account modification. Supports everything that supports .NETStandard 2.0
api bot chat client csharp events pubsub twitch whisper
Last synced: 10 May 2025
https://github.com/YaoFANGUK/video-subtitle-generator
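Generates subtitles for video and audio as SRT files. No third-party API key is required; audio-to-text conversion runs locally. A Transformer-based video subtitle generation framework. A GUI tool for generating subtitles from videos and producing SRT files.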
audio2text generation srt subtitle transcription whisper
Last synced: 20 Nov 2024
https://github.com/aschmelyun/subvert
Generate subtitles, summaries, and chapters from videos in seconds
chatgpt openai transcription translation video-editing whisper
Last synced: 15 May 2025
https://github.com/transcriptionstream/transcriptionstream
Turnkey self-hosted offline transcription and diarization service with LLM summary
automation diarization llm mistral-7b ollama speaker-diarization speech-recognition transcription whisper whisperx
Last synced: 07 Apr 2025
https://github.com/Saik0s/Whisperboard
The open-source iOS app that's making quality voice transcription more accessible on mobile devices.
audio-to-text composable-architecture ios openai speech-recognition speech-to-text swiftui tca transcription tuist whisper whisper-cpp
Last synced: 19 Apr 2025
https://github.com/saik0s/whisperboard
The open-source iOS app that's making quality voice transcription more accessible on mobile devices.
audio-to-text composable-architecture ios openai speech-recognition speech-to-text swiftui tca transcription tuist whisper whisper-cpp
Last synced: 07 Apr 2025
https://github.com/saharmor/whisper-playground
Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/
machine-learning openai speech-recognition speech-to-text whisper
Last synced: 12 Apr 2025
https://github.com/go-graphite/go-carbon
Golang implementation of Graphite/Carbon server with classic architecture: Agent -> Cache -> Persister
carbon devops graphite hacktoberfest timeseries whisper
Last synced: 13 May 2025
https://github.com/mayeaux/generate-subtitles
Generate transcripts for audio and video content with a user-friendly UI, powered by OpenAI's Whisper, with automatic translations and automatic video downloads through yt-dlp integration
expressjs gpu libretranslate machine-learning nodejs transcription translation whisper yt-dlp
Last synced: 13 Apr 2025
https://github.com/srcnalt/openai-unity
An unofficial OpenAI Unity package that aims to help you use the OpenAI API directly in the Unity game engine.
chatgpt dalle openai openai-api unity unity3d whisper
Last synced: 14 Apr 2025
https://github.com/srcnalt/OpenAI-Unity
An unofficial OpenAI Unity package that aims to help you use the OpenAI API directly in the Unity game engine.
chatgpt dalle openai openai-api unity unity3d whisper
Last synced: 11 Mar 2025