Projects in Awesome Lists tagged with whisper

https://github.com/ggml-org/whisper.cpp

Port of OpenAI's Whisper model in C/C++

inference openai speech-recognition speech-to-text transformer whisper

Last synced: 12 May 2025

https://github.com/ggerganov/whisper.cpp

Port of OpenAI's Whisper model in C/C++

inference openai speech-recognition speech-to-text transformer whisper

Last synced: 01 Apr 2025

https://github.com/systran/faster-whisper

Faster Whisper transcription with CTranslate2

deep-learning inference openai quantization speech-recognition speech-to-text transformer whisper

Last synced: 12 May 2025

https://github.com/m-bain/whisperx

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

asr speech speech-recognition speech-to-text whisper

Last synced: 12 May 2025

https://github.com/SYSTRAN/faster-whisper

Faster Whisper transcription with CTranslate2

deep-learning inference openai quantization speech-recognition speech-to-text transformer whisper

Last synced: 24 Mar 2025

https://github.com/chidiwilliams/buzz

Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.

whisper

Last synced: 12 May 2025

https://github.com/chidiwilliams/Buzz

Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.

whisper

Last synced: 01 Apr 2025

https://github.com/guillaumekln/faster-whisper

Faster Whisper transcription with CTranslate2

deep-learning inference openai quantization speech-recognition speech-to-text transformer whisper

Last synced: 14 Dec 2024

https://github.com/paddlepaddle/paddlespeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

asr code-switch conformer kws punctuation-restoration self-supervised-learning sound-classification speech-alignment speech-recognition speech-synthesis speech-translation streaming-asr streaming-tts transformer tts vocoder voice-cloning voice-recognition wav2vec2 whisper

Last synced: 12 May 2025

https://github.com/PaddlePaddle/PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

asr code-switch conformer kws punctuation-restoration self-supervised-learning sound-classification speech-alignment speech-recognition speech-synthesis speech-translation streaming-asr streaming-tts transformer tts vocoder voice-cloning voice-recognition wav2vec2 whisper

Last synced: 24 Mar 2025

https://github.com/modelscope/funasr

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

audio-visual-speech-recognition conformer dfsmn paraformer pretrained-model punctuation pytorch rnnt speaker-diarization speech-recognition speechgpt speechllm vad voice-activity-detection whisper

Last synced: 16 May 2025

https://github.com/m-bain/whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

asr speech speech-recognition speech-to-text whisper

Last synced: 14 Mar 2025

https://github.com/niedev/rtranslator

Open source real-time translation app for Android that runs locally

android android-app bluetooth-le mobile-app nllb offline onnx onnxruntime realtime-translator sentencepiece transformers translation translator whisper

Last synced: 13 May 2025

https://github.com/xorbitsai/inference

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.

artificial-intelligence chatglm deployment flan-t5 gemma ggml glm4 inference llama llama3 llamacpp llm machine-learning mistral openai-api pytorch qwen vllm whisper wizardlm

Last synced: 13 May 2025

https://github.com/niedev/RTranslator

Open source real-time translation app for Android that runs locally

android android-app bluetooth-le mobile-app nllb offline onnx onnxruntime realtime-translator sentencepiece transformers translation translator whisper

Last synced: 24 Mar 2025

https://github.com/modelscope/FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

audio-visual-speech-recognition conformer dfsmn paraformer pretrained-model punctuation pytorch rnnt speaker-diarization speech-recognition speechgpt speechllm vad voice-activity-detection whisper

Last synced: 24 Mar 2025

https://github.com/zackriya-solutions/meeting-minutes

A free and open source, self hosted Ai based live meeting note taker and minutes summary generator that can completely run in your Local device (Mac OS and windows OS Support added. Working on adding linux support soon) https://meetily.zackriya.com/

ai automation cross-platform linux live llm mac macos-app meeting-minutes meeting-notes recorder rust transcript transcription whisper whisper-cpp windows

Last synced: 14 May 2025

https://github.com/sanchit-gandhi/whisper-jax

JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.

deep-learning jax speech-recognition speech-to-text whisper

Last synced: 14 May 2025

https://github.com/argmaxinc/whisperkit

On-device Speech Recognition for Apple Silicon

inference ios macos speech-recognition swift transformers visionos watchos whisper

Last synced: 13 May 2025

https://github.com/nexaai/nexa-sdk

Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language models (VLM), Audio Language Model, auto-speech-recognition (ASR), and text-to-speech (TTS) capabilities.

asr audio edge-computing language-model llm on-device-ai on-device-ml sdk sdk-python stable-diffusion transformers tts vlm whisper

Last synced: 11 May 2025

https://github.com/mahmoudashraf97/whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

asr speaker-diarization speech speech-recognition speech-to-text whisper

Last synced: 13 May 2025

https://github.com/argmaxinc/WhisperKit

On-device Speech Recognition for Apple Silicon

inference ios macos speech-recognition swift transformers visionos watchos whisper

Last synced: 28 Mar 2025

https://github.com/NexaAI/nexa-sdk

Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language models (VLM), Audio Language Model, auto-speech-recognition (ASR), and text-to-speech (TTS) capabilities.

asr audio edge-computing language-model llm on-device-ai on-device-ml sdk sdk-python stable-diffusion transformers tts vlm whisper

Last synced: 07 Feb 2025

https://github.com/MahmoudAshraf97/whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

asr speaker-diarization speech speech-recognition speech-to-text whisper

Last synced: 28 Mar 2025

https://github.com/leetcode-mafia/cheetah

Mac app for crushing tech interviews with AI

ai chatgpt gpt gpt-4 openai swift swiftui whisper whisper-cpp

Last synced: 14 May 2025

https://github.com/wenet-e2e/wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

asr automatic-speech-recognition conformer e2e-models production-ready pytorch speech-recognition transformer whisper

Last synced: 13 May 2025

https://github.com/huggingface/distil-whisper

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

audio speech-recognition whisper

Last synced: 29 Apr 2025

https://github.com/embarklabs/embark

Framework for serverless Decentralized Applications using Ethereum, IPFS and other platforms

blockchain dapp decentralized ethereum framework ipfs serverless smart-contracts swarm whisper

Last synced: 28 Apr 2025

https://github.com/embark-framework/embark

Framework for serverless Decentralized Applications using Ethereum, IPFS and other platforms

blockchain dapp decentralized ethereum framework ipfs serverless smart-contracts swarm whisper

Last synced: 03 Mar 2025

https://iurimatias.github.io/embark-framework

Framework for serverless Decentralized Applications using Ethereum, IPFS and other platforms

blockchain dapp decentralized ethereum framework ipfs serverless smart-contracts swarm whisper

Last synced: 18 Feb 2025

https://github.com/iurimatias/embark-framework

Framework for serverless Decentralized Applications using Ethereum, IPFS and other platforms

blockchain dapp decentralized ethereum framework ipfs serverless smart-contracts swarm whisper

Last synced: 10 Feb 2025

https://github.com/abus-aikorea/voice-pro

Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.

audiobook faster-whisper gradio karaoke podcasts speech-recognition speech-synthesis speech-to-text subtitles text-to-speech transcription translator tts voice-cloning voice-conversion webui whisper whisperx yt-dlp

Last synced: 14 May 2025

https://github.com/grt1228/chatgpt-java

ChatGPT Java SDK支持流式输出、Gpt插件、联网。支持OpenAI官方所有接口。ChatGPT的Java客户端。OpenAI GPT-3.5-Turb GPT-4 Api Client for Java

chatgpt chatgpt-java gpt-35-turbo gpt-4 gpt-plugins java openai-api openai-chatgpt openai-images openai-whisper tiktoken-java whisper

Last synced: 10 Apr 2025

https://github.com/Grt1228/chatgpt-java

ChatGPT Java SDK支持流式输出、Gpt插件、联网。支持OpenAI官方所有接口。ChatGPT的Java客户端。OpenAI GPT-3.5-Turb GPT-4 Api Client for Java

chatgpt chatgpt-java gpt-35-turbo gpt-4 gpt-plugins java openai-api openai-chatgpt openai-images openai-whisper tiktoken-java whisper

Last synced: 02 Apr 2025

https://github.com/n3d1117/chatgpt-telegram-bot

🤖 A Telegram bot that integrates with OpenAI's official ChatGPT APIs to provide answers, written in Python

chatgpt dall-e openai python telegram-bot whisper

Last synced: 13 May 2025

https://github.com/alexrudall/ruby-openai

OpenAI API + Ruby! 🤖❤️ Now with Responses API + DeepSeek!

ai api-client chatgpt dall-e gpt-4 gpt-4o o1 openai rails ruby whisper

Last synced: 12 May 2025

https://github.com/betalgo/openai

.NET library for the OpenAI service API by Betalgo Ranul

azure-openai chatgpt csharp dall-e dotnet gpt-3 gpt-4 openai openai-api ranul sdk tinga whisper whisper-ai

Last synced: 29 Apr 2025

https://github.com/xenova/whisper-web

ML-powered speech recognition directly in your browser

javascript transformers whisper

Last synced: 15 May 2025

https://github.com/samuraigpt/embedai

An app to interact privately with your documents using the power of GPT, 100% privately, no data leaks

chatbot chatgpt embedai embeddings generative gpt gpt4 gpt4all langchain models openai privategpt vectorstore whisper

Last synced: 15 May 2025

https://github.com/SamurAIGPT/EmbedAI

An app to interact privately with your documents using the power of GPT, 100% privately, no data leaks

chatbot chatgpt embedai embeddings generative gpt gpt4 gpt4all langchain models openai privategpt vectorstore whisper

Last synced: 14 Mar 2025

https://github.com/heywillow/willow

Open source, local, and self-hosted Amazon Echo/Google Home competitive Voice Assistant alternative

alexa deep-learning echo esp-adf esp-idf esp32 google-home home-assistant home-automation privacy speech-recognition speech-to-text whisper

Last synced: 14 May 2025

https://github.com/HeyWillow/willow

Open source, local, and self-hosted Amazon Echo/Google Home competitive Voice Assistant alternative

alexa deep-learning echo esp-adf esp-idf esp32 google-home home-assistant home-automation privacy speech-recognition speech-to-text whisper

Last synced: 04 Apr 2025

https://github.com/toverainc/willow

Open source, local, and self-hosted Amazon Echo/Google Home competitive Voice Assistant alternative

alexa deep-learning echo esp-adf esp-idf esp32 google-home home-assistant home-automation privacy speech-recognition speech-to-text whisper

Last synced: 27 Mar 2025

https://github.com/thewh1teagle/vibe

Transcribe on your own!

ai cross-platform desktop openai rust transcribe whisper

Last synced: 14 May 2025

https://github.com/linto-ai/whisper-timestamped

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

asr attention-is-all-you-need attention-mechanism attention-model attention-network attention-seq2seq attention-visualization deep-learning machine-learning multilingual-models python python3 pytorch speaker-diarization speech speech-processing speech-recognition speech-to-text transformers whisper

Last synced: 13 May 2025

https://github.com/chenyme/chenyme-aavt

这是一个全自动（音频）视频翻译项目。利用Whisper识别声音，AI大模型翻译字幕，最后合并字幕视频，生成翻译后的视频。

faster-whisper gpt-4 gpt-4o speech-recognition video-translation whisper

Last synced: 14 May 2025

https://github.com/buxuku/smartsub

「妙幕」是一款跨平台客户端工具，可以批量为视频或者音频生成字幕文件，并支持对字幕进行翻译，支持百度、火山、openai、ollama、deepseek 等多家翻译

deepseek electron nodejs ollama openai subtitle translate whisper whisper-cpp

Last synced: 14 May 2025

https://github.com/cheshirecc/faster-whisper-gui

faster_whisper GUI with PySide6

asr faster-whisper openai transcribe vad voice-transcription whisper whisperx

Last synced: 14 May 2025

https://github.com/chenyme/Chenyme-AAVT

这是一个全自动（音频）视频翻译项目。利用Whisper识别声音，AI大模型翻译字幕，最后合并字幕视频，生成翻译后的视频。

faster-whisper gpt-4 gpt-4o speech-recognition video-translation whisper

Last synced: 16 Mar 2025

https://github.com/pluja/whishper

Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!

ai audio-to-text golang speech-recognition speech-to-text stt subtitles sveltekit transcription ui web web-whisper webapp whisper

Last synced: 14 May 2025

https://github.com/collabora/whisperlive

A nearly-live implementation of OpenAI's Whisper.

dictation obs openai tensorrt tensorrt-llm text-to-speech translation voice-recognition whisper whisper-tensorrt

Last synced: 09 Apr 2025

https://github.com/collabora/WhisperLive

A nearly-live implementation of OpenAI's Whisper.

dictation obs openai tensorrt tensorrt-llm text-to-speech translation voice-recognition whisper whisper-tensorrt

Last synced: 07 Apr 2025

https://github.com/jhj0517/whisper-webui

A Web UI for easy subtitle using whisper model.

ai gradio open-source python pytorch web-ui whisper

Last synced: 14 May 2025

https://github.com/CheshireCC/faster-whisper-GUI

faster_whisper GUI with PySide6

asr faster-whisper openai transcribe vad voice-transcription whisper whisperx

Last synced: 17 Jan 2025

https://github.com/purfview/whisper-standalone-win

Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.

asr ctranslate2 diarization faster-whisper openai speaker-diarization speech-recognition speech-to-text subtitles transcriber uvr vocal-extractor whisper whisper-faster whisperx

Last synced: 14 May 2025

https://github.com/floneum/floneum

Instant, controllable, local pre-trained AI models in Rust

ai candle constrained-generation dioxus floneum-v3 kalosm llama llamacpp llm mistral rust transcription whisper

Last synced: 13 May 2025

https://github.com/m1guelpf/auto-subtitle

Automatically generate and overlay subtitles for any video.

ffmpeg openai-whisper subtitle-generator subtitles subtitles-generator whisper

Last synced: 14 May 2025

https://github.com/Purfview/whisper-standalone-win

Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.

asr ctranslate2 diarization faster-whisper openai speaker-diarization speech-recognition speech-to-text subtitles transcriber uvr vocal-extractor whisper whisper-faster whisperx

Last synced: 28 Mar 2025

https://github.com/fl33tw00d/whisper-turbo

Cross-Platform, GPU Accelerated Whisper 🏎️

audio machine-learning rust speech-recognition webgpu whisper windows

Last synced: 15 May 2025

https://github.com/FL33TW00D/whisper-turbo

Cross-Platform, GPU Accelerated Whisper 🏎️

audio machine-learning rust speech-recognition webgpu whisper windows

Last synced: 04 Apr 2025

https://github.com/jhj0517/Whisper-WebUI

A Web UI for easy subtitle using whisper model.

ai gradio open-source python pytorch web-ui whisper

Last synced: 06 Mar 2025

https://github.com/speaches-ai/speaches

docker docker-compose faster-whisper openai-api openai-whisper openai-whisper-translation transcription whisper whisper-ai

Last synced: 14 May 2025

https://github.com/Aallam/openai-kotlin

OpenAI API client for Kotlin with multiplatform and coroutines capabilities.

api chatgpt client coroutines dall-e gpt kotlin llm multiplatform openai whisper

Last synced: 24 Apr 2025

https://github.com/aallam/openai-kotlin

OpenAI API client for Kotlin with multiplatform and coroutines capabilities.

api chatgpt client coroutines dall-e gpt kotlin llm multiplatform openai whisper

Last synced: 14 May 2025

https://github.com/Chenyme/Chenyme-AAVT

这是一个全自动（音频）视频翻译项目。利用Whisper识别声音，AI大模型翻译字幕，最后合并字幕视频，生成翻译后的视频。

faster-whisper gpt-4 gpt-4o speech-recognition video-translation whisper

Last synced: 11 Apr 2025

https://github.com/harry0703/audionotes

快速提取音视频内容，整理成一份结构化的markdown笔记

ai asr funasr ollama python qwen2 whisper

Last synced: 08 Apr 2025

https://github.com/harry0703/AudioNotes

快速提取音视频内容，整理成一份结构化的markdown笔记

ai asr funasr ollama python qwen2 whisper

Last synced: 24 Mar 2025

https://github.com/absadiki/subsai

🎞️ Subtitles generation tool (Web-UI + CLI + Python package) powered by OpenAI's Whisper and its variants 🎞️

cli subtitles subtitles-generator webui whisper whisper-ai

Last synced: 14 May 2025

https://github.com/m1guelpf/yt-whisper

Using OpenAI's Whisper to automatically generate YouTube subtitles

ffmpeg openai openai-whisper subtitles subtitles-generated transcribe whisper youtube youtube-dl

Last synced: 16 May 2025

https://github.com/umlx5h/LLPlayer

The media player for language learning, with dual subtitles, AI-generated subtitles, real-time translation, and more!

asr csharp faster-whisper flyleaf language-learning llm media-player ocr ollama player video video-player whisper wpf yt-dlp

Last synced: 21 Apr 2025

https://github.com/abdeladim-s/subsai

🎞️ Subtitles generation tool (Web-UI + CLI + Python package) powered by OpenAI's Whisper and its variants 🎞️

cli subtitles subtitles-generator webui whisper whisper-ai

Last synced: 12 Dec 2024

https://github.com/vercel/modelfusion

The TypeScript library for building AI applications.

ai artificial-intelligence chatbot claude dall-e embedding gpt-3 huggingface javascript js llamacpp llm mistral multi-modal ollama openai stable-diffusion ts typescript whisper

Last synced: 15 May 2025

https://github.com/graphite-project/whisper

Whisper is a file-based time-series database format for Graphite.

graphite graphite-components library metrics python time-series whisper

Last synced: 14 May 2025

https://github.com/lenml/speech-ai-forge

🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.

agent asr chattts chattts-forge chinese colab cosy-voice cosyvoice english firered fireredtts fish-speech gpt llama llm ssml stt text-to-speech tts whisper

Last synced: 15 May 2025

https://github.com/ntegrals/aura-voice

Aura is like Siri, but in your browser. An AI voice assistant optimized for low latency responses.

artificial-intelligence elevenlabs gpt-3 gpt-4 langchain nextjs openai vercel whisper whisper-cpp

Last synced: 14 May 2025

https://github.com/robitx/gp.nvim

Gp.nvim (GPT prompt) Neovim AI plugin: ChatGPT sessions & Instructable text/code operations & Speech to text [OpenAI, Ollama, Anthropic, ..]

claude codeium copilot gemini gpt-4o gpt4o llm lua mistral neovim nvim ollama parrot perplexity sonnet speech-to-text stt vim voice whisper

Last synced: 14 May 2025

https://github.com/lgrammel/ai-utils.js

The TypeScript library for building AI applications.

ai artificial-intelligence chatbot claude dall-e embedding gpt-3 huggingface javascript js llamacpp llm mistral multi-modal ollama openai stable-diffusion ts typescript whisper

Last synced: 03 Mar 2025

https://github.com/yeyupiaoling/whisper-finetune

Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deployment, Windows desktop deployment, and Android deployment

android asr chinese ctranslate2 huggingface lora pytorch speech-recognition transformers web whisper

Last synced: 14 May 2025

https://github.com/microsoft/ai-dev-gallery

An open-source project for Windows developers to learn how to add AI with local models and APIs to Windows apps.

ai csharp developer-tools directml dotnet genai mistral npu onnx onnxruntime onnxruntime-genai phi3 qnn stable-diffusion visual-studio whisper winappsdk windows winui3 wpf

Last synced: 14 May 2025

https://github.com/tmoroney/auto-subs

Generate Subtitles & Diarize Speakers in Davinci Resolve using AI.

ai davinci davinci-19 davinci-resolve diarize openai pyannote resolve speaker speech-to-text subtitles subtitles-generator transcribe whisper

Last synced: 13 Apr 2025

https://github.com/softcatala/whisper-ctranslate2

Whisper command line client compatible with original OpenAI client based on CTranslate2.

openai- openai-whisper speech-recognition speech-to-text whisper

Last synced: 14 May 2025

https://github.com/yaofanguk/video-subtitle-generator

视频音频生成字幕，生成srt文件。无需申请第三方API，本地实现音频转文本。基于Transformer的视频字幕生成框架。A GUI tool for generating subtitle from videos and generating srt files.

audio2text generation srt subtitle transcription whisper

Last synced: 16 May 2025

https://github.com/basetenlabs/truss

The simplest way to serve AI/ML models in production

artificial-intelligence easy-to-use falcon inference-api inference-server machine-learning model-serving open-source packaging stable-diffusion whisper wizardlm

Last synced: 13 May 2025

https://github.com/yeyupiaoling/Whisper-Finetune

Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deployment, Windows desktop deployment, and Android deployment

android asr chinese ctranslate2 huggingface lora pytorch speech-recognition transformers web whisper

Last synced: 08 Feb 2025

https://github.com/ardha27/ai-waifu-vtuber

AI Vtuber for Streaming on Youtube/Twitch

ai-vtuber ai-waifu deepl openai speech-recognition speech-synthesis speech-to-text tts voicevox vtuber whisper

Last synced: 12 Apr 2025

https://github.com/substratusai/kubeai

AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports VLMs, LLMs, embeddings, and speech-to-text.

ai autoscaler faster-whisper inference-operator k8s kubernetes llm ollama ollama-operator openai-api vllm vllm-operator whisper

Last synced: 15 May 2025

https://github.com/Softcatala/whisper-ctranslate2

Whisper command line client compatible with original OpenAI client based on CTranslate2.

openai- openai-whisper speech-recognition speech-to-text whisper

Last synced: 01 Apr 2025

https://github.com/innovatorved/whisper.api

This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR model.

asr hacktoberfest innovatorved transcribe whisper

Last synced: 04 Apr 2025

https://github.com/twitchlib/twitchlib

C# Twitch Chat, Whisper, API and PubSub Library. Allows for chatting, whispering, stream event subscription and channel/account modification. Supports everything that supports .NETStandard 2.0

api bot chat client csharp events pubsub twitch whisper

Last synced: 14 May 2025

https://github.com/TwitchLib/TwitchLib

C# Twitch Chat, Whisper, API and PubSub Library. Allows for chatting, whispering, stream event subscription and channel/account modification. Supports everything that supports .NETStandard 2.0

api bot chat client csharp events pubsub twitch whisper

Last synced: 10 May 2025

https://github.com/YaoFANGUK/video-subtitle-generator

视频音频生成字幕，生成srt文件。无需申请第三方API，本地实现音频转文本。基于Transformer的视频字幕生成框架。A GUI tool for generating subtitle from videos and generating srt files.

audio2text generation srt subtitle transcription whisper

Last synced: 20 Nov 2024

https://github.com/aschmelyun/subvert

Generate subtitles, summaries, and chapters from videos in seconds

chatgpt openai transcription translation video-editing whisper

Last synced: 15 May 2025

https://github.com/transcriptionstream/transcriptionstream

turnkey self-hosted offline transcription and diarization service with llm summary

automation diarization llm mistral-7b ollama speaker-diarization speech-recognition transcription whisper whisperx

Last synced: 07 Apr 2025

https://github.com/Saik0s/Whisperboard

The open-source iOS app that's making quality voice transcription more accessible on mobile devices.

audio-to-text composable-architecture ios openai speech-recognition speech-to-text swiftui tca transcription tuist whisper whisper-cpp

Last synced: 19 Apr 2025

https://github.com/saik0s/whisperboard

The open-source iOS app that's making quality voice transcription more accessible on mobile devices.

audio-to-text composable-architecture ios openai speech-recognition speech-to-text swiftui tca transcription tuist whisper whisper-cpp

Last synced: 07 Apr 2025

https://github.com/saharmor/whisper-playground

Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/

machine-learning openai speech-recognition speech-to-text whisper

Last synced: 12 Apr 2025

https://github.com/go-graphite/go-carbon

Golang implementation of Graphite/Carbon server with classic architecture: Agent -> Cache -> Persister

carbon devops graphite hacktoberfest timeseries whisper

Last synced: 13 May 2025

https://github.com/mayeaux/generate-subtitles

Generate transcripts for audio and video content with a user friendly UI, powered by Open AI's Whisper with automatic translations and download videos automatically with yt-dlp integration

expressjs gpu libretranslate machine-learning nodejs transcription translation whisper yt-dlp

Last synced: 13 Apr 2025

https://github.com/srcnalt/openai-unity

An unofficial OpenAI Unity Package that aims to help you use OpenAI API directly in Unity Game engine.

chatgpt dalle openai openai-api unity unity3d whisper

Last synced: 14 Apr 2025

https://github.com/srcnalt/OpenAI-Unity

An unofficial OpenAI Unity Package that aims to help you use OpenAI API directly in Unity Game engine.

chatgpt dalle openai openai-api unity unity3d whisper

Last synced: 11 Mar 2025