Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Whisper

Whisper is an autoregressive language model developed by OpenAI. It is trained on a large corpus of text using a transformer architecture and is capable of generating high-quality natural language text. Whisper can be used for tasks such as language modeling, text completion, and text generation. It has shown impressive performance on various benchmarks and has been released by OpenAI to encourage research in the field of language modeling. Whisper is not yet available for public use, but it has the potential to transform the field of natural language processing and generate new opportunities for language-based applications.

https://github.com/ggerganov/whisper.cpp

Port of OpenAI's Whisper model in C/C++

inference openai speech-recognition speech-to-text transformer whisper

Last synced: 30 Jul 2024

https://github.com/chidiwilliams/Buzz

Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.

whisper

Last synced: 01 Aug 2024

https://github.com/chidiwilliams/buzz

Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.

whisper

Last synced: 30 Jul 2024

https://github.com/PaddlePaddle/PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

asr code-switch conformer kws punctuation-restoration self-supervised-learning sound-classification speech-alignment speech-recognition speech-synthesis speech-translation streaming-asr streaming-tts transformer tts vocoder voice-cloning voice-recognition wav2vec2 whisper

Last synced: 31 Jul 2024

https://github.com/m-bain/whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

asr speech speech-recognition speech-to-text whisper

Last synced: 30 Jul 2024

https://github.com/sanchit-gandhi/whisper-jax

JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.

deep-learning jax speech-recognition speech-to-text whisper

Last synced: 31 Jul 2024

https://github.com/wenet-e2e/wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

asr automatic-speech-recognition conformer e2e-models production-ready pytorch speech-recognition transformer whisper

Last synced: 01 Aug 2024

https://github.com/xorbitsai/inference

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.

artificial-intelligence chatglm deployment flan-t5 gemma ggml glm4 inference llama llama3 llamacpp llm machine-learning mistral openai-api pytorch qwen vllm whisper wizardlm

Last synced: 31 Jul 2024

https://github.com/leetcode-mafia/cheetah

Mac app for crushing remote tech interviews with AI

ai chatgpt gpt gpt-4 openai swift swiftui whisper whisper-cpp

Last synced: 01 Aug 2024

https://github.com/iurimatias/embark-framework

Framework for serverless Decentralized Applications using Ethereum, IPFS and other platforms

blockchain dapp decentralized ethereum framework ipfs serverless smart-contracts swarm whisper

Last synced: 09 Aug 2024

https://github.com/embarklabs/embark

Framework for serverless Decentralized Applications using Ethereum, IPFS and other platforms

blockchain dapp decentralized ethereum framework ipfs serverless smart-contracts swarm whisper

Last synced: 30 Jul 2024

https://github.com/embark-framework/embark

Framework for serverless Decentralized Applications using Ethereum, IPFS and other platforms

blockchain dapp decentralized ethereum framework ipfs serverless smart-contracts swarm whisper

Last synced: 01 Aug 2024

https://github.com/huggingface/distil-whisper

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

audio speech-recognition whisper

Last synced: 31 Jul 2024

https://github.com/grt1228/chatgpt-java

ChatGPT Java SDK支持流式输出、Gpt插件、联网。支持OpenAI官方所有接口。ChatGPT的Java客户端。OpenAI GPT-3.5-Turb GPT-4 Api Client for Java

chatgpt chatgpt-java gpt-35-turbo gpt-4 gpt-plugins java openai-api openai-chatgpt openai-images openai-whisper tiktoken-java whisper

Last synced: 02 Aug 2024

https://github.com/Grt1228/chatgpt-java

ChatGPT Java SDK支持流式输出、Gpt插件、联网。支持OpenAI官方所有接口。ChatGPT的Java客户端。OpenAI GPT-3.5-Turb GPT-4 Api Client for Java

chatgpt chatgpt-java gpt-35-turbo gpt-4 gpt-plugins java openai-api openai-chatgpt openai-images openai-whisper tiktoken-java whisper

Last synced: 01 Aug 2024

https://github.com/n3d1117/chatgpt-telegram-bot

🤖 A Telegram bot that integrates with OpenAI's official ChatGPT APIs to provide answers, written in Python

chatgpt dall-e openai python telegram-bot whisper

Last synced: 01 Aug 2024

https://github.com/SamurAIGPT/EmbedAI

An app to interact privately with your documents using the power of GPT, 100% privately, no data leaks

chatbot chatgpt embedai embeddings generative gpt gpt4 gpt4all langchain models openai privategpt vectorstore whisper

Last synced: 30 Jul 2024

https://github.com/betalgo/openai

OpenAI .NET sdk - ChatGPT, Whisper, GPT-3, GPT-4, Azure OpenAI and DALL-E

azure-openai chatgpt csharp dall-e dotnet gpt-3 gpt-4 openai openai-api sdk whisper whisper-ai

Last synced: 01 Aug 2024

https://github.com/alexrudall/ruby-openai

OpenAI API + Ruby! 🤖❤️ NEW: Assistant Vector Stores

ai api-client chatgpt dall-e gpt-3 gpt-4 openai rails ruby whisper

Last synced: 30 Jul 2024

https://github.com/MahmoudAshraf97/whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

asr speaker-diarization speech speech-recognition speech-to-text whisper

Last synced: 31 Jul 2024

https://github.com/toverainc/willow

Open source, local, and self-hosted Amazon Echo/Google Home competitive Voice Assistant alternative

alexa deep-learning echo esp-adf esp-idf esp32 google-home home-assistant home-automation privacy speech-recognition speech-to-text whisper

Last synced: 01 Aug 2024

https://github.com/argmaxinc/WhisperKit

Swift native on-device speech recognition with Whisper for Apple Silicon

inference ios macos pretrained-models speech-recognition swift transformers visionos watchos whisper

Last synced: 31 Jul 2024

https://github.com/FL33TW00D/whisper-turbo

Cross-Platform, GPU Accelerated Whisper 🏎️

audio machine-learning rust speech-recognition webgpu whisper windows

Last synced: 01 Aug 2024

https://github.com/Aallam/openai-kotlin

OpenAI API client for Kotlin with multiplatform and coroutines capabilities.

api chatgpt client coroutines dall-e gpt kotlin llm multiplatform openai whisper

Last synced: 02 Aug 2024

https://github.com/aallam/openai-kotlin

OpenAI API client for Kotlin with multiplatform and coroutines capabilities.

api chatgpt client coroutines dall-e gpt kotlin llm multiplatform openai whisper

Last synced: 01 Aug 2024

https://github.com/m1guelpf/auto-subtitle

Automatically generate and overlay subtitles for any video.

ffmpeg openai-whisper subtitle-generator subtitles subtitles-generator whisper

Last synced: 01 Aug 2024

https://github.com/m1guelpf/yt-whisper

Using OpenAI's Whisper to automatically generate YouTube subtitles

ffmpeg openai openai-whisper subtitles subtitles-generated transcribe whisper youtube youtube-dl

Last synced: 01 Aug 2024

https://github.com/graphite-project/whisper

Whisper is a file-based time-series database format for Graphite.

graphite graphite-components library metrics python time-series whisper

Last synced: 31 Jul 2024

https://github.com/abdeladim-s/subsai

🎞️ Subtitles generation tool (Web-UI + CLI + Python package) powered by OpenAI's Whisper and its variants 🎞️

cli subtitles subtitles-generator webui whisper whisper-ai

Last synced: 31 Jul 2024

https://github.com/pluja/whishper

Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!

ai audio-to-text golang speech-recognition speech-to-text stt subtitles sveltekit transcription ui web web-whisper webapp whisper

Last synced: 30 Jul 2024

https://github.com/xenova/whisper-web

ML-powered speech recognition directly in your browser

javascript transformers whisper

Last synced: 01 Aug 2024

https://github.com/Chenyme/Chenyme-AAVT

这是一个全自动(音频)视频翻译项目。利用Whisper识别声音,AI大模型翻译字幕,最后合并字幕视频,生成翻译后的视频。

faster-whisper gpt-4 gpt-4o speech-recognition video-translation whisper

Last synced: 01 Aug 2024

https://github.com/innovatorved/whisper.api

This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR model.

asr hacktoberfest innovatorved transcribe whisper

Last synced: 01 Aug 2024

https://github.com/TwitchLib/TwitchLib

C# Twitch Chat, Whisper, API and PubSub Library. Allows for chatting, whispering, stream event subscription and channel/account modification. Supports everything that supports .NETStandard 2.0

api bot chat client csharp events pubsub twitch whisper

Last synced: 03 Aug 2024

https://github.com/Softcatala/whisper-ctranslate2

Whisper command line client compatible with original OpenAI client based on CTranslate2.

openai- openai-whisper speech-recognition speech-to-text whisper

Last synced: 01 Aug 2024

https://github.com/go-graphite/go-carbon

Golang implementation of Graphite/Carbon server with classic architecture: Agent -> Cache -> Persister

carbon devops graphite hacktoberfest timeseries whisper

Last synced: 29 Jul 2024

https://github.com/saharmor/whisper-playground

Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/

machine-learning openai speech-recognition speech-to-text whisper

Last synced: 30 Jul 2024

https://github.com/aschmelyun/subvert

Generate subtitles, summaries, and chapters from videos in seconds

chatgpt openai transcription translation video-editing whisper

Last synced: 31 Jul 2024

https://github.com/mayeaux/generate-subtitles

Generate transcripts for audio and video content with a user friendly UI, powered by Open AI's Whisper with automatic translations and download videos automatically with yt-dlp integration

expressjs gpu libretranslate machine-learning nodejs transcription translation whisper yt-dlp

Last synced: 01 Aug 2024

https://github.com/chengsokdara/use-whisper

React hook for OpenAI Whisper with speech recorder, real-time transcription, and silence removal built-in

api hook openai react real-time whisper

Last synced: 01 Aug 2024

https://github.com/shirayu/whispering

Streaming transcriber with whisper

automatic-speech-recognition whisper

Last synced: 01 Aug 2024

https://github.com/YaoFANGUK/video-subtitle-generator

视频音频生成字幕,生成srt文件。无需申请第三方API,本地实现音频转文本。基于Transformer的视频字幕生成框架。A GUI tool for generating subtitle from videos and generating srt files.

audio2text generation srt subtitle transcription whisper

Last synced: 04 Aug 2024

https://github.com/srcnalt/openai-unity

An unofficial OpenAI Unity Package that aims to help you use OpenAI API directly in Unity Game engine.

chatgpt dalle openai openai-api unity unity3d whisper

Last synced: 02 Aug 2024

https://github.com/mallorbc/whisper_mic

Project that allows one to use a microphone with OpenAI whisper.

microphone speech-recognition speech-to-text whisper whisper-ai whisper-api

Last synced: 02 Aug 2024

https://github.com/srcnalt/OpenAI-Unity

An unofficial OpenAI Unity Package that aims to help you use OpenAI API directly in Unity Game engine.

chatgpt dalle openai openai-api unity unity3d whisper

Last synced: 29 Jul 2024

https://github.com/mezbaul-h/june

Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit

ai assistant-chat-bots chatbot chatbots cli-app command-line-tool coqui-tts huggingface large-language-models llm python speech-recognition speech-to-text text-to-speech whisper

Last synced: 19 Aug 2024

https://github.com/Saik0s/Whisperboard

The open-source iOS app that's making quality voice transcription more accessible on mobile devices.

audio-to-text composable-architecture ios openai speech-recognition speech-to-text swiftui tca transcription tuist whisper whisper-cpp

Last synced: 02 Aug 2024

https://github.com/PlayVoice/lora-svc

singing voice change based on whisper, and lora for singing voice clone

lora singing-voice-conversion speech-to-sing uni-svc vits vits-svc voice-change voice-cloning voice-conversion whisper

Last synced: 01 Aug 2024

https://github.com/transcriptionstream/transcriptionstream

turnkey self-hosted offline transcription and diarization service with llm summary

automation diarization llm mistral-7b ollama speaker-diarization speech-recognition transcription whisper whisperx

Last synced: 01 Aug 2024

https://github.com/jina-ai/agentchain

Chain together LLMs for reasoning & orchestrate multiple large models for accomplishing complex tasks

artificial-intelligence blip langchain llm machine-learning multimodal nlproc stable-diffusion whisper

Last synced: 01 Aug 2024

https://github.com/exPHAT/SwiftWhisper

🎤 The easiest way to transcribe audio in Swift

ios macos openai speech-recognition speech-to-text swift transcription whisper whisper-cpp

Last synced: 01 Aug 2024

https://github.com/showlab/vlog

Transform Video as a Document with ChatGPT, CLIP, BLIP2, GRIT, Whisper, LangChain.

chatgpt langchain large-language-model video-language whisper

Last synced: 02 Aug 2024

https://github.com/dsymbol/decipher

Effortlessly add AI-generated transcription subtitles to your videos

openai transcription translation whisper

Last synced: 01 Aug 2024

https://github.com/OwlAIProject/Owl

A personal wearable AI that runs locally

ai ble bluetooth esp32 llama2 mistral nrf52840 ollama wearable whisper

Last synced: 05 Aug 2024

https://github.com/showlab/VLog

Transform Video as a Document with ChatGPT, CLIP, BLIP2, GRIT, Whisper, LangChain.

chatgpt langchain large-language-model video-language whisper

Last synced: 01 Aug 2024

https://github.com/harry0703/AudioNotes

快速提取音视频内容,整理成一份结构化的markdown笔记

ai asr funasr ollama python qwen2 whisper

Last synced: 31 Jul 2024

https://github.com/buxuku/VideoSubtitleGenerator

批量为本地视频生成字幕文件,并可将字幕文件翻译成其它语言, 跨平台支持 window, mac 系统

subtitle translate whisper whisper-cpp

Last synced: 30 Aug 2024

https://github.com/zh-plus/openlrc

Transcribe and translate voice into LRC file using Whisper and LLMs (GPT, Claude, et,al). 使用whisper和LLM(GPT,Claude等)来转录、翻译你的音频为字幕文件。

auto-subtitle faster-whisper lyrics lyrics-generator openai-api openlrc python speech-to-text subtitle-translation transcribe voice-to-text whisper

Last synced: 31 Jul 2024

https://github.com/Dadangdut33/Speech-Translate

A realtime speech transcription and translation application using Whisper OpenAI and free translation API. Interface made using Tkinter. Code written fully in Python.

python speech-transcription speech-translation tkinter-python translate whisper

Last synced: 04 Aug 2024

https://github.com/Dicklesworthstone/bulk_transcribe_youtube_videos_from_playlist

Easily take an entire YouTube playlist and turn it into high quality transcripts using Whisper.

playlists transcription transcripts whisper youtube

Last synced: 01 Aug 2024

https://github.com/chrislemke/chatfred

Alfred workflow using ChatGPT, DALL·E 2 and other models for chatting, image generation and more.

alfred-workflow alfredapp chatbot chatgpt dall-e2 gpt-3 gpt-4 image-generation openai stable-diffusion whisper

Last synced: 02 Aug 2024

https://github.com/chrislemke/ChatFred

Alfred workflow using ChatGPT, DALL·E 2 and other models for chatting, image generation and more.

alfred-workflow alfredapp chatbot chatgpt dall-e2 gpt-3 gpt-4 image-generation openai stable-diffusion whisper

Last synced: 01 Aug 2024

https://github.com/Bklieger/groqnotes

ScribeWizard: Generate organized notes from audio using Groq, Whisper, and Llama3

ai groq groq-api llama3 replit whisper

Last synced: 10 Aug 2024

https://github.com/Bklieger/ScribeWizard

ScribeWizard: Generate organized notes from audio using Groq, Whisper, and Llama3

ai groq groq-api llama3 replit whisper

Last synced: 05 Aug 2024

https://github.com/lspahija/aiui

AIUI is a platform enabling seamless two-way verbal communication with AI.

ai artificial-intelligence chatgpt chatgpt-api conversation conversational-ai gpt gpt-3 gpt-4 machine-learning speech whisper whisper-ai

Last synced: 02 Aug 2024

https://github.com/lspahija/AIUI

AIUI is a platform enabling seamless two-way verbal communication with AI.

ai artificial-intelligence chatgpt chatgpt-api conversation conversational-ai gpt gpt-3 gpt-4 machine-learning speech whisper whisper-ai

Last synced: 01 Aug 2024

https://github.com/yohasebe/openai-chat-api-workflow

🎩 An Alfred 5 Workflow for using OpenAI Chat API to interact with GPT-3.5/GPT-4 🤖💬 It also allows image generation 🖼️, image understanding 👀, speech-to-text conversion 🎤, and text-to-speech synthesis 🔈

ai alfred chatbot dall-e gpt image-generation image-understanding openai speech-to-text text-to-speech whisper workflow

Last synced: 31 Jul 2024

https://github.com/Stage-Whisper/Stage-Whisper

The main repo for Stage Whisper — a free, secure, and easy-to-use transcription app for journalists, powered by OpenAI's Whisper automatic speech recognition (ASR) machine learning models.

ai-transcription audio-transcription electron-app hacktoberfest journalism openai openai-whisper whisper

Last synced: 07 Aug 2024

https://github.com/Kabanosk/whisper-website

Simple web application, which can be used to convert audio to subtitles by OpenAI's Whisper model

audio-to-text fastapi hacktoberfest open-source openai python3 speech-to-text subtitles subtitles-generator uvicorn website whisper

Last synced: 01 Aug 2024

https://github.com/shashikg/WhisperS2T

An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine

asr deep-learning speech-recognition speech-to-text tensorrt tensorrt-llm vad voice-activity-detection whisper

Last synced: 03 Aug 2024

https://github.com/ioanmo226/chatgpt-web-application

A web application that allows users to interact with various OpenAI's models through a simple and user-friendly interface.

ai audio-text chatgpt chatgpt-clone dalle dalle2 davinci-003 express gpt3 highlight-js image-generation markdown-to-html openai whisper

Last synced: 29 Jul 2024

https://github.com/ariym/whisper-node

Node.js bindings for OpenAI's Whisper. (C++ CPU version by ggerganov)

ai cpp ffmpeg ml nodejs openai typescript whisper

Last synced: 06 Aug 2024

https://github.com/xf00f/web3x

Ethereum TypeScript Client Library - for perfect types and tiny builds.

api ethereum javascript swarm typescript web3 web3js whisper

Last synced: 03 Aug 2024

https://github.com/URUWorks/TeroSubtitler

Tero Subtitler is an open source, cross-platform, and free subtitle editing software.

ai audio-to-text blu-ray captions editor ffmpeg free linux macos mpv open-source smpte subtitle-editor subtitler subtitles tero transcription whisper windows yt-dlp

Last synced: 01 Aug 2024

https://github.com/robitx/gp.nvim

Gp.nvim (GPT prompt) Neovim AI plugin: ChatGPT sessions & Instructable text/code operations & Speech to text [OpenAI]

ai chatgpt codeium copilot cursor gpt gpt-4 gpt4 llm lua neovim nvim openai plugin speech-to-text tabnine vim voice whisper

Last synced: 02 Aug 2024

https://github.com/nikdanilov/whisper-obsidian-plugin

Speech-to-text in Obsidian using OpenAI Whisper

obsidian openai-whisper speech-to-text stt transcribe voice whisper

Last synced: 13 Aug 2024

https://github.com/Robitx/gp.nvim

Gp.nvim (GPT prompt) Neovim AI plugin: ChatGPT sessions & Instructable text/code operations & Speech to text [OpenAI]

ai chatgpt codeium copilot cursor gpt gpt-4 gpt4 llm lua neovim nvim openai plugin speech-to-text tabnine vim voice whisper

Last synced: 30 Jul 2024

https://github.com/pluja/web-whisper

OpenAI's Whisper Audio to text transcription right into your web browser! An open source AI subtitling suite.

ai audio docker frontend go openai self-hosting speech text transcription translation web whisper

Last synced: 01 Aug 2024

https://github.com/felixbade/transcribe

Web UI for OpenAI Whisper API

speech-to-text whisper

Last synced: 01 Aug 2024

https://github.com/zhuzilin/whisper-openvino

openvino version of openai/whisper

asr openvino whisper

Last synced: 01 Aug 2024

https://github.com/arihanv/Shush

Shush is an app that deploys a WhisperV3 model with Flash Attention v2 on Modal and makes requests to it via a NextJS app

flash-attention-2 huggingface-transformers machine-learning modal shadcn-ui transcription whisper

Last synced: 10 Aug 2024

https://github.com/IgnoranceAI/hugh

A voice-powered AI built with Whisper, ChatGPT, and ElevenLabs

chatgpt elevenlabs flask whisper

Last synced: 31 Jul 2024

https://github.com/bolna-ai/bolna

End-to-end platform enabling LLM based voice driven conversational applications

anyscale deepgram elevenlabs exotel fastapi litellm llama2 llm mistral openai perplexity-api polly telephony twilio voice-assistant websocket-chat websockets whisper xtts

Last synced: 01 Aug 2024

https://github.com/dmtrKovalenko/subtitler

Free on-device web app for audio transcribing and rendering subtitles

ai rescript subtitles webcodecs whisper

Last synced: 31 Jul 2024

https://github.com/noco-ai/spellbook-docker

AI stack for interacting with LLMs, Stable Diffusion, Whisper, xTTS and many other AI models

automatic-speech-recognition bark llama2 llm-inference mixtral musicgeneration stable-diffusion text-to-speech whisper xttsv2

Last synced: 04 Aug 2024