Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.


Whisper

Whisper is an automatic speech recognition (ASR) model developed by OpenAI. It is an encoder-decoder transformer trained on a large corpus of multilingual audio paired with transcripts, and it can transcribe speech in many languages, translate speech into English, and identify the spoken language. OpenAI has released the code and model weights as open source, and most of the projects below build on them for transcription, subtitling, and voice interfaces. A few entries carrying the whisper tag, such as the Ethereum libraries, refer instead to the unrelated Ethereum Whisper messaging protocol.

https://github.com/showlab/VLog

Transform Video as a Document with ChatGPT, CLIP, BLIP2, GRIT, Whisper, LangChain.

chatgpt langchain large-language-model video-language whisper

Last synced: 06 Nov 2024

https://github.com/dsymbol/decipher

Effortlessly add AI-generated transcription subtitles to your videos

openai transcription translation whisper

Last synced: 28 Dec 2024

https://github.com/dadangdut33/speech-translate

A realtime speech transcription and translation application using OpenAI Whisper and a free translation API. The interface is built with Tkinter, and the code is written entirely in Python.

python speech-transcription speech-translation tkinter-python translate whisper

Last synced: 28 Dec 2024

https://github.com/owlaiproject/owl

A personal wearable AI that runs locally

ai ble bluetooth esp32 llama2 mistral nrf52840 ollama wearable whisper

Last synced: 28 Dec 2024

https://github.com/OwlAIProject/Owl

A personal wearable AI that runs locally

ai ble bluetooth esp32 llama2 mistral nrf52840 ollama wearable whisper

Last synced: 22 Nov 2024

https://github.com/yeyupiaoling/Whisper-Finetune

Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerates inference and supports Web, Windows desktop, and Android deployment.

android asr chinese ctranslate2 huggingface lora pytorch speech-recognition transformers web whisper

Last synced: 09 Oct 2024

https://github.com/Dadangdut33/Speech-Translate

A realtime speech transcription and translation application using OpenAI Whisper and a free translation API. The interface is built with Tkinter, and the code is written entirely in Python.

python speech-transcription speech-translation tkinter-python translate whisper

Last synced: 20 Nov 2024

https://github.com/nyrahealth/crisperwhisper

Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection

asr audio detection filler recognition speech speech-processing speech-recognition timestamps transcription verbatim whisper

Last synced: 27 Dec 2024

https://github.com/ai-ng/swift

Fast voice assistant powered by Groq, Cartesia, and Vercel.

artificial-intelligence cartesia groq llama nextjs react vercel whisper

Last synced: 28 Dec 2024

https://github.com/zh-plus/openlrc

Transcribe and translate voice into LRC files using Whisper and LLMs (GPT, Claude, et al.).

auto-subtitle faster-whisper lyrics lyrics-generator openai-api openlrc python speech-to-text subtitle-translation transcribe voice-to-text whisper

Last synced: 28 Dec 2024

https://github.com/dicklesworthstone/bulk_transcribe_youtube_videos_from_playlist

Easily take an entire YouTube playlist and turn it into high quality transcripts using Whisper.

playlists transcription transcripts whisper youtube

Last synced: 28 Dec 2024

https://github.com/bklieger/scribewizard

ScribeWizard: Generate organized notes from audio using Groq, Whisper, and Llama3

ai groq groq-api llama3 replit whisper

Last synced: 28 Dec 2024

https://github.com/Dicklesworthstone/bulk_transcribe_youtube_videos_from_playlist

Easily take an entire YouTube playlist and turn it into high quality transcripts using Whisper.

playlists transcription transcripts whisper youtube

Last synced: 08 Nov 2024

https://github.com/Bklieger/ScribeWizard

ScribeWizard: Generate organized notes from audio using Groq, Whisper, and Llama3

ai groq groq-api llama3 replit whisper

Last synced: 22 Nov 2024

https://github.com/macoron/whisper.unity

Running speech to text model (whisper.cpp) in Unity3d on your local machine.

asr openai speech-recognition speech-to-text stt unity3d whisper

Last synced: 28 Dec 2024

https://github.com/mybigday/whisper.rn

React Native binding of whisper.cpp.

openai react-native speech-recognition whisper whisper-cpp

Last synced: 27 Dec 2024

https://github.com/seanoliver/audioflare

An all-in-one AI audio playground using Cloudflare AI Workers to transcribe, analyze, summarize, and translate any audio file.

ai cloudflare distilbert llama2 m2m100 openai whisper

Last synced: 23 Dec 2024

https://github.com/toverainc/willow-inference-server

Open source, local, and self-hosted highly optimized language inference server supporting ASR/STT, TTS, and LLM across WebRTC, REST, and WS

cuda deep-learning llama llm privacy speech-recognition speech-to-text text-to-speech vicuna webrtc whisper willow

Last synced: 28 Dec 2024

https://github.com/savbell/whisper-writer

💬📝 A small dictation app using OpenAI's Whisper speech recognition model.

dictation faster-whisper openai openai-api openai-whisper speech-recognition speech-to-text typing-assistant whisper

Last synced: 29 Dec 2024

https://github.com/chrislemke/ChatFred

Alfred workflow using ChatGPT, DALL·E 2 and other models for chatting, image generation and more.

alfred-workflow alfredapp chatbot chatgpt dall-e2 gpt-3 gpt-4 image-generation openai stable-diffusion whisper

Last synced: 06 Nov 2024

https://github.com/chrislemke/chatfred

Alfred workflow using ChatGPT, DALL·E 2 and other models for chatting, image generation and more.

alfred-workflow alfredapp chatbot chatgpt dall-e2 gpt-3 gpt-4 image-generation openai stable-diffusion whisper

Last synced: 26 Sep 2024

https://github.com/rayfernando1337/mlx-auto-subtitled-video-generator

Generate accurate transcripts using Apple's MLX framework

apple mlx transcribe translate whisper

Last synced: 22 Dec 2024

https://github.com/RayFernando1337/MLX-Auto-Subtitled-Video-Generator

Generate accurate transcripts using Apple's MLX framework

apple mlx transcribe translate whisper

Last synced: 27 Dec 2024

https://github.com/shashikg/whispers2t

An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engines

asr deep-learning speech-recognition speech-to-text tensorrt tensorrt-llm vad voice-activity-detection whisper

Last synced: 27 Dec 2024

https://github.com/lspahija/aiui

AIUI is a platform enabling seamless two-way verbal communication with AI.

ai artificial-intelligence chatgpt chatgpt-api conversation conversational-ai gpt gpt-3 gpt-4 machine-learning speech whisper whisper-ai

Last synced: 29 Dec 2024

https://github.com/lspahija/AIUI

AIUI is a platform enabling seamless two-way verbal communication with AI.

ai artificial-intelligence chatgpt chatgpt-api conversation conversational-ai gpt gpt-3 gpt-4 machine-learning speech whisper whisper-ai

Last synced: 06 Nov 2024

https://github.com/lablab-ai/whisper-transcription_and_diarization-speaker-identification-

How to use OpenAI's Whisper to transcribe and diarize audio files

openai python whisper

Last synced: 24 Dec 2024

https://github.com/shashikg/WhisperS2T

An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engines

asr deep-learning speech-recognition speech-to-text tensorrt tensorrt-llm vad voice-activity-detection whisper

Last synced: 14 Nov 2024

https://github.com/yohasebe/openai-chat-api-workflow

🎩 An Alfred 5 Workflow for using OpenAI Chat API to interact with GPT-4o 🤖💬 It also allows image generation 🖼️, image understanding 👀, speech-to-text conversion 🎤, and text-to-speech synthesis 🔈

ai alfred chatbot dall-e gpt image-generation image-understanding openai speech-to-text text-to-speech whisper workflow

Last synced: 24 Dec 2024

https://github.com/etienneab3d/whisperhallu

Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts

asr audio-processing noise-removal sound-processing text-to-speech vad vocals whisper

Last synced: 25 Dec 2024

https://github.com/developersdigest/ai-devices

AI Device Template Featuring Whisper, TTS, Groq, Llama3, OpenAI and more

function-calling gpt-4-vision groq langchain langsmith llama3 llava llm openai serper tts whisper

Last synced: 22 Dec 2024

https://github.com/uruworks/terosubtitler

Tero Subtitler is an open source, cross-platform, and free subtitle editing software.

ai audio-to-text blu-ray captions editor ffmpeg free linux macos mpv open-source smpte subtitle-editor subtitler subtitles tero transcription whisper windows yt-dlp

Last synced: 24 Dec 2024

https://github.com/kabanosk/whisper-website

Simple web application for converting audio to subtitles using OpenAI's Whisper model

audio-to-text fastapi hacktoberfest open-source openai python3 speech-to-text subtitles subtitles-generator uvicorn website whisper

Last synced: 23 Dec 2024

https://github.com/gtreshchev/runtimespeechrecognizer

Cross-platform, real-time, offline speech recognition plugin for Unreal Engine. Based on Whisper OpenAI technology, whisper.cpp.

audio-processing openai speech-detection speech-processing speech-recognition speech-to-text ue4 ue4-plugin ue5 ue5-plugin unreal-engine unreal-engine-4 unreal-engine-5 voice-recognition whis whisper whisper-ai whisper-cpp

Last synced: 25 Dec 2024

https://github.com/gtreshchev/RuntimeSpeechRecognizer

Cross-platform, real-time, offline speech recognition plugin for Unreal Engine. Based on Whisper OpenAI technology, whisper.cpp.

audio-processing openai speech-detection speech-processing speech-recognition speech-to-text ue4 ue4-plugin ue5 ue5-plugin unreal-engine unreal-engine-4 unreal-engine-5 voice-recognition whis whisper whisper-ai whisper-cpp

Last synced: 06 Nov 2024

https://github.com/stage-whisper/stage-whisper

The main repo for Stage Whisper — a free, secure, and easy-to-use transcription app for journalists, powered by OpenAI's Whisper automatic speech recognition (ASR) machine learning models.

ai-transcription audio-transcription electron-app hacktoberfest journalism openai openai-whisper whisper

Last synced: 24 Dec 2024

https://github.com/ioanmo226/chatgpt-web-application

A web application that allows users to interact with various OpenAI models through a simple and user-friendly interface.

ai audio-text chatgpt chatgpt-clone dalle dalle2 davinci-003 express gpt3 highlight-js image-generation markdown-to-html openai whisper

Last synced: 23 Dec 2024

https://github.com/Stage-Whisper/Stage-Whisper

The main repo for Stage Whisper — a free, secure, and easy-to-use transcription app for journalists, powered by OpenAI's Whisper automatic speech recognition (ASR) machine learning models.

ai-transcription audio-transcription electron-app hacktoberfest journalism openai openai-whisper whisper

Last synced: 25 Nov 2024

https://github.com/URUWorks/TeroSubtitler

Tero Subtitler is an open source, cross-platform, and free subtitle editing software.

ai audio-to-text blu-ray captions editor ffmpeg free linux macos mpv open-source smpte subtitle-editor subtitler subtitles tero transcription whisper windows yt-dlp

Last synced: 05 Nov 2024

https://github.com/Kabanosk/whisper-website

Simple web application for converting audio to subtitles using OpenAI's Whisper model

audio-to-text fastapi hacktoberfest open-source openai python3 speech-to-text subtitles subtitles-generator uvicorn website whisper

Last synced: 05 Nov 2024

https://github.com/ariym/whisper-node

Node.js bindings for OpenAI's Whisper. (C++ CPU version by ggerganov)

ai cpp ffmpeg ml nodejs openai typescript whisper

Last synced: 28 Dec 2024

https://github.com/matteofasulo/whisper-tiktok

From AI tools to TikTok video creation using FFMPEG, Microsoft Edge read aloud and OpenAI Whisper model

edge-tts ffmpeg mkdocs-material python text-to-speech tiktok whisper

Last synced: 25 Dec 2024

https://github.com/bhattbhavesh91/voice-assistant-whisper-chatgpt

This repository will guide you to create your own Smart Virtual Assistant like Google Assistant using OpenAI's ChatGPT and Whisper. The entire solution is created using Python and Gradio.

chatgpt chatgpt-api google-assistant gpt-3 gradio huggingface language-model language-models openapi virtual-assistant voice-assistant whisper

Last synced: 25 Dec 2024

https://github.com/microsoft/ai-dev-gallery

An open-source project for Windows developers to learn how to add AI with local models and APIs to Windows apps.

ai csharp developer-tools directml dotnet genai mistral npu onnx onnxruntime onnxruntime-genai phi3 qnn stable-diffusion visual-studio whisper winappsdk windows winui3 wpf

Last synced: 28 Dec 2024

https://github.com/xf00f/web3x

Ethereum TypeScript Client Library - for perfect types and tiny builds.

api ethereum javascript swarm typescript web3 web3js whisper

Last synced: 22 Dec 2024

https://github.com/nikdanilov/whisper-obsidian-plugin

Speech-to-text in Obsidian using OpenAI Whisper

obsidian openai-whisper speech-to-text stt transcribe voice whisper

Last synced: 04 Dec 2024

https://github.com/Robitx/gp.nvim

Gp.nvim (GPT prompt) Neovim AI plugin: ChatGPT sessions & Instructable text/code operations & Speech to text [OpenAI]

ai chatgpt codeium copilot cursor gpt gpt-4 gpt4 llm lua neovim nvim openai plugin speech-to-text tabnine vim voice whisper

Last synced: 26 Oct 2024

https://github.com/jim60105/docker-whisperx

Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and test)

asr docker-image dockerfile speech speech-recognition speech-to-text whisper

Last synced: 27 Dec 2024

https://github.com/josefalbers/whisper-turbo-mlx

Blazing fast whisper turbo for ASR (speech-to-text) tasks

asr deep-learning mlx speech-recognition speech-to-text whisper whisper-turbo

Last synced: 23 Dec 2024

https://github.com/dmtrkovalenko/subtitler

Free on-device web app for audio transcribing and rendering subtitles

ai rescript subtitles webcodecs whisper

Last synced: 28 Dec 2024

https://github.com/felixbade/transcribe

Web UI for OpenAI Whisper API

speech-to-text whisper

Last synced: 05 Nov 2024

https://github.com/jim60105/docker-whisperX

Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and test)

asr docker-image dockerfile speech speech-recognition speech-to-text whisper

Last synced: 05 Nov 2024

https://github.com/pluja/web-whisper

OpenAI's Whisper Audio to text transcription right into your web browser! An open source AI subtitling suite.

ai audio docker frontend go openai self-hosting speech text transcription translation web whisper

Last synced: 08 Nov 2024

https://github.com/dmtrKovalenko/subtitler

Free on-device web app for audio transcribing and rendering subtitles

ai rescript subtitles webcodecs whisper

Last synced: 29 Oct 2024

https://github.com/arihanv/Shush

Shush is an app that deploys a WhisperV3 model with Flash Attention v2 on Modal and makes requests to it via a NextJS app

flash-attention-2 huggingface-transformers machine-learning modal shadcn-ui transcription whisper

Last synced: 30 Nov 2024

https://github.com/supershaneski/openai-whisper

A sample web app using OpenAI Whisper to transcribe audio built on Next.js. It records audio continuously for some time interval then uploads the audio data to the server for transcribing/translating.

nextjs openai openai-whisper reactjs whisper

Last synced: 24 Oct 2024

https://github.com/zhuzilin/whisper-openvino

OpenVINO version of openai/whisper

asr openvino whisper

Last synced: 02 Nov 2024

https://github.com/IgnoranceAI/hugh

A voice-powered AI built with Whisper, ChatGPT, and ElevenLabs

chatgpt elevenlabs flask whisper

Last synced: 27 Oct 2024

https://github.com/geekodour/wscribe

ez audio transcription tool with flexible processing and post-processing options

audio-processing transcription whisper

Last synced: 24 Dec 2024

https://github.com/ieasybooks/tafrigh

Transcribe audio to text and generate SRT and VTT files using Whisper models and wit.ai technology.

asr automatic-speech-recognition ctranslate2 facebook faster-whisper javascript python soundcloud srt stable-whisper subtitles twitter vtt whisper youtube

Last synced: 25 Dec 2024

https://github.com/etienneab3d/whispertimesync

Synchronize Whisper's timestamps over an existing accurate transcription

aligner asr nlp subtitles text-to-speech whisper

Last synced: 19 Nov 2024

https://github.com/Illyism/openai-whisper-api

OpenAI Whisper API based on Node.js / Bun.sh in a Docker Container + Google Cloud Run Example

chatgpt openai openai-whisper whisper

Last synced: 14 Nov 2024

https://github.com/noco-ai/spellbook-docker

AI stack for interacting with LLMs, Stable Diffusion, Whisper, xTTS and many other AI models

automatic-speech-recognition bark llama2 llm-inference mixtral musicgeneration stable-diffusion text-to-speech whisper xttsv2

Last synced: 18 Nov 2024

https://github.com/illyism/openai-whisper-api

OpenAI Whisper API based on Node.js / Bun.sh in a Docker Container + Google Cloud Run Example

chatgpt openai openai-whisper whisper

Last synced: 27 Oct 2024

https://github.com/bits-by-brandon/whisper-ui

A GUI for OpenAI Whisper based on Tauri and SvelteKit

rust speech-to-text svelte tauri whisper

Last synced: 09 Nov 2024

https://github.com/jimmylv/chatvox

"Chat With Any Video" project, built in 24 hours as a personal challenge for @Supabase's AI Hackathon.

ai chatgpt openai supabase video whisper

Last synced: 24 Nov 2024

https://github.com/m1guelpf/whisper-cli-rs

A Whisper CLI, built with Rust.

cli rust whisper

Last synced: 25 Dec 2024

https://github.com/johniwasz/whetstone.chatgpt

A simple, lightweight library that wraps the OpenAI API.

chatgpt dotnet dotnet-standard2 dotnet-standard2-1 gpt-3 gpt-35-turbo gpt-4 openai whisper whisper-ai

Last synced: 24 Dec 2024

https://github.com/aadeshkulkarni/sanchay-ai

Takes your video and generates video title, description, hashtags, transcription, subtitles and more.

generative-ai javascript object-store python rabbitmq whisper

Last synced: 14 Nov 2024

https://github.com/status-im/nim-eth

Common utilities for Ethereum

devp2p discv5 eth ethereum nim rlp whisper

Last synced: 23 Dec 2024

https://github.com/pinto0309/whisper-onnx-cpu

ONNX implementation of Whisper. PyTorch free.

cpu numpy onnx whisper

Last synced: 24 Dec 2024

https://github.com/nalbion/whisper-server

streaming speech to text server using Whisper

idiolect nlp whisper

Last synced: 19 Dec 2024

https://github.com/piotrkawa/deepfake-whisper-features

Implementation of the paper "Improved DeepFake Detection Using Whisper Features"

audio-deepfake-detection deep-learning deepfake-detection paper-implementations whisper

Last synced: 24 Oct 2024