Whisper | Ecosyste.ms: Awesome

https://github.com/yum-food/tastt

A free self-hosted STT for VRChat

speech-to-text vrchat vrchat-avatars vrchat-osc vrchat-sdk3 vrchat-tool whisper

Last synced: 09 Oct 2024

https://github.com/load1n9/openai

Unofficial Deno wrapper for the Open Ai api

ai chatgpt chatgpt-api collaborate dall-e dalle2 gpt gpt-3 gpt-35-turbo gpt-4 gpt4 gptchat openai openai-api whisper whisper-ai

Last synced: 16 Nov 2024

https://github.com/justmalhar/open-audio

Open-Audio TTS: A robust web app leveraging OpenAI's powerful Text-to-Speech (TTS) models to generate natural-sounding audio from text. Built with modern web technologies for an intuitive user experience, including customizable voice and speech speed settings, and the ability to download audio files directly.

chakra-ui chatgpt nextjs openai openai-api openai-chatgpt openai-whisper tailwind text-to-speech tts tts-1 tts-api vercel whisper

Last synced: 17 Dec 2024

https://github.com/JonathanFly/faster-whisper-livestream-translator

faster-whisper livestream translation, OBS noise reduction, dual language subtitles

faster-whisper speech-to-text subtitles whisper

Last synced: 22 Nov 2024

https://github.com/mharrvic/fast-audio-video-transcribe-with-whisper-and-modal

Fast Audio/Video transcribe using Openai's Whisper and Modal, an hour audio/video file can be transcribed in ~1 minute

fastapi modal openai python transcribe whisper

Last synced: 15 Nov 2024

https://github.com/innovatorved/whisper-openai-gradio-implementation

Whisper is an automatic speech recognition (ASR) system Gradio Web UI Implementation

innovatorved openai speech-recognition speech-to-text whisper

Last synced: 27 Oct 2024

https://github.com/shhossain/banglaspeech2text

BanglaSpeech2Text: An open-source offline speech-to-text package for Bangla language. Fine-tuned on the latest whisper speech to text model for optimal performance.

bangla bangla-asr bangla-automatic-speech-recognition bangla-speech-recognition bangla-speech-to-text bangla-voice-recognition deep-learning hacktoberfest machine-learning pytorch speech speech-recognition speech-to-text transformer voice-recognition whisper whisper-model

Last synced: 21 Dec 2024

https://github.com/mouredev/tggenerator

Generador de logotipos de eSports por IA (con fines académicos durante el evento Tenerife GG)

android android-app androidstudio dall-e dalle2 gpt-3-5-turbo jetpack-compose openai openai-api whisper whisper-ai whisper-api

Last synced: 10 Nov 2024

https://github.com/echocat/puppet-graphite

Puppet module for graphite monitoring tools

carbon echocat graphite puppet whisper

Last synced: 25 Sep 2024

https://github.com/stangirard/quivr-whisper

Talk to your second brain personal assistant using speech 🧠

assistant gpts openai personal quivr speech transcribe tts whisper

Last synced: 28 Oct 2024

https://github.com/carloscdias/whisper-cpp-python

whisper.cpp bindings for python

python python3 whisper whisper-api whisper-cpp

Last synced: 24 Oct 2024

https://github.com/thesethrose/time-capsule

Time Capsule continuously captures and stores digital activities to create a comprehensive memory system. It features real-time audio recording, speech-to-text with Fast-Whisper, plugin support, database storage via Chroma, and a web interface for management. Ideal for documenting life or building digital memories.

ai audio-capture chroma data-storage digital-memory flask memory plugins python whisper

Last synced: 19 Dec 2024

https://github.com/rerender2021/heard

A simple subtitle generator powered by whisper & avernakis react.

avernakis desktop react subtitle whisper windows

Last synced: 08 Nov 2024

https://github.com/TheSethRose/Time-Capsule

Time Capsule continuously captures and stores digital activities to create a comprehensive memory system. It features real-time audio recording, speech-to-text with Fast-Whisper, plugin support, database storage via Chroma, and a web interface for management. Ideal for documenting life or building digital memories.

ai audio-capture chroma data-storage digital-memory flask memory plugins python whisper

Last synced: 04 Dec 2024

https://github.com/daymade/tiktok-whisper

Batch convert video to text using openai's whisper or the local coreML via whisper.cpp on your MacBook

coreml openai pgvector podcast postgresql sqlite tiktok whisper whisper-cpp xiaoyuzhou

Last synced: 28 Nov 2024

https://github.com/stangirard/speechdigest

Audio to summary with openAI Whisper & GPT 3.5/4 using streamlit

audio-processing gpt gpt-3 gpt-4 llm openai recording summarization whisper

Last synced: 14 Nov 2024

https://github.com/etienneab3d/karaok-ai

Karaoke Player / Editor with automatic clip creation from any song file using vocals and lyrics extraction (Speech-to-Text)

djing karaoke karaoke-maker lyrics mp3-player music party-apps sound-processing speech-to-text srt-subtitles subtitles vad whisper

Last synced: 19 Nov 2024

https://github.com/RayFernando1337/MLX-Auto-Subtitled-Video-Generator

Generate accurate transcripts using Apple's MLX framework

apple mlx pinokio transcribe translate whisper

Last synced: 03 Sep 2024

https://github.com/mingkuan/voice-assistant-chatgpt

Voice Assistant based on Whisper ASR and ChatGPT API

ai-web-app asr chatbot-application chatgpt chatgpt-bot multilingual speech-recognition speech-synthesis streamlit streamlit-webapp voice-assistant whisper

Last synced: 06 Nov 2024

https://github.com/shamspias/chatgpt-voice-chatbot-telegram

ChatGPT Voice Chatbot Telegram is a Python and Flask-based GitHub repository that enables users to communicate with an AI chatbot using voice-to-text and text-to-voice technologies powered by OpenAI. The repository provides a flexible and customizable solution for building advanced voice-enabled chatbots using natural language processing.

celery chatbot chatgpt dall-e flask gpt-3 openjourney python telegram-bot telegram-voice-chat text-to-speech text-to-speech-python3 tts voice-chat voice-conversion voice-recognition voice-to-text whisper

Last synced: 04 Dec 2024

https://github.com/dartvauder/neurosandboxwebui

(Windows/Linux) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) on python (Gradio interface). Translated on 3 languages

audioldm cogvideox demucs diffusers flux gradio llamacpp llm neural-network python rvc seamlessm4t stable-diffusion stableaudio stablefast3d transformers tts wav2lip webui whisper

Last synced: 02 Dec 2024

https://github.com/pinto0309/whisper-onnx-tensorrt

ONNX and TensorRT implementation of Whisper

cupy numpy onnx stt tensorrt whisper

Last synced: 22 Oct 2024

https://github.com/kurianbenoy/indic-subtitler

Open source subtitling platform 💻 for transcribing and translating videos/audios in Indic languages.

asr deep-learning fastapi faster-whisper inference nextjs openai quantization speech-recognition speech-to-text transformers vegam-whisper webapp whisper whisperx

Last synced: 01 Nov 2024

https://github.com/EtienneAb3d/karaok-AI

Karaoke Player / Editor with automatic clip creation from any song file using vocals and lyrics extraction (Speech-to-Text)

djing karaoke karaoke-maker lyrics mp3-player music party-apps sound-processing speech-to-text srt-subtitles subtitles vad whisper

Last synced: 08 Nov 2024

https://github.com/gergovari/lazyshorts-py

Create short videos, like a lazy person.

ai ffmpeg instagram lazy mediapipe moviepy shorts tiktok video videos whisper youtube

Last synced: 03 Dec 2024

https://github.com/runpod/serverless-workers

⚙️ | REPLACED BY https://github.com/runpod-workers | Official set of serverless worker provided by RunPod as endpoints.

ai anything-v3 containers docker openjourney runpod serverless stable-diffusion whisper workers

Last synced: 25 Nov 2024

https://github.com/chetanxpro/nodejs-whisper

Introducing NodeJS Bindings for Whisper - the CPU version of OpenAI's Whisper, as initially crafted in C++ by ggerganov.

ai cpp ml nodejs-whisper openai speech-recognition speech-to-text timestamp whisper whisper-nodejs

Last synced: 15 Nov 2024

https://github.com/gregistech/lazyshorts-py

Create short videos, like a lazy person.

ai ffmpeg instagram lazy mediapipe moviepy shorts tiktok video videos whisper youtube

Last synced: 20 Nov 2024

https://github.com/aronweiler/assistant

An intellligent AI assistant that can do anything!

ai database large-language-models llama2 llamacpp llm open-ai open-ai-api openai pgvector polly-voice postgres postgresql python streamlit transcription voice-assistant voice-recognition whisper

Last synced: 03 Dec 2024

https://github.com/i5ucc/vrctextboxstt

A SpeechToText application that uses OpenAI's whisper via faster-whisper to transcribe audio and send that information to VRChats textbox system and/or KillFrenzyAvatarText over OSC. Also supports various other methods like OBS via Browsersource and a SteamVR overlay!

obs openai openai-whisper openvr osc speech-recognition speech-to-text vrchat vrchat-avatars vrchat-osc vrchat-sdk3 vrchat-tool whisper

Last synced: 09 Oct 2024

https://github.com/fedirz/faster-whisper-server

docker docker-compose faster-whisper openai-api openai-whisper openai-whisper-translation transcription whisper whisper-ai

Last synced: 20 Oct 2024

https://github.com/j3soon/whisper-to-input

An Android keyboard that performs speech-to-text (STT/ASR) with OpenAI Whisper and input the recognized text; Supports English, Chinese, Japanese, etc. and even mixed languages.

android android-ime automatic-speech-recognition chinese-speech-recognition ime keyboard kotlin openai openai-api speech speech-recognition speech-to-text virtual-keyboard voice voice-recognition whisper

Last synced: 19 Dec 2024

https://github.com/linto-ai/linto-stt

An automatic speech recognition API

asr celery kaldi-asr linto microservices offline online speech-recognition speech-to-text stt websockets whisper

Last synced: 19 Dec 2024

https://github.com/QuantiusBenignus/blurt

Gnome shell extension for accurate speech to text input in Linux using whisper.cpp. Input text from speech anywhere.

ai asr bloat-free dictate dictation gnome gnome-extension gnome-shell-extension input input-method kiss linux machine-learning speech-recognition speech-to-text whisper whisper-cpp

Last synced: 08 Nov 2024

https://codeberg.org/pluja/web-whisper

New repo: https://codeberg.org/pluja/web-whisper-plus

ai audio go openai speech-to-text svelte transcription translation ui web whisper

Last synced: 14 Nov 2024

https://github.com/kurianbenoy/whisper_normalizer

A python package for whisper normalizer

asr asr-benchmark jupyter-notebook nbdev normalizers openai whisper

Last synced: 21 Dec 2024

https://github.com/gewoonjaap/winwhisper

Create subtitles with ease, using Whisper AI for Windows

csharp openai subtitle subtitles subtitles-generator videos whisper whisper-ai

Last synced: 27 Oct 2024

https://github.com/extremq/gptsubtitler

Automatically subtitle any video spoken in any language to a language of your choice using AI.

ai ffmpeg gpt huggingface openai subtitles transcriber translation whisper

Last synced: 18 Nov 2024

https://github.com/chromakode/coalesce

Edit audio at the speed of text

audio editor podcast whisper

Last synced: 07 Nov 2024

https://github.com/JorianWoltjer/AutoCaptions

A GUI tool that uses OpenAIs Whisper to transcribe text from an audio/video file, into a Premiere Pro sequence to automate the creation of subtitles.

ai premiere-pro srt subtitles whisper xml

Last synced: 20 Nov 2024

https://github.com/lucaluke13/talkybotty

Simply forward a video or voice message in any language to the bot, and it will reply with a translation.

ai osint telegram-bot text-to-speech translation voice whisper

Last synced: 12 Oct 2024

https://github.com/jorianwoltjer/autocaptions

A GUI tool that uses OpenAIs Whisper to transcribe text from an audio/video file, into a Premiere Pro sequence to automate the creation of subtitles.

ai premiere-pro srt subtitles whisper xml

Last synced: 07 Nov 2024

https://github.com/divineux23/audio-to-audio-translation

Imagine translating your speech or anybody's speech to any language you want within minutes. check this out...

chatgpt elevenlabs flask language translator whisper

Last synced: 10 Nov 2024

https://github.com/pinto0309/faster-whisper-env

An environment where you can try out faster-whisper immediately.

whisper

Last synced: 22 Oct 2024

https://github.com/serg-plusplus/meeper

Meeper 📝 - is your secretary for any in-browser conference.

ai chatgpt extension langchain summary transcription whisper

Last synced: 02 Nov 2024

https://github.com/faker2048/youtube-faster-whisper

YTWS is a simple CLI tool that downloads YouTube videos and creates subtitles quickly. It uses yt-dlp for downloading and faster-whisper for transcribing, making it easy and efficient to use.

substitle tools transcript whisper youtube

Last synced: 11 Oct 2024

https://github.com/jovanveljanoski/jupyter-voicepilot

A JupyterLab extension for generating code and interacting with JupyterLab Notebooks via voice commands

gpt-3 jupyterlab jupyterlab-extension voice whisper

Last synced: 09 Oct 2024

https://github.com/kostasereksonas/audio-transcriber

Simple Python audio transcriber using OpenAI's Whisper speech recognition model

audio audio-to-text openai openai-whisper pip python text transcription whisper youtube youtube-dl

Last synced: 16 Nov 2024

https://github.com/KostasEreksonas/Audio-transcriber

Simple Python audio transcriber using OpenAI's Whisper speech recognition model

audio audio-to-text openai openai-whisper pip python text transcription whisper youtube youtube-dl

Last synced: 25 Nov 2024

https://github.com/saharmor/anima

Turn text into video using Stable Diffusion and Google FILM

artificial-intelligence deep-learning generativeai generativeart stable-diffusion text-to-video whisper

Last synced: 13 Nov 2024

https://github.com/Maitreyapatel/speech-conversion-between-different-modalities

Generative Adversarial Networks for different impaired speech conversions

deep-learning generative-adversarial-networks pytorch speech-conversion voice-conversion whisper

Last synced: 25 Nov 2024

https://github.com/bhattbhavesh91/diffusion-chatgpt

This repository will guide you to create your Images via Stable Diffusion using a Smart Virtual Assistant like Google Assistant using Open AI's ChatGPT, Whisper. The entire solution is created using Python & Gradio.

chatgpt chatgpt-api google-assistant gpt-3 gradio gradio-interface language-model language-models openai stable-diffusion stable-diffusion-diffusers stable-diffusion-v2 whisper

Last synced: 16 Nov 2024

https://github.com/platisd/phonix

Generate captions for videos using the power of OpenAI's Whisper API

openai openai-api openai-whisper video-srt video-to-caption video-to-text whisper

Last synced: 27 Oct 2024

https://github.com/sabber-slt/rayvox

Audio and Video transcription with whisper API and Next.js

chakra-ui nextjs transcription typescript whisper

Last synced: 17 Nov 2024

https://github.com/appleboy/go-whisper

Speech o Text using docker image with ggerganov/whisper.cpp

golang openai whisper whisper-ai whisper-cpp

Last synced: 15 Oct 2024

https://github.com/lablab-ai/openai_whisper_streamlit

A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper

asr openai python streamlit whisper

Last synced: 24 Nov 2024

https://github.com/AIFSH/ComfyUI-WhisperX

a comfyui cuatom node for audio subtitling based on whisperX and translators

srt-subtitles sutitles translation whisper

Last synced: 19 Dec 2024

https://github.com/DivineUX23/Audio-to-Audio-translation

Imagine translating your speech or anybody's speech to any language you want within minutes. check this out...

chatgpt elevenlabs flask language translator whisper

Last synced: 24 Oct 2024

https://github.com/nooqta/kodyfire

AI-powered code generator and automation tool

ai automation boilerplate chatgpt cli codex generator low-code no-code openai openai-api scaffold template typescript whisper yeoman

Last synced: 07 Nov 2024

https://github.com/solygambas/python-openai-projects

13 projects using ChatGPT API, Whisper, Embeddings, and DALL-E with Python.

auto-gpt chatbot chatgpt dall-e embeddings gpt-4 langchain langchain-python machine-learning nlp nlp-machine-learning open-ai-api openai python reddit reddit-api spotify spotify-api stable-diffusion whisper

Last synced: 27 Oct 2024

https://github.com/eyevinn/auto-subtitles

Automatically generate subtitles from an input audio or video file using OpenAI Whisper

ffmpeg openai openai-whisper subtitle-generator subtitles subtitles-generator tools transcription video video-streaming whisper

Last synced: 16 Nov 2024

https://github.com/yohasebe/whisper-stream

A bash script that uses the OpenAI Whisper API to transcribe continuous spoken audio into text

command-line dictation openai transcription voice-to-text whisper

Last synced: 08 Nov 2024

https://github.com/alxpez/alts

100% free, local & offline voice assistant with speech recognition

assistant chatbot llm local offline ollama speech-recognition stt tts voice voice-assistant whisper

Last synced: 20 Oct 2024

https://github.com/smaranjitghose/aiaudiotranscriber

A minimalistic web app to generate transciption for audio built using Python

docker open-source openai python python3 speech-recognition speech-to-text streamlit streamlit-lottie streamlit-webapp whisper

Last synced: 27 Nov 2024

https://github.com/yj-20/auto-subtitle-translate

Automatically generate, translate, and overlay subtitles for any video.

ai ai-subtitle automatic-subtitle deep-learning ffmpeg llama llama2 python subtitle-generator subtitles subtitles-generator translates translator whisper

Last synced: 27 Sep 2024

https://github.com/aifsh/comfyui-whisperx

a comfyui cuatom node for audio subtitling based on whisperX and translators

srt-subtitles sutitles translation whisper

Last synced: 08 Nov 2024

https://github.com/smaranjitghose/AIAudioTranscriber

A minimalistic web app to generate transciption for audio built using Python

docker open-source openai python python3 speech-recognition speech-to-text streamlit streamlit-lottie streamlit-webapp whisper

Last synced: 25 Nov 2024

https://github.com/beingamanforever/tech-enhanced-ai-interview-learning-platform

Developed a sophisticated machine learning model capable of generating diverse interview questions aligned with specific topics, ensuring depth of conversation. Integrated advanced Natural Language Processing (NLP) algorithms to analyse spoken responses, identifying grammatical errors & offering accurate corrections after the interview.

ai-chatbot ai-chatbots api chatbot dataset fine flask huggingface huggingface-transformers inteview-test kaggle kaggle-notebooks large latex-document machine-learning mlops openai whisper

Last synced: 11 Oct 2024

https://github.com/CrimeIsDown/trunk-transcribe

Transcription of calls from trunk-recorder using OpenAI Whisper

celery meilisearch openai-whisper telegram-bot trunk-recorder whisper

Last synced: 25 Nov 2024

https://github.com/avencores/openai-api-telegram-bot-public

🖤ChatGPT, DALLE-2 and Whisper Telegram Bot🖤

api bot chatgpt chatgpt-bot gpt4free open-source openai openapi opensource python python-bot python3 telebot telegram telegram-bot whisper whisper-ai whisperbot

Last synced: 18 Nov 2024

https://codeberg.org/pluja/web-whisper-plus

NEW VERSION AT: https://github.com/pluja/whishper. A transcription suite on your web browser: OpenAI's whisper and many other features. Formerly "web-whisper-plus"

ai audio docker go golang speech subtitles sveltekit text transcription whisper

Last synced: 24 Oct 2024

https://github.com/umer-sheikh/bird-whisperer

[InterSpeech 2024] Official code repository of paper titled "Bird Whisperer: Leveraging Large Pre-trained Acoustic Model for Bird Call Classification" accepted in InterSpeech 2024 conference.

bird-call-classification birdclef-2023 fine-tuning whisper

Last synced: 09 Oct 2024

https://github.com/Eyevinn/auto-subtitles

Automatically generate subtitles from an input audio or video file using OpenAI Whisper

ffmpeg openai openai-whisper subtitle-generator subtitles subtitles-generator tools transcription video video-streaming whisper

Last synced: 07 Nov 2024

https://github.com/jim-schwoebel/nala_assistant

🔊😊 A fastapi voice-assistant framework to quickly prototype LLM-powered voice assistants in <5 minutes.

chatbot chatgpt dolly2 fastapi fastapi-boilerplate fastapi-sqlalchemy fastapi-template large-language-models llm llms speech-recognition speech-to-text speecht5 tts voice voice-assistant voice-assistants wakeword whisper whisper-model

Last synced: 07 Nov 2024

https://github.com/Nachimak28/LAI-voice-search-openai-whisper-demo

A ⚡️ Lightning.ai ⚡️ app demo for Voice based web search using OpenAI's Whisper and DuckDuckGo

openai speech-to-text websearch whisper

Last synced: 24 Oct 2024

https://github.com/nitaiaharoni1/whisper-speech-to-text

Whisper Speech-to-Text is a JavaScript library for recording and transcribing user audio into text via OpenAI's Whisper, intended for web applications.

javascript openai openai-whisper react speech speech-recognition speech-to-text text-recognition typescript webapp whisper whisper-ai

Last synced: 13 Nov 2024

https://github.com/abus-aikorea/kara-audio

Gradio WebUI for whisper, faster-whisper, whisper-timestamped. Supports YouTube Downloader, Vocal Remover and Transcription.

asr demucs faster-whisper gradio karaoke mdx-net music-source-separation openai-whisper speech-recognition speech-to-text stt subtitle uvr vocal-remover webui whisper

Last synced: 10 Nov 2024

https://github.com/ultmaster/whisper-movie

Generate subtitles for long movies / podcasts with OpenAI Whisper API.

audio-transcription speech-to-text subtitles translation whisper

Last synced: 08 Nov 2024

https://github.com/codingforentrepreneurs/Smarter-Web-Scraping-with-Python

Leverage modern open-source tools to create better web scraping workflows.

apple-itunes-search-api brightdata gpt gpt3 hacker-news itunes-podcast-api llama2 llm ollama open-source openai podcast proxy-scraper python3 selenium whisper

Last synced: 15 Oct 2024

https://github.com/ireddragonicy/vixevia

An AI-powered Virtual YouTuber (Vtuber) utilizing Google's Gemini language model to create engaging, personalized, and context-aware interactions. This project explores the potential of AI in human-computer interaction and virtual content creation.

ai anime api artificial-intelligence chatbot collaborate gemini-api gemini-chatbot gemini-pro gemini-pro-vision gemini-vision-pro girl google javascript python vits vtuber waifu whisper youtuber

Last synced: 10 Nov 2024

https://github.com/tensorchord/inference-benchmark

Benchmark for machine learning model online serving (LLM, embedding, Stable-Diffusion, Whisper)

benchmark inference-server llm stable-diffusion whisper

Last synced: 12 Nov 2024

https://github.com/litongjava/whisper-cpp-server

whisper-cpp-serve Real-time speech recognition and c+ of OpenAI's Whisper model in C/C++

asr inference opneai speech-recognition speech-to-text transformer whisper whisper-cpp whisper-cpp-server whisper-server

Last synced: 27 Nov 2024

https://github.com/lrq3000/futo-voiceinput-whisper

Mirror of FUTO's Voice Input, an Android Voice Keyboard for Speech-To-Text transcribing using Whisper, supporting large multilanguage models and with automatic language detection

android speech-to-text whisper

Last synced: 09 Nov 2024

https://github.com/rufuszhu/WhisperSRT

Generate subtitle for video using whisper and translate to other language using DeepL

openai python srt translation whisper

Last synced: 24 Oct 2024

https://github.com/0x9ef/openai-go

OpenAI GPT-3/3.5/4 API client written in Go

api babbage client codex curie davinci go golang gpt-3 gpt-4 openai whisper

Last synced: 24 Nov 2024

https://github.com/bzed/whisper-to-graphite

Read and send metrics from whisper files to graphite - Used to migrate to different graphite backends

golang graphite graphite-backends metrics migration whisper whisper-files

Last synced: 24 Oct 2024

https://github.com/IRedDragonICY/vixevia

An AI-powered Virtual YouTuber (Vtuber) utilizing Google's Gemini language model to create engaging, personalized, and context-aware interactions. This project explores the potential of AI in human-computer interaction and virtual content creation.

ai anime api artificial-intelligence chatbot collaborate gemini-api gemini-chatbot gemini-pro gemini-pro-vision gemini-vision-pro girl google javascript python vits vtuber waifu whisper youtuber

Last synced: 24 Oct 2024

https://github.com/aviaryan/voice-writing-electron

A real-time, instant dictation desktop application built on Electron that uses Whisper and GROQ under the hood

electron groq groq-api svelte whisper whisper-cpp

Last synced: 10 Oct 2024

https://github.com/botbahlul/whisper_autosrt

A python script COMMAND LINE utility to AUTO GENERATE SUBTITLE FILE (using faster_whisper module which is a reimplementation of OpenAI Whisper module) and TRANSLATED SUBTITLE FILE (using unofficial online Google Translate API) for any video or audio file

auto-caption auto-subtitle caption faster-whisper ffmpeg google-translate-api openai openai-whisper python speech-recognition speechrecognition subtitle voice-recognition voicerecognition whisper

Last synced: 09 Oct 2024

https://github.com/impavloh/whittsper-the-lora

Demo combining Whisper for speech recognition and Google TTS for speech synthesis to interact with Alpaca-LoRA.

ai google-colab gtts llama whisper

Last synced: 22 Dec 2024

https://github.com/markgoodhead/dictate-wizard

Dictate Wizard is an open source dictation tool powered by OpenAI's Whisper. The goal is to obsolete as much typing as possible and let you speak your emails, instant messages etc instead.

conjecture faster-whisper openai soniox whisper

Last synced: 24 Oct 2024