Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Whisper
Whisper is an autoregressive language model developed by OpenAI. It is trained on a large corpus of text using a transformer architecture and is capable of generating high-quality natural language text. Whisper can be used for tasks such as language modeling, text completion, and text generation. It has shown impressive performance on various benchmarks and has been released by OpenAI to encourage research in the field of language modeling. Whisper is not yet available for public use, but it has the potential to transform the field of natural language processing and generate new opportunities for language-based applications.
- GitHub: https://github.com/topics/whisper
- Repo: https://github.com/openai/whisper
- Created by: OpenAI
- Released: August 2021
- Related Topics: machine-learning, artificial-intelligence, language-modeling,
- Last updated: 2025-01-27 00:29:05 UTC
- JSON Representation
https://github.com/valkryst/whisper_automations
Various scripts for automating tasks using OpenAI's Whisper.
automation openai subtitle subtitle-generator transcription translation whisper
Last synced: 26 Dec 2024
https://github.com/zahidhasann88/video-summarizer
A videos by extracting audio and generating summaries based on the audio content.
nodejs openai typescript whisper
Last synced: 07 Jan 2025
https://github.com/hsiehbocheng/yt-gen-caption
This is a Porject for generating captions for YouTube videos using Faster Whisper & yt_dlp.
Last synced: 19 Dec 2024
https://github.com/cnseniorious000/dl-a2t
download, audio-to-text PyPI: https://pypi.org/p/dl-a2t
audio transcription whisper youtube
Last synced: 02 Jan 2025
https://github.com/cp3249/athena_project
Athena is an AI assistant project that utilizes voice recognition, text-to-speech, and tool-calling capabilities to provide a conversational and interactive experience. It uses LLMs available through Ollama and provides a basic framework for extending functionalities through a modular tool system.
Last synced: 15 Jan 2025
https://github.com/ubos-tech/node-red-contrib-speech-to-text-ubos
Learn how to turn audio into text.
ai low-code lowcode node-red node-red-contrib node-red-flow openai openai-api openai-whisper speech-to-text whisper whisper-ai whisper-api
Last synced: 20 Jan 2025
https://github.com/mottla/speech-to-text
Local and fast speech to text (STT) with speaker recognition. Transcibe your meetings confidentially.
huggingface speech-recognition stt teams transcription translation whisper zoom
Last synced: 21 Jan 2025
https://github.com/xi-rick/captains-log
Captain's Log is your personal AI-powered voice transcription logbook. This innovative web application allows you to transcribe spoken words into text, organize your thoughts, and manage important notes. Built with cutting-edge technology and creative design, Captain's Log sets sail to revolutionize how you capture and manage ideas.
audio-recorder audio-visualizer javascript mongodb mongodb-atlas nextjs once-ui openai react reactjs shadcn-ui tailwindcss typescript voice whisper
Last synced: 21 Jan 2025
https://github.com/arslanex/whisperdemo
A scalable Python module for robust audio transcription using OpenAI's Whisper model. Supports multiple languages, batch processing, and output formats like JSON and SRT.
audio-processing openai openai-whisper python whisper
Last synced: 23 Nov 2024
https://github.com/philogicae/docker-faster-whisper-fr-api
Docker - Faster Whisper FR - RunPod Serverless API
ctranslate2 docker faster-whisper french runpod serverless whisper
Last synced: 08 Jan 2025
https://github.com/dheison0/subcreator
A subtitle creator, translator and embeder tool made using AI
ai machine-learning ml python subtitles video-processing whisper
Last synced: 09 Oct 2024
https://github.com/bilalhameed248/whisper-fine-tuning-for-pronunciation-learning
Fine Tuning of Whisper Speech To Text Base Model For Pronunciation Learning
deep-learning deep-neural-networks dnn fine-tuning openai pronunciation python seq2seq speech speech-recognition speech-synthesis speech-to-text whisper whisper-ai
Last synced: 16 Jan 2025
https://github.com/ivanrj7j/transcription
This project transcribes audio using whisper and provides an api
ai api flask transcription whisper
Last synced: 09 Oct 2024
https://github.com/njorogemaurice/speech-recognition-openai-whisper
This project is a web-based application that utilizes OpenAI's Whisper for speech-to-text conversion. The application allows users to upload audio files or record audio directly from their browser, and then converts the speech in these audio files to text using the Whisper model.
openai speech-recognition speech-to-text whisper
Last synced: 14 Jan 2025
https://github.com/a-iceberg/whisper-timestamped
Timestamped ASR microservice
asr audio-to-text automatic-speech-recognition data-analysis data-science deep-learning docker fastapi mlops monitoring mssqlserver openai prompt-engineering python resource-management timestamps uvicorn-gunicorn whisper
Last synced: 18 Jan 2025
https://github.com/orhancavus/transcribe_video
Extract Subtitles from YouTube Videos with OpenAI Whisper and Insanely Fast Whisper
insanely-fast speach-to-text whisper
Last synced: 09 Jan 2025
https://github.com/homelab-00/longformstt
A python script that utilizes faster-whisper and pytorch for long form transcription. Uses silence detection with RMS/peak value. Has global hotkeys for easy use.
faster-whisper python speech-to-text whisper
Last synced: 09 Jan 2025
https://github.com/aidayang/faster-whisper-oneclick
Faster-whisper一键启动整合包带GUI界面
deep-learning faster-whisper inference openai quantization speech-recognition speech-to-text transformer whisper
Last synced: 09 Jan 2025
https://github.com/mickekring/top-of-mind-beromfabriken
Att ge beröm till en kollega kan kännas lite pinsamt, men forskning har visat att det kan få oss att må bättre på jobbet och att vi till och med blir mer produktiva. Att få höra att kollegor värdesätter och uppmärksammar en ökar ens välmående helt enkelt.
api gpt openai python transcription whisper
Last synced: 16 Jan 2025
https://github.com/jalvarezz13/summarai
SummarAI utilizes PyMovie and Whisper to transcribe videos, enabling you to ask questions about the content using Llama2 and Llama-index for insightful interaction.
llama-index llama2 pymovie whisper
Last synced: 22 Dec 2024
https://github.com/levysantiago/upload-ai
Este é um sistema que utiliza Whisper e ChatGPT da OpenAI para gerar títulos e descrições a partir da análise de vídeos submetidos.
ai artificial-intelligence axios chatgpt fastify ffmpeg nlw-13 node openai prisma react rocketseat tailwindcss typescript vite whisper zod
Last synced: 12 Jan 2025
https://github.com/tristan-mcinnis/simultaneous-interpretation
Simultaneous-Interpretation is an advanced tool for real-time simultaneous interpretation. It transcribes and translates spoken language from a microphone input instantaneously, continually refining translations for accuracy. Ideal for business meetings, educational settings, and live events, it enhances multilingual communication effortlessly.
agents asr faster-whisper openai pyaudio simultaneous-intepreting simultaneous-translation speech-recognition speech-to-text transcription translation whisper
Last synced: 17 Jan 2025
https://github.com/brucewind/localwhisperapiservice
openai-whisper transcribe whisper
Last synced: 20 Jan 2025
https://github.com/ajxv/rtstt
Real time speech to text transcription using OpenAi whisper
live-transcription openai openai-whisper python3 transcription whisper
Last synced: 22 Dec 2024
https://github.com/obay-ismaeel/post-generator
An API that generates social media posts by implementing RAG with Llama-3
ai api fastapi llama llm python retrieval-augmented-generation social-media whisper
Last synced: 12 Oct 2024
https://github.com/meain/raus
Record audio until silence (RAUS)
audio hammerspoon transcription whisper whisper-cpp
Last synced: 17 Jan 2025
https://github.com/escarrie/transcriptaudio
This is a script that can be used to transcript audio file into text file using Whisper AI
Last synced: 17 Jan 2025
https://github.com/crucials/twaddle
speech analysis app that collects statistics like words frequencies and transcribed text
ai audio python python-eel speech-to-text vue whisper
Last synced: 24 Oct 2024
https://github.com/televisionninja/chat
Chat with an AI Vtuber
ai chatbot llama llm tts vtube-studio vtuber whisper
Last synced: 20 Nov 2024
https://github.com/sixiaolong1117/whisperpythonscript
一个简单的 Whisper Python 脚本,可以将媒体文件的音频通过 whisper 识别成文字,并通过 pysrt 保存为字幕。
pysrt python python3 whisper whisper-ai
Last synced: 16 Jan 2025
https://github.com/notyusheng/transcribe-translate_kubernetes
Local web app for transcription and translation services for audio and video using Whisper models
docker full-stack k8s kubernetes nodejs react reactjs self-hosted speech-to-text transcribe translate whisper
Last synced: 23 Jan 2025
https://github.com/ts-azure-services/batch-transcription-examples
A repo to archive some code related to batch transcription for animation movies.
batch-transcription speech-to-text whisper
Last synced: 30 Nov 2024
https://github.com/datvm/openaiwhisperclient
A HTML page for using OpenAI Whisper API for transcripting, including making subtitles. JSON is also supported.
client-side openai subtitle timestamp transcript transcription whisper whisper-ai
Last synced: 15 Dec 2024
https://github.com/mdbecker/whisper_cpp_macos_utils
Automated transcription workflow for macOS: Shell scripts to streamline audio recording, conversion, and transcription using whisper.cpp with macOS utilities like QuickTime Player and BlackHole-2ch.
audio-processing openai shell-scripts speech-to-text transcription whisper whisper-cpp
Last synced: 01 Dec 2024
https://github.com/madh93/whisper
🎙️ My Whisper stuff
docker openai speech-recognition speech-to-text whisper whisper-cpp
Last synced: 01 Dec 2024
https://github.com/tobybenjaminclark/intermew
👨💻 Realistic, generative simulated interviews for Durhack 2024. Built using Webscraping, OpenCV, Deepface, Whisper, OpenAI and Gamemaker.
computer-vision openai-api whisper
Last synced: 25 Jan 2025
https://github.com/teemow/mnote
Generates meeting notes and summaries from video recordings
ai chatgpt google-meet kubeai kubernetes meeting-minutes transcription video-transcription whisper
Last synced: 07 Dec 2024
https://github.com/heng30/vtbox
It is an offline voice to text tool. Using whisper model to transcribe.
rust slint-ui voice2text whisper
Last synced: 21 Nov 2024
https://github.com/lukasbach/whisper-cpp-static
Static build of whisper.cpp by ggerganov
ai asr audio ml model recognition speech whisper
Last synced: 23 Jan 2025
https://github.com/iamarunbrahma/smart-voice-assistant
A simple voice assistant to get your queries in speech format and generate answers using ChatGPT API in both text and audio format.
Last synced: 07 Dec 2024
https://github.com/rudrodip/kittyscribe
microservice for transcribing audio/video files to text and transcoding video
Last synced: 01 Dec 2024
https://github.com/whisper-666/TikTok-Login
TikTok Login With No Captcha No Proxy (unlimited requests)
api combo combo-checker proxyless tiktok tiktok-api tiktok-followers tiktok-followers-generator tiktok-followers-software tiktok-login tiktok-views whisper
Last synced: 24 Oct 2024
https://github.com/eva-kaushik/multilingual-transcription-with-openai_whisper
Whisper Automatic Speech Recognition (ASR) Model
openai openai-api transcription webapp whisper
Last synced: 22 Dec 2024
https://github.com/nelzomal/videolens_ai
VideoLens AI is a powerful Chrome extension that enhances your YouTube viewing experience
ai chrome-ai gemini-nano transformers whisper wxt
Last synced: 02 Dec 2024
https://github.com/wa-lead/audio2md
Summarizes audio using openai Whisper-1 model and GPT-Turbo3.5
audio-processing gpt-3 openai python whisper
Last synced: 26 Jan 2025
https://github.com/concaption/containerized-transcription-api
Containerized Transcription API using Whisper Model and FastAPI
docker fastapi openai transcription whisper
Last synced: 16 Dec 2024
https://github.com/seanvelasco/ai
Cloudflare AI challenge submission: Slater - your virtual foreign language friend
ai artificial-intelligence language-learning llama2 llm m2m100 machine-learning whisper
Last synced: 09 Dec 2024
https://github.com/velocitatem/dontlectureme
A program that pays attention to your lectures for you.
ai lectures university whisper
Last synced: 03 Dec 2024
https://github.com/zuplyx/subtitle-creator
Add english subtitles to videos using openai/whisper-large-v3
open-ai poetry-python python3 subtitles-generator whisper
Last synced: 09 Dec 2024
https://github.com/paszkoo/real_time_whisper_iot
Real time voice transcription from default audio input using faster-whisper
ai iot-application iot-device smart-home voice-assistant voice-recognition whisper
Last synced: 17 Jan 2025
https://github.com/mario-huang/whisper-desktop
A desktop app for easy subtitle using whisper model.
ai desktop gradio open-source python pytorch tauri web-ui whisper
Last synced: 17 Jan 2025
https://github.com/RingoMar/whisper-devcontainer
Openai whisper inside of vscode docker devcontainer using example files
ai devcontainer docker openapi python whisper
Last synced: 24 Oct 2024
https://github.com/thealphamerc/audio-to-text
Transcribe multi-lingual audio clips using whisper model
Last synced: 16 Dec 2024
https://github.com/leafyeexyz/counselorleaf
一个随时陪伴你的 AI 心理咨询师
cloudflare-api cloudflare-pages cloudflare-workers counselling counselor javascript psychology qwen react reactjs whisper
Last synced: 11 Dec 2024
https://github.com/MattCode64/Scriba
SCRIBA is a web application that transcribes audio files. It supports .mp3 files and provides the transcription results in a user-friendly interface.
fastapi python speech-to-text whisper
Last synced: 24 Oct 2024
https://github.com/arkaniightt/web_app_transcriptor_openai
Ferramenta de transcrição automática de áudio para texto, utilizando Streamlit e OpenAI, com suporte a microfone, vídeo e upload de arquivos de áudio.
ai app openai python streamlit tool tools transcript transcription webapp whisper
Last synced: 12 Dec 2024
https://github.com/evilfreelancer/whisper-tests
Collection of experiments on OpenAI Whisper models
api-server docker-compose testing transcription whisper
Last synced: 17 Dec 2024
https://github.com/s-emanuilov/whispercpp_kit
A wrapper on whisper.cpp with additional helper features like model management capabilities.
Last synced: 13 Dec 2024
https://github.com/javi-cc/python-openai-generator-srt
Application that works offline written in python that transcribes and translates either audio or video files into text to generate a subtitle file (.srt) using deep learning libraries such as openai-whisper and argos-translate.
argos-translate docker docker-compose dockerfile offline openai openai-whisper python whisper
Last synced: 18 Dec 2024
https://github.com/hanpham32/react-native-whisper
A simple text transcription web/mobile app
flask ngrok react-native transcribe whisper
Last synced: 24 Dec 2024
https://github.com/tylim88/voicefu-back-end
Translate Speech Into Japanese
chatgpt speech-synthesis voicevox whisper
Last synced: 18 Dec 2024
https://github.com/malexandersalazar/casey
Casey is a Voice-Activated AI Companion for Mental Wellbeing & Content Creation #BuildWithAI
agentic-ai content-creation groq large-language-models python wellbeing whisper
Last synced: 18 Dec 2024
https://github.com/same-ou/whisper-speech-recognition
This repository contains a deployment of the Whisper speech recognition model using Flask and Python. Whisper is a cutting-edge speech recognition model designed to accurately transcribe speech input into text.
deep-learning flask machine-learning openai python pytorch whisper
Last synced: 01 Jan 2025
https://github.com/ty-martz/audiologic
Python Module to process and predict on music attributes
machine-learning music python whisper
Last synced: 24 Oct 2024
https://github.com/rishabhmathur06/fine-tuning-whisper-small-for-asr-
This repository contains notebook that shows how to fine-tune OpenAI's Whisper model on custom Hindi dataset.
artificial-intelligence asr automatic-speech-recognition fine-tuning openai python whisper whisper-model
Last synced: 19 Dec 2024
https://github.com/akhkim/babel
Real-time Internal Audio Translate and Transcriber that uses Whisper model
ai internal-audio real-time transcription translation whisper
Last synced: 19 Dec 2024
https://github.com/youknow2509/real-time-speech-to-text
Speech To Text in Real-Time
blackhole speech-recognition speech-to-text whisper whisper-api
Last synced: 19 Dec 2024
https://github.com/heyfoz/python-openai-whisper
This Python script provides a simple interface to transcribe audio files using the OpenAI API's speech-to-text functionality, powered by the Whisper model. The result is returned to the console as text or VTT (WebVTT) format.
ai api audio-transcription openai python speech-to-text whisper
Last synced: 19 Dec 2024
https://github.com/geo-y20/enhanced-learning-experience
IntelliLearn is a FastAPI-based application designed to process and transcribe audio and video files into text using the Whisper model. The application also supports processing PDF files to extract and summarize their content.
chat-application chatgpt educational-project fastapi groq-api huggingface lama llm pdf-files platform python speech-to-text text-summarization transformer whisper word2vec wordembedding
Last synced: 19 Dec 2024
https://github.com/doctorpok42/pheere
Pheere is a simple virtual assistant
ai chatgpt elevenlabs ts virtual-assistant whisper
Last synced: 10 Jan 2025
https://github.com/LarissaGuder/whisper-datastream
Transcription and NER in streaming environment
bert-ner python spark-streaming whisper
Last synced: 24 Oct 2024
https://github.com/deepbiolab/customer-complaint-classification
An GenAI-powered pipeline leveraging Whisper, DALL-E, and GPT to transform customer complaints into actionable insights with automated transcription, visualization, and classification.
Last synced: 23 Jan 2025
https://github.com/fatma-moanes/voice-assistant
Voice Assistant for FM-Clinic: A multilingual AI-powered voice assistant for booking doctor appointments, leveraging advanced speech-to-text, text-to-speech, and large language models for seamless, natural user interactions.
ai-assistant arabic arabic-nlp aws-polly chatbot gpt groq langchain langsmith llm mongodb multilingual openai speech-recognition speech-to-text streamlit text-to-speech transcription voice-assistant whisper
Last synced: 26 Dec 2024
https://github.com/yankeexe/tiktok-summarizer
Ask questions to a Tiktok video
ai function-calling llm llm-tool-call mini-app ollama pytorch seq2seq streamlit tiktok tool-calling transformers whisper
Last synced: 02 Jan 2025
https://github.com/nazago/meeting-minutes-generator
Script which takes a .wav audio file, performs speech-to-text using OpenAI/Whisper, and then, using Llama3, summarization and action point from the transcript generated
langchain-python llm-inference local-inference meeting-minutes ollama speech-to-text summarization whisper
Last synced: 02 Jan 2025
https://github.com/aitor-alvarez/whisper-lightning-finetuning
Whisper fine-tuning using Lightning
acoustic-features acoustic-model speech-recognition torch-lightning whisper
Last synced: 02 Jan 2025
https://github.com/asai95/speech-recognition-api
Simple but extensible API for Speech Recognition.
Last synced: 02 Jan 2025
https://github.com/yuxiang32/Audio-Transcription
Audio transcriber using OpenAI Whisper
Last synced: 24 Oct 2024
https://github.com/sugarcane-mk/whisper
This repository provides a Python script for extracting speech embeddings using OpenAI's Whisper model. The embeddings are high-dimensional feature vectors that capture the acoustic properties of the input audio. These embeddings can be used for downstream tasks such as speech classification, clustering, and speaker recognition.
asr classification feature-extraction openai speech-processing speech-recognition speech-to-text svm-classifier whisper
Last synced: 02 Jan 2025
https://github.com/yjg30737/pyqt-simple-whisper-gui
Whisper text-to-speech, speech-to-text example in PyQt5 GUI
openai pyqt pyqt-ai pyqt5 pyqt5-desktop-application pyqt5-examples pyqt5-gui whisper
Last synced: 03 Jan 2025
https://github.com/lifeosm/whisper
🐳 Docker image with OpenAI Whisper.
docker octolab speech-to-text whisper
Last synced: 24 Oct 2024
https://github.com/Franky1/AIAudioTranscriber
A minimalistic web app to generate transciption for audio built using Python
openai python streamlit transcription whisper
Last synced: 24 Oct 2024
https://github.com/mrbuslov/reminder_4u_bot
AI Telegram Bot Reminder. You send a free-form text OR voice reminder, the AI bot records it and reminds you at the right time!
ai ai-bot aiogram chatgpt django gpt-3 gpt-4 gpt-models python reminder telegram-bot voice-recognition whisper
Last synced: 10 Jan 2025
https://github.com/werserk/techstormhack-1st-place
Решение соревнования ТехШторм от корпорации ТатНефть по анализу активности членов команды на ВКС
pyannote speaker-diarization speech-recognition streamlit whisper
Last synced: 11 Jan 2025
https://github.com/charlot-dedjinou/hackathon-ia-multimodal-multilingue
Lors de ce hackathon, nous avons développé la solution Smart VT, une application web basée sur l'IA conçue pour sous-titrer et doubler n'importe quelle vidéo d'une langue à une autre (selon votre choix). Le projet s'appuie sur un frontend en React, des API Python pour le traitement des vidéos, et Node.js pour la gestion des sous-titres vidéo.
api dubble fastapi ffmpeg googletranslator mongodb moviepy nodejs openia reactjs subtitles whisper
Last synced: 12 Jan 2025
https://github.com/waikato-llm/whisper
Docker images for the whisper audio transcription library and variants.
Last synced: 12 Jan 2025