Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Whisper
Whisper is an autoregressive language model developed by OpenAI. It is trained on a large corpus of text using a transformer architecture and is capable of generating high-quality natural language text. Whisper can be used for tasks such as language modeling, text completion, and text generation. It has shown impressive performance on various benchmarks and has been released by OpenAI to encourage research in the field of language modeling. Whisper is not yet available for public use, but it has the potential to transform the field of natural language processing and generate new opportunities for language-based applications.
- GitHub: https://github.com/topics/whisper
- Repo: https://github.com/openai/whisper
- Created by: OpenAI
- Released: August 2021
- Related Topics: machine-learning, artificial-intelligence, language-modeling,
- Last updated: 2025-01-11 00:26:03 UTC
- JSON Representation
https://github.com/deshwalmahesh/whisper-fastapi-realtime
It is Front + Backend app that uses openai/whisper-large-v3-turbo in your consumer grade system to provide real live audio transcription
audio-transcription fastapi huggingface live pyaudio realtime transcription transformers whisper whisper-large
Last synced: 25 Oct 2024
https://github.com/chloelavrat/speech-to-text-app
Speech to text web app based on Streamlit and whisper that extract script for audio or youtube video.
audio-processing machine-learning machinelearning speech-to-text streamlit streamlit-webapp stt whisper whisper-ai
Last synced: 02 Jan 2025
https://github.com/waikato-llm/whisper
Docker images for the whisper audio transcription library and variants.
Last synced: 13 Nov 2024
https://github.com/notyusheng/transcribe-translate_kubernetes
Local web app for transcription and translation services for audio and video using Whisper models
docker full-stack k8s kubernetes nodejs react reactjs self-hosted speech-to-text transcribe translate whisper
Last synced: 22 Nov 2024
https://github.com/ivanrj7j/transcription
This project transcribes audio using whisper and provides an api
ai api flask transcription whisper
Last synced: 09 Oct 2024
https://github.com/fatma-moanes/voice-assistant
Voice Assistant for FM-Clinic: A multilingual AI-powered voice assistant for booking doctor appointments, leveraging advanced speech-to-text, text-to-speech, and large language models for seamless, natural user interactions.
ai-assistant arabic arabic-nlp aws-polly chatbot gpt groq langchain langsmith llm mongodb multilingual openai speech-recognition speech-to-text streamlit text-to-speech transcription voice-assistant whisper
Last synced: 26 Dec 2024
https://github.com/aidayang/faster-whisper-oneclick
Faster-whisper一键启动整合包带GUI界面
deep-learning faster-whisper inference openai quantization speech-recognition speech-to-text transformer whisper
Last synced: 09 Jan 2025
https://github.com/homelab-00/longformstt
A python script that utilizes faster-whisper and pytorch for long form transcription. Uses silence detection with RMS/peak value. Has global hotkeys for easy use.
faster-whisper python speech-to-text whisper
Last synced: 09 Jan 2025
https://github.com/sbadulin/obsidian-dictation-plugin
Obsidian dictation plugin
dictation gpt-35-turbo obsidian obsidian-plugin openai speech-to-text whisper
Last synced: 07 Dec 2024
https://github.com/zahidhasann88/video-summarizer
A videos by extracting audio and generating summaries based on the audio content.
nodejs openai typescript whisper
Last synced: 07 Jan 2025
https://github.com/firefly55lm/bisbigliatorev2
Automatic audio transcriber notebook based on Whisper
colab-notebook speech-to-text whisper
Last synced: 25 Nov 2024
https://github.com/danibcorr/university-helper
🧑🎓 University Helper streamlines academic and administrative tasks for students, educators, and researchers. It provides tools for managing document metadata, converting PDFs to Markdown, transcribing audio, analyzing grade statistics, and more.
deep-learning documentation-tool metadata ocr open-source pdf python statistics university whisper
Last synced: 19 Dec 2024
https://github.com/mariatepei/vt_thesis_mtepei
This repository accompanies my MSc Thesis for the degree Voice Technology, storing all referenced data and other relevant resources.
data-augmentation fastspeech2 speech-recognition whisper
Last synced: 09 Oct 2024
https://github.com/luizcalaca/transcricao-medica
Full Stack + Whisper Transcription + Node.js REST API + VITE + React.js + Railway deploy
full-stack nodejs openai openai-api railway reactjs sequelize sequelize-orm vite whisper whisper-ai
Last synced: 25 Nov 2024
https://github.com/orhancavus/transcribe_video
Extract Subtitles from YouTube Videos with OpenAI Whisper and Insanely Fast Whisper
insanely-fast speach-to-text whisper
Last synced: 09 Jan 2025
https://github.com/yankeexe/tiktok-summarizer
Ask questions to a Tiktok video
ai function-calling llm llm-tool-call mini-app ollama pytorch seq2seq streamlit tiktok tool-calling transformers whisper
Last synced: 02 Jan 2025
https://github.com/rudrodip/kittyscribe
microservice for transcribing audio/video files to text and transcoding video
Last synced: 01 Dec 2024
https://github.com/brunogaliati/speech2text-investments
This project automates the download, transcription, and summarization of audio from YouTube videos. Using OpenAI's Whisper model, it converts video content into concise text summaries with an investment analyst's perspective, ideal for professionals needing quick insights.
chatgpt investment openai politics python speech-recognition speech-to-text whisper
Last synced: 19 Dec 2024
https://github.com/vlazic/json-verbose-to-vtt-converter
Transform `json_verbose` transcriptions from OpenAI, Groq, or command-line tools into VTT files with this Deno converter.
converter groq json json-verbose openai vtt webvtt whisper
Last synced: 26 Nov 2024
https://github.com/huuquyet/phowhisper-small
Converted clone of PhoWhisper: Automatic Speech Recognition for Vietnamese (2024)
onnx-models phowhisper speech-recognition transformersjs vietnamese vinai whisper
Last synced: 06 Dec 2024
https://github.com/studiowebux/tommygotchi
whisper, piper, llama-gpt, python, fun .. so much fun !
llama-gpt piper python3 whisper whisper-ai
Last synced: 05 Jan 2025
https://github.com/nazago/meeting-minutes-generator
Script which takes a .wav audio file, performs speech-to-text using OpenAI/Whisper, and then, using Llama3, summarization and action point from the transcript generated
langchain-python llm-inference local-inference meeting-minutes ollama speech-to-text summarization whisper
Last synced: 02 Jan 2025
https://github.com/jgw96/speech-to-text-web-toolkit
Making Speech-To-Text on the web easy, both local and in the cloud
ai lit transformersjs webcomponents whisper
Last synced: 06 Dec 2024
https://github.com/eva-kaushik/multilingual-transcription-with-openai_whisper
Whisper Automatic Speech Recognition (ASR) Model
openai openai-api transcription webapp whisper
Last synced: 22 Dec 2024
https://github.com/microsoft/azure-ai-foundry-whatsapp-bot
WhatsApp Bot built with Azure Functions and Azure AI Foundry, using Python.
azure-ai-foundry azure-functions azure-openai python whatsapp-api whatsapp-bot whisper
Last synced: 27 Nov 2024
https://github.com/aitor-alvarez/whisper-lightning-finetuning
Whisper fine-tuning using Lightning
acoustic-features acoustic-model speech-recognition torch-lightning whisper
Last synced: 02 Jan 2025
https://gitlab.com/ifrz/asr-multi-lite
Testing of the main ASR frameworks with reduced models for low-resource languages speech recognition
Last synced: 24 Oct 2024
https://github.com/nelzomal/videolens_ai
VideoLens AI is a powerful Chrome extension that enhances your YouTube viewing experience
ai chrome-ai gemini-nano transformers whisper wxt
Last synced: 02 Dec 2024
https://github.com/barrylee111/voicechat-llm
A chatbot with both prompt and voicechat capabilities leveraging LangChain, Elasticsearch, and FastAPI. When using voicechat, the user can immerse themselves in the experience by selecting a narrator, like a pirate for instance.
elasticsearch fastapi langchain largelanguagemodel python react speech-to-text tailwind text-to-speech typescript websocket whisper
Last synced: 19 Dec 2024
https://github.com/asai95/speech-recognition-api
Simple but extensible API for Speech Recognition.
Last synced: 02 Jan 2025
https://github.com/ty-martz/audiologic
Python Module to process and predict on music attributes
machine-learning music python whisper
Last synced: 24 Oct 2024
https://github.com/jalvarezz13/summarai
SummarAI utilizes PyMovie and Whisper to transcribe videos, enabling you to ask questions about the content using Llama2 and Llama-index for insightful interaction.
llama-index llama2 pymovie whisper
Last synced: 22 Dec 2024
https://github.com/pawelzeja098/whisper-video-transcription
Testing whisper Open-AI to transcribe videos
mp4 transcription whisper whisper-ai
Last synced: 28 Nov 2024
https://github.com/pkarpovich/kira-client
An AI-powered voice automation tool for IoT, integrating voice-triggered commands, OpenAI-driven intent recognition, and HTTP server management for seamless control of smart devices
ai-assistant intent-classification porcupine trigger-word-detection whisper
Last synced: 14 Nov 2024
https://github.com/man2dev/whisper-cpp
dev fork of https://src.fedoraproject.org/rpms/whisper-cpp
fedora fedora-repository linux whisper whisper-cpp whispercpp
Last synced: 09 Oct 2024
https://github.com/concaption/containerized-transcription-api
Containerized Transcription API using Whisper Model and FastAPI
docker fastapi openai transcription whisper
Last synced: 16 Dec 2024
https://github.com/sudiptab2100/waku-user-chat
Waku Chat using Usernames
communication-protocol decentralised-application decentralized ethereum ipfs libp2p waku waku-connect web3 whisper zk-snarks zkp
Last synced: 20 Dec 2024
https://github.com/philogicae/docker-faster-whisper-fr-api
Docker - Faster Whisper FR - RunPod Serverless API
ctranslate2 docker faster-whisper french runpod serverless whisper
Last synced: 08 Jan 2025
https://github.com/fkiller/whispertranscript
Transcribe voice from mic input using OpenAI Whisper API.
llm openai transcribe transcript transcription webaudio whisper
Last synced: 06 Jan 2025
https://github.com/bluebirdback/groq-subtitles
Batch video subtitle generation using Groq Whisper API
groq speech-to-text subtitles video whisper
Last synced: 21 Dec 2024
https://github.com/seanvelasco/ai
Cloudflare AI challenge submission: Slater - your virtual foreign language friend
ai artificial-intelligence language-learning llama2 llm m2m100 machine-learning whisper
Last synced: 09 Dec 2024
https://github.com/userpjm/whisper-youtube
Generate a SubRip subtitle file (srt) using Whisper for the audio of a YouTube video.
faster-whisper openai speech-to-text whisper
Last synced: 24 Oct 2024
https://github.com/velocitatem/dontlectureme
A program that pays attention to your lectures for you.
ai lectures university whisper
Last synced: 03 Dec 2024
https://github.com/brucewind/localwhisperapiservice
openai-whisper transcribe whisper
Last synced: 19 Nov 2024
https://github.com/ajxv/rtstt
Real time speech to text transcription using OpenAi whisper
live-transcription openai openai-whisper python3 transcription whisper
Last synced: 22 Dec 2024
https://github.com/xawos/owt
🦙🗣️ Ollama and Whisper Telegram bot, with advanced configuration
ai-bots local-ai ollama telegram-aichatbot telegram-bots whisper
Last synced: 08 Jan 2025
https://github.com/aixerum/faster-whisper
faster-whisper is a reimplementation of OpenAI's Whisper model using CTranslate2, which is a fast inference engine for Transformer models. This implementation is up to 4 times faster than openai/whisper for the same accuracy while using less memory. The efficiency can be further improved with 8-bit quantization on both CPU and GPU.
ctranslate2 gpu transcription whisper
Last synced: 07 Jan 2025
https://github.com/LarissaGuder/whisper-datastream
Transcription and NER in streaming environment
bert-ner python spark-streaming whisper
Last synced: 24 Oct 2024
https://github.com/zuplyx/subtitle-creator
Add english subtitles to videos using openai/whisper-large-v3
open-ai poetry-python python3 subtitles-generator whisper
Last synced: 09 Dec 2024
https://github.com/obay-ismaeel/post-generator
An API that generates social media posts by implementing RAG with Llama-3
ai api fastapi llama llm python retrieval-augmented-generation social-media whisper
Last synced: 12 Oct 2024
https://github.com/crucials/twaddle
speech analysis app that collects statistics like words frequencies and transcribed text
ai audio python python-eel speech-to-text vue whisper
Last synced: 24 Oct 2024
https://github.com/khushijtrivedi/speech
The Assistive Speech Technology System is designed to enhance communication by analyzing and processing various speech and audio inputs.
ajax bigru-crf bootstrap flask flask-server html-css-javascript librosa python restapi-framework voice-recognition whisper
Last synced: 09 Oct 2024
https://github.com/tomdewildt/whisper-experiment
Experiments using the Whisper model from Open AI
colab jupyter python transcribe transformers translate whisper
Last synced: 27 Dec 2024
https://github.com/lazauk/aoai-entraidauth-sdkv1
Authenticating with Entra ID (former Azure AD) to access Azure OpenAI models in Python SDK v1.x
ai authentication azure azure-active-directory dall-e embeddings entra-id gpt openai whisper
Last synced: 13 Nov 2024
https://github.com/flo-bit/youtube-speaker-separation
simple python script that outputs separate audio files for each speaker in a youtube video, using whisper on replicate
speaker-diarization speech-to-text text-to-speech voice-cloning whisper youtube
Last synced: 19 Dec 2024
https://github.com/yuxiang32/Audio-Transcription
Audio transcriber using OpenAI Whisper
Last synced: 24 Oct 2024
https://github.com/sugarcane-mk/whisper
This repository provides a Python script for extracting speech embeddings using OpenAI's Whisper model. The embeddings are high-dimensional feature vectors that capture the acoustic properties of the input audio. These embeddings can be used for downstream tasks such as speech classification, clustering, and speaker recognition.
asr classification feature-extraction openai speech-processing speech-recognition speech-to-text svm-classifier whisper
Last synced: 02 Jan 2025
https://github.com/thealphamerc/audio-to-text
Transcribe multi-lingual audio clips using whisper model
Last synced: 16 Dec 2024
https://github.com/cnseniorious000/dl-a2t
download, audio-to-text PyPI: https://pypi.org/p/dl-a2t
audio transcription whisper youtube
Last synced: 02 Jan 2025
https://github.com/leafyeexyz/counselorleaf
一个随时陪伴你的 AI 心理咨询师
cloudflare-api cloudflare-pages cloudflare-workers counselling counselor javascript psychology qwen react reactjs whisper
Last synced: 11 Dec 2024
https://github.com/felipecastrosales/scripts
List of useful scripts.
audio helper-functions helpers ia pip python python3 script scripts video whisper whisper-ai
Last synced: 22 Dec 2024
https://github.com/charlot-dedjinou/hackathon-ia-multimodal-multilingue
Lors de ce hackathon, nous avons développé la solution Smart VT, une application web basée sur l'IA conçue pour sous-titrer et doubler n'importe quelle vidéo d'une langue à une autre (selon votre choix). Le projet s'appuie sur un frontend en React, des API Python pour le traitement des vidéos, et Node.js pour la gestion des sous-titres vidéo.
api dubble fastapi ffmpeg googletranslator mongodb moviepy nodejs openia reactjs subtitles whisper
Last synced: 12 Jan 2025
https://github.com/arkaniightt/web_app_transcriptor_openai
Ferramenta de transcrição automática de áudio para texto, utilizando Streamlit e OpenAI, com suporte a microfone, vídeo e upload de arquivos de áudio.
ai app openai python streamlit tool tools transcript transcription webapp whisper
Last synced: 12 Dec 2024
https://github.com/yjg30737/pyqt-simple-whisper-gui
Whisper text-to-speech, speech-to-text example in PyQt5 GUI
openai pyqt pyqt-ai pyqt5 pyqt5-desktop-application pyqt5-examples pyqt5-gui whisper
Last synced: 03 Jan 2025
https://github.com/electroneum/electroneum-web3.js
Electroneum SmartChain JavaScript API
api electroneum ethereum etn-sc javascript swarm typescript whisper
Last synced: 26 Sep 2024
https://github.com/evilfreelancer/whisper-tests
Collection of experiments on OpenAI Whisper models
api-server docker-compose testing transcription whisper
Last synced: 17 Dec 2024
https://github.com/hydrol0x/retriever
A new aid for the visually impaired powered by AI
elevenlabs llm palm visual-impairment-aid whisper
Last synced: 14 Nov 2024
https://github.com/lifeosm/whisper
🐳 Docker image with OpenAI Whisper.
docker octolab speech-to-text whisper
Last synced: 24 Oct 2024
https://github.com/s-emanuilov/whispercpp_kit
A wrapper on whisper.cpp with additional helper features like model management capabilities.
Last synced: 13 Dec 2024
https://github.com/Franky1/AIAudioTranscriber
A minimalistic web app to generate transciption for audio built using Python
openai python streamlit transcription whisper
Last synced: 24 Oct 2024
https://github.com/zdwolfe/transcription-tools
Docker video transcriber, wrapper around OpenAI
openai transcription whisper whisper-ai
Last synced: 02 Jan 2025
https://github.com/willdphan/little-jarvis-whisper
Jarvis, a GPT Voice Assistant made with speech recognition, OpenAI's Whisper, and Gradio
gradio openai voice-assistant voice-recognition whisper
Last synced: 24 Oct 2024
https://github.com/mrbuslov/reminder_4u_bot
AI Telegram Bot Reminder. You send a free-form text OR voice reminder, the AI bot records it and reminds you at the right time!
ai ai-bot aiogram chatgpt django gpt-3 gpt-4 gpt-models python reminder telegram-bot voice-recognition whisper
Last synced: 10 Jan 2025
https://github.com/javi-cc/python-openai-generator-srt
Application that works offline written in python that transcribes and translates either audio or video files into text to generate a subtitle file (.srt) using deep learning libraries such as openai-whisper and argos-translate.
argos-translate docker docker-compose dockerfile offline openai openai-whisper python whisper
Last synced: 18 Dec 2024
https://github.com/hanpham32/react-native-whisper
A simple text transcription web/mobile app
flask ngrok react-native transcribe whisper
Last synced: 24 Dec 2024
https://github.com/tylim88/voicefu-back-end
Translate Speech Into Japanese
chatgpt speech-synthesis voicevox whisper
Last synced: 18 Dec 2024
https://github.com/luluw8071/whisper-tune
Finetuning Whisper on your own voice
Last synced: 14 Dec 2024
https://github.com/MattCode64/Scriba_Front
SCRIBA is a web application that transcribes audio files. It supports .mp3 files and provides the transcription results in a user-friendly interface.
speech-to-text vite vue vuejs whisper
Last synced: 24 Oct 2024
https://github.com/saamerm/whisperkit-ios15
iOS 15 - On-device Inference of Whisper Speech Recognition Models for Apple Silicon
ios ios15 swiftui whisper whisper-ai
Last synced: 26 Sep 2024
https://github.com/kristofferv98/whisper_turboapi
An optimized FastAPI server for OpenAI's Whisper whisper-large-v3-turbo model using MLX turbo optimization
ai api asynchronous audio audio-processing fastapi huggingface machine-learning macos mlx model-serving nlp openai optimization python speech-to-text synchronous transcription whisper whisper-turbo
Last synced: 14 Dec 2024
https://github.com/malexandersalazar/casey
Casey is a Voice-Activated AI Companion for Mental Wellbeing & Content Creation #BuildWithAI
agentic-ai content-creation groq large-language-models python wellbeing whisper
Last synced: 18 Dec 2024
https://github.com/jplhughes/whisper_logit_lens
This Alignment Jam Hackathon project explores whether the concept of "logit lens" applies to the encoder and decoder layers in Whisper, an end-to-end speech recognition model.
alignment-jam asr interpretability interpretability-jam logitlens whisper
Last synced: 24 Oct 2024
https://github.com/kitschpatrol/ambient-novel
An interface for nonlinear interactive exploration of a novel.
ambient book fiction interactive novel svelte whisper
Last synced: 19 Nov 2024
https://github.com/same-ou/whisper-speech-recognition
This repository contains a deployment of the Whisper speech recognition model using Flask and Python. Whisper is a cutting-edge speech recognition model designed to accurately transcribe speech input into text.
deep-learning flask machine-learning openai python pytorch whisper
Last synced: 01 Jan 2025
https://github.com/valkryst/whisper_automations
Various scripts for automating tasks using OpenAI's Whisper.
automation openai subtitle subtitle-generator transcription translation whisper
Last synced: 26 Dec 2024
https://github.com/whisper-666/TikTok-Login
TikTok Login With No Captcha No Proxy (unlimited requests)
api combo combo-checker proxyless tiktok tiktok-api tiktok-followers tiktok-followers-generator tiktok-followers-software tiktok-login tiktok-views whisper
Last synced: 24 Oct 2024
https://github.com/EvilFreelancer/whisper-tests
Collection of experiments on OpenAI Whisper models
api-server docker-compose testing transcription whisper
Last synced: 24 Oct 2024
https://github.com/lukasbach/whisper-cpp-static
Static build of whisper.cpp by ggerganov
ai asr audio ml model recognition speech whisper
Last synced: 22 Nov 2024