Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Whisper
Whisper is an automatic speech recognition (ASR) model developed by OpenAI. It is an encoder-decoder Transformer trained on roughly 680,000 hours of multilingual, multitask audio collected from the web. Whisper can be used for multilingual speech transcription, speech translation into English, and spoken-language identification. The model code and weights are publicly available under the MIT license.
- GitHub: https://github.com/topics/whisper
- Repo: https://github.com/openai/whisper
- Created by: OpenAI
- Released: September 2022
- Related Topics: machine-learning, artificial-intelligence, language-modeling
- Last updated: 2025-01-10 00:26:42 UTC
https://github.com/barrylee111/voicechat-llm
A chatbot with both prompt and voicechat capabilities leveraging LangChain, Elasticsearch, and FastAPI. When using voicechat, the user can immerse themselves in the experience by selecting a narrator, like a pirate for instance.
elasticsearch fastapi langchain largelanguagemodel python react speech-to-text tailwind text-to-speech typescript websocket whisper
Last synced: 19 Dec 2024
https://github.com/brunogaliati/speech2text-investments
This project automates the download, transcription, and summarization of audio from YouTube videos. Using OpenAI's Whisper model, it converts video content into concise text summaries with an investment analyst's perspective, ideal for professionals needing quick insights.
chatgpt investment openai politics python speech-recognition speech-to-text whisper
Last synced: 19 Dec 2024
https://github.com/danibcorr/university-helper
University Helper streamlines academic and administrative tasks for students, educators, and researchers. It provides tools for managing document metadata, converting PDFs to Markdown, transcribing audio, analyzing grade statistics, and more.
deep-learning documentation-tool metadata ocr open-source pdf python statistics university whisper
Last synced: 19 Dec 2024
https://github.com/flaviodelgrosso/whisper-transcriber
Use OpenAI's Whisper to transcribe audio files and diarize the speakers in the transcribed text
ai audio-to-text diarization openai torch whisper
Last synced: 19 Dec 2024
https://github.com/patryk-ku/sasayaki
A small CLI tool that simplifies and automates the process of installing and using AI models to transcribe and translate videos.
automation cli faster-whisper gemini-api transcription translation whisper whisper-cpp
Last synced: 05 Jan 2025
https://github.com/mai-reborn/mai-offline-transcriber
Offline audio/video transcriber using Whisper, saving to .txt or .srt. Ensures privacy, no external servers used.
asr audio-transcription offline-transcriber pyqt6 python speech-recognition video-transcription whisper
Last synced: 05 Jan 2025
https://github.com/egorsmkv/optimized-whisper-intel
Run quantized Whisper models only on CPU with Intel hardware
intel onnx onnxruntime quantized-neural-networks whisper
Last synced: 19 Dec 2024
https://github.com/devgeekm/chat-it-up
Chat It Up! elevates conversations by transforming YouTube URLs, documents, and audio into text, enabling interactive Q&A and summaries. With one click, turn media into time-saving, knowledge-rich dialogues.
ai azure azure-functions azureservices blob-storage fastapi python rag whisper youtube-dl
Last synced: 20 Dec 2024
https://github.com/a-iceberg/whisper_model_evaluator
Compares Whisper, Vosk, and Google transcribers by word error rate (WER), match error rate (MER), and word information lost (WIL)
asr audio-to-text automatic-speech-recognition data-analysis evaluation google-speech-recognition python tuning-parameters visualization vosk whisper
Last synced: 24 Oct 2024
https://github.com/zdwolfe/transcription-tools
Docker video transcriber, wrapper around OpenAI
openai transcription whisper whisper-ai
Last synced: 02 Jan 2025
https://github.com/hsiehbocheng/yt-gen-caption
A project for generating captions for YouTube videos using Faster Whisper and yt_dlp.
Last synced: 19 Dec 2024
https://github.com/ashot72/answering-questions-about-images
You can upload images, ask questions about images using voice prompts, then listen to the responses in voice
answering-questions blip-2-ai-model gtts large-language-models llm replicate speech-to-text text-to-speech whisper
Last synced: 30 Dec 2024
https://github.com/aixerum/faster-whisper
faster-whisper is a reimplementation of OpenAI's Whisper model using CTranslate2, which is a fast inference engine for Transformer models. This implementation is up to 4 times faster than openai/whisper for the same accuracy while using less memory. The efficiency can be further improved with 8-bit quantization on both CPU and GPU.
ctranslate2 gpu transcription whisper
Last synced: 07 Jan 2025
https://github.com/xawos/owt
Ollama and Whisper Telegram bot, with advanced configuration
ai-bots local-ai ollama telegram-aichatbot telegram-bots whisper
Last synced: 08 Jan 2025
https://github.com/chloelavrat/speech-to-text-app
Speech-to-text web app based on Streamlit and Whisper that extracts a script from audio or YouTube videos.
audio-processing machine-learning machinelearning speech-to-text streamlit streamlit-webapp stt whisper whisper-ai
Last synced: 02 Jan 2025
https://github.com/valkryst/whisper_automations
Various scripts for automating tasks using OpenAI's Whisper.
automation openai subtitle subtitle-generator transcription translation whisper
Last synced: 26 Dec 2024
https://github.com/mariatepei/vt_thesis_mtepei
This repository accompanies my MSc Thesis for the degree Voice Technology, storing all referenced data and other relevant resources.
data-augmentation fastspeech2 speech-recognition whisper
Last synced: 09 Oct 2024
https://github.com/felipecastrosales/scripts
List of useful scripts.
audio helper-functions helpers ia pip python python3 script scripts video whisper whisper-ai
Last synced: 22 Dec 2024
https://github.com/dheison0/subcreator
A subtitle creator, translator, and embedder tool made using AI
ai machine-learning ml python subtitles video-processing whisper
Last synced: 09 Oct 2024
https://github.com/ivanrj7j/transcription
This project transcribes audio using whisper and provides an api
ai api flask transcription whisper
Last synced: 09 Oct 2024
https://github.com/zahidhasann88/video-summarizer
Summarizes videos by extracting their audio and generating summaries based on the audio content.
nodejs openai typescript whisper
Last synced: 07 Jan 2025
https://github.com/cnseniorious000/dl-a2t
download, audio-to-text PyPI: https://pypi.org/p/dl-a2t
audio transcription whisper youtube
Last synced: 02 Jan 2025
https://github.com/electroneum/electroneum-web3.js
Electroneum SmartChain JavaScript API
api electroneum ethereum etn-sc javascript swarm typescript whisper
Last synced: 26 Sep 2024
https://github.com/bluebirdback/groq-subtitles
Batch video subtitle generation using Groq Whisper API
groq speech-to-text subtitles video whisper
Last synced: 21 Dec 2024
https://github.com/julrog/jokes-on-you
Storyteller
ggj2024 global-game-jam openai unity whisper
Last synced: 17 Dec 2024
https://github.com/ubos-tech/node-red-contrib-speech-to-text-ubos
Learn how to turn audio into text.
ai low-code lowcode node-red node-red-contrib node-red-flow openai openai-api openai-whisper speech-to-text whisper whisper-ai whisper-api
Last synced: 19 Nov 2024
https://github.com/theaussiepom/wyoming-openai
OpenAI STT and TTS support for the Wyoming protocol
home-assistant home-assistant-assist openai sst tts whisper wyoming
Last synced: 21 Dec 2024
https://github.com/hydrol0x/retriever
A new aid for the visually impaired powered by AI
elevenlabs llm palm visual-impairment-aid whisper
Last synced: 14 Nov 2024
https://github.com/arkapravo-ghosh/speech-to-text
Speech to Text Transcription using OpenAI Whisper v3 and FastAPI
ai fastapi huggingface machine-learning openai python3 speech-to-text transformers whisper
Last synced: 21 Dec 2024
https://github.com/miosipof/whisper_inference
OpenAI Whisper ASR inference on CPU with OpenVino, PyTorch or Huggingface
asr inference machine-learning openvino pytorch whisper
Last synced: 07 Jan 2025
https://github.com/miosipof/asr_train
Fine-tuning OpenAI Whisper for ASR tasks on low-size datasets
asr machine-learning nlp whisper
Last synced: 07 Jan 2025
https://github.com/ekito-station/whisper-api-unity
A sample that performs transcription in Unity using the OpenAI Whisper API
Last synced: 20 Dec 2024
https://github.com/arslanex/whisperdemo
A scalable Python module for robust audio transcription using OpenAI's Whisper model. Supports multiple languages, batch processing, and output formats like JSON and SRT.
audio-processing openai openai-whisper python whisper
Last synced: 23 Nov 2024
https://github.com/vifill/audio-recorder-and-summarizer
This project is a Python script that records system audio on macOS using BlackHole, transcribes the audio using OpenAI's Whisper API, and summarizes the transcription using OpenAI's GPT models
ai audio blackhole gpt openai records summarize system whisper
Last synced: 20 Dec 2024
https://github.com/studiowebux/tommygotchi
whisper, piper, llama-gpt, python, fun .. so much fun !
llama-gpt piper python3 whisper whisper-ai
Last synced: 05 Jan 2025
https://github.com/flyingfathead/youwhisper-cli
A streamlined CLI tool combining `yt-dlp` and `whisperx` (or `openai-whisper`) for quick and efficient audio transcription from various video platforms.
cli cli-app python transcribe transcriber transcription whisper whisper-ai whisperx youtube-downloader yt-dlp yt-dlp-wrapper
Last synced: 12 Nov 2024
https://github.com/status-im/infra-role-status-go
Ansible role for status-go
ansible-role infra waku whisper
Last synced: 05 Jan 2025
https://github.com/philogicae/docker-faster-whisper-fr-api
Docker - Faster Whisper FR - RunPod Serverless API
ctranslate2 docker faster-whisper french runpod serverless whisper
Last synced: 08 Jan 2025
https://github.com/pkarpovich/kira-client
An AI-powered voice automation tool for IoT, integrating voice-triggered commands, OpenAI-driven intent recognition, and HTTP server management for seamless control of smart devices
ai-assistant intent-classification porcupine trigger-word-detection whisper
Last synced: 14 Nov 2024
https://github.com/soenneker/soenneker.libraries.whisper.ctranslate
Simply adds the Whisper_CTranslate2 Windows executable, updated daily (if available)
ai csharp ctranslate ctranslate2 dotnet faster libraries library whisper whisperctranslate
Last synced: 29 Dec 2024
https://github.com/jalvarezz13/summarai
SummarAI utilizes PyMovie and Whisper to transcribe videos, enabling you to ask questions about the content using Llama2 and Llama-index for insightful interaction.
llama-index llama2 pymovie whisper
Last synced: 22 Dec 2024
https://github.com/njorogemaurice/speech-recognition-openai-whisper
This project is a web-based application that utilizes OpenAI's Whisper for speech-to-text conversion. The application allows users to upload audio files or record audio directly from their browser, and then converts the speech in these audio files to text using the Whisper model.
openai speech-recognition speech-to-text whisper
Last synced: 14 Nov 2024
https://github.com/fer14/videoseek
Intelligent video search tool powered by AI
bert timestamp video whisper youtube-api
Last synced: 14 Nov 2024
https://github.com/kitschpatrol/ambient-novel
An interface for nonlinear interactive exploration of a novel.
ambient book fiction interactive novel svelte whisper
Last synced: 19 Nov 2024
https://github.com/charlot-dedjinou/hackathon-ia-multimodal-multilingue
During this hackathon, we developed Smart VT, an AI-based web application designed to subtitle and dub any video from one language into another of your choice. The project relies on a React frontend, Python APIs for video processing, and Node.js for managing video subtitles.
api dubble fastapi ffmpeg googletranslator mongodb moviepy nodejs openia reactjs subtitles whisper
Last synced: 12 Nov 2024
https://github.com/bilelouahmed/vocal-assistant
Python voice assistant (based on SpeechRecognition, Whisper and XTTS models) designed to transcribe speech to text, translate across languages, engage in chat mode, and ultimately respond vocally.
chatbot llm mistral-7b neo4j python rag speech-recognition text-to-speech transcription whisper xtts
Last synced: 21 Dec 2024
https://github.com/huuquyet/phowhisper-tiny
Converted clone of PhoWhisper: Automatic Speech Recognition for Vietnamese (2024)
onnx-models phowhisper speech-recognition transformersjs vietnamese vinai whisper
Last synced: 06 Dec 2024
https://github.com/lazauk/aoai-entraidauth-sdkv1
Authenticating with Entra ID (former Azure AD) to access Azure OpenAI models in Python SDK v1.x
ai authentication azure azure-active-directory dall-e embeddings entra-id gpt openai whisper
Last synced: 13 Nov 2024
https://github.com/orhancavus/transcribe_video
Extract Subtitles from YouTube Videos with OpenAI Whisper and Insanely Fast Whisper
insanely-fast speach-to-text whisper
Last synced: 09 Jan 2025
https://github.com/deepbiolab/customer-complaint-classification
A GenAI-powered pipeline leveraging Whisper, DALL-E, and GPT to transform customer complaints into actionable insights with automated transcription, visualization, and classification.
Last synced: 23 Nov 2024
https://github.com/homelab-00/longformstt
A python script that utilizes faster-whisper and pytorch for long form transcription. Uses silence detection with RMS/peak value. Has global hotkeys for easy use.
faster-whisper python speech-to-text whisper
Last synced: 09 Jan 2025
https://github.com/aidayang/faster-whisper-oneclick
Faster-whisper one-click launch bundle with a GUI
deep-learning faster-whisper inference openai quantization speech-recognition speech-to-text transformer whisper
Last synced: 09 Jan 2025
https://github.com/waikato-llm/whisper
Docker images for the whisper audio transcription library and variants.
Last synced: 13 Nov 2024
https://github.com/evil0ctal/whisper-speech-to-text-api
An open source Speech-to-Text API. The project is based on OpenAI's Whisper model and uses the asynchronous features of FastAPI to efficiently wrap it and support more custom functions.
ai api fastapi openai-whisper speech-to-text speech-to-text-api whisper whisper-ai whisper-api
Last synced: 25 Oct 2024
https://github.com/jplhughes/whisper_logit_lens
This Alignment Jam Hackathon project explores whether the concept of "logit lens" applies to the encoder and decoder layers in Whisper, an end-to-end speech recognition model.
alignment-jam asr interpretability interpretability-jam logitlens whisper
Last synced: 24 Oct 2024
https://github.com/deshwalmahesh/whisper-fastapi-realtime
A frontend and backend app that runs openai/whisper-large-v3-turbo on a consumer-grade system to provide real-time live audio transcription
audio-transcription fastapi huggingface live pyaudio realtime transcription transformers whisper whisper-large
Last synced: 25 Oct 2024
https://github.com/levysantiago/upload-ai
A system that uses OpenAI's Whisper and ChatGPT to generate titles and descriptions by analyzing submitted videos.
ai artificial-intelligence axios chatgpt fastify ffmpeg nlw-13 node openai prisma react rocketseat tailwindcss typescript vite whisper zod
Last synced: 13 Nov 2024
https://github.com/tristan-mcinnis/simultaneous-interpretation
Simultaneous-Interpretation is an advanced tool for real-time simultaneous interpretation. It transcribes and translates spoken language from a microphone input instantaneously, continually refining translations for accuracy. Ideal for business meetings, educational settings, and live events, it enhances multilingual communication effortlessly.
agents asr faster-whisper openai pyaudio simultaneous-intepreting simultaneous-translation speech-recognition speech-to-text transcription translation whisper
Last synced: 16 Nov 2024
https://github.com/cp3249/athena_project
Athena is an AI assistant project that utilizes voice recognition, text-to-speech, and tool-calling capabilities to provide a conversational and interactive experience. It uses LLMs available through Ollama and provides a basic framework for extending functionalities through a modular tool system.
Last synced: 03 Dec 2024
https://github.com/goktugcy/noteai
An AI-supported Node.js application that converts an audio file to text with Whisper and displays the result as a PDF.
adonisjs whisper whisper-ai whisper-api
Last synced: 15 Nov 2024
https://github.com/chinese-soup/cbot-telegram-whisper
Simple bot that transcribes Telegram voice messages. Powered by go-telegram-bot-api & whisper.cpp Go bindings.
bot cpu-inference golang openai speech-recognition speech-to-text whisper whisper-cpp whispercpp
Last synced: 16 Nov 2024
https://github.com/meain/raus
Record audio until silence (RAUS)
audio hammerspoon transcription whisper whisper-cpp
Last synced: 17 Nov 2024
https://github.com/sbadulin/obsidian-dictation-plugin
Obsidian dictation plugin
dictation gpt-35-turbo obsidian obsidian-plugin openai speech-to-text whisper
Last synced: 07 Dec 2024
https://github.com/escarrie/transcriptaudio
A script that transcribes an audio file into a text file using Whisper AI
Last synced: 17 Nov 2024
https://github.com/brucewind/localwhisperapiservice
openai-whisper transcribe whisper
Last synced: 19 Nov 2024
https://github.com/bilalhameed248/whisper-fine-tuning-for-pronunciation-learning
Fine Tuning of Whisper Speech To Text Base Model For Pronunciation Learning
deep-learning deep-neural-networks dnn fine-tuning openai pronunciation python seq2seq speech speech-recognition speech-synthesis speech-to-text whisper whisper-ai
Last synced: 15 Nov 2024
https://github.com/televisionninja/chat
Chat with an AI Vtuber
ai chatbot llama llm tts vtube-studio vtuber whisper
Last synced: 20 Nov 2024
https://github.com/sixiaolong1117/whisperpythonscript
A simple Whisper Python script that transcribes the audio of media files to text with whisper and saves the result as subtitles with pysrt.
pysrt python python3 whisper whisper-ai
Last synced: 15 Nov 2024
https://github.com/huuquyet/phowhisper-small
Converted clone of PhoWhisper: Automatic Speech Recognition for Vietnamese (2024)
onnx-models phowhisper speech-recognition transformersjs vietnamese vinai whisper
Last synced: 06 Dec 2024
https://github.com/ajxv/rtstt
Real-time speech-to-text transcription using OpenAI Whisper
live-transcription openai openai-whisper python3 transcription whisper
Last synced: 22 Dec 2024
https://github.com/jgw96/speech-to-text-web-toolkit
Making Speech-To-Text on the web easy, both local and in the cloud
ai lit transformersjs webcomponents whisper
Last synced: 06 Dec 2024
https://github.com/obay-ismaeel/post-generator
An API that generates social media posts by implementing RAG with Llama-3
ai api fastapi llama llm python retrieval-augmented-generation social-media whisper
Last synced: 12 Oct 2024
https://github.com/yui-mhcp/speech_to_text
Speech-To-Text (STT) project
audio-transcription deepspeech jasper speech-to-text stt stt-api tensorflow2 video-transcription whisper
Last synced: 24 Oct 2024
https://github.com/EvilFreelancer/whisper-tests
Collection of experiments on OpenAI Whisper models
api-server docker-compose testing transcription whisper
Last synced: 24 Oct 2024
https://github.com/ts-azure-services/batch-transcription-examples
A repo to archive some code related to batch transcription for animation movies.
batch-transcription speech-to-text whisper
Last synced: 30 Nov 2024
https://github.com/deshwalmahesh/interview-help-cheat-live
As the name suggests, it helps you cheat in your live interviews or video calls. It transcribes your audio and provides answers to your query in real time. Supports equation rendering, custom prompts, text selection and editing. It's basically chatGPT for cheating in interviews
audio-transcription chatgpt fastapi huggingface interview interviews live openai pyaudio realtime transcription transformers whisper whisper-large
Last synced: 31 Dec 2024
https://github.com/crucials/twaddle
Speech analysis app that collects statistics such as word frequencies, along with the transcribed text
ai audio python python-eel speech-to-text vue whisper
Last synced: 24 Oct 2024
https://github.com/a-iceberg/whisper-timestamped
Timestamped ASR microservice
asr audio-to-text automatic-speech-recognition data-analysis data-science deep-learning docker fastapi mlops monitoring mssqlserver openai prompt-engineering python resource-management timestamps uvicorn-gunicorn whisper
Last synced: 17 Nov 2024
https://github.com/fkiller/whispertranscript
Transcribe voice from mic input using OpenAI Whisper API.
llm openai transcribe transcript transcription webaudio whisper
Last synced: 06 Jan 2025
https://github.com/datvm/openaiwhisperclient
An HTML page for using the OpenAI Whisper API for transcription, including generating subtitles. JSON is also supported.
client-side openai subtitle timestamp transcript transcription whisper whisper-ai
Last synced: 15 Dec 2024
https://github.com/tomdewildt/whisper-experiment
Experiments using the Whisper model from OpenAI
colab jupyter python transcribe transformers translate whisper
Last synced: 27 Dec 2024
https://github.com/MattCode64/Scriba_Front
SCRIBA is a web application that transcribes audio files. It supports .mp3 files and provides the transcription results in a user-friendly interface.
speech-to-text vite vue vuejs whisper
Last synced: 24 Oct 2024
https://github.com/darienmt/radio-listener
Speech Recognition applied to transcribe amateur radio traffic experiments
python3 radio-amateurs speach-to-text speech-recognition whisper
Last synced: 21 Nov 2024
https://github.com/mottla/speech-to-text
Local and fast speech to text (STT) with speaker recognition. Transcribe your meetings confidentially.
huggingface speech-recognition stt teams transcription translation whisper zoom
Last synced: 21 Nov 2024
https://github.com/xi-rick/captains-log
Captain's Log is your personal AI-powered voice transcription logbook. This innovative web application allows you to transcribe spoken words into text, organize your thoughts, and manage important notes. Built with cutting-edge technology and creative design, Captain's Log sets sail to revolutionize how you capture and manage ideas.
audio-recorder audio-visualizer javascript mongodb mongodb-atlas nextjs once-ui openai react reactjs shadcn-ui tailwindcss typescript voice whisper
Last synced: 21 Nov 2024
https://github.com/mdbecker/whisper_cpp_macos_utils
Automated transcription workflow for macOS: Shell scripts to streamline audio recording, conversion, and transcription using whisper.cpp with macOS utilities like QuickTime Player and BlackHole-2ch.
audio-processing openai shell-scripts speech-to-text transcription whisper whisper-cpp
Last synced: 01 Dec 2024
https://github.com/madh93/whisper
My Whisper stuff
docker openai speech-recognition speech-to-text whisper whisper-cpp
Last synced: 01 Dec 2024
https://github.com/kolger/forty-two-transcribe
A Telegram bot that transcribes videos and audio messages to text via OpenAI Whisper API
openai self-hosted telegram whisper
Last synced: 25 Nov 2024
https://github.com/tobybenjaminclark/intermew
Realistic, generative simulated interviews for Durhack 2024. Built using Webscraping, OpenCV, Deepface, Whisper, OpenAI and Gamemaker.
computer-vision openai-api whisper
Last synced: 25 Nov 2024