Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Whisper
Whisper is an autoregressive language model developed by OpenAI. It is trained on a large corpus of text using a transformer architecture and is capable of generating high-quality natural language text. Whisper can be used for tasks such as language modeling, text completion, and text generation. It has shown impressive performance on various benchmarks and has been released by OpenAI to encourage research in the field of language modeling. Whisper is not yet available for public use, but it has the potential to transform the field of natural language processing and generate new opportunities for language-based applications.
- GitHub: https://github.com/topics/whisper
- Repo: https://github.com/openai/whisper
- Created by: OpenAI
- Released: August 2021
- Related Topics: machine-learning, artificial-intelligence, language-modeling,
- Last updated: 2025-01-11 00:26:03 UTC
- JSON Representation
https://github.com/philogicae/docker-faster-whisper-fr-api
Docker - Faster Whisper FR - RunPod Serverless API
ctranslate2 docker faster-whisper french runpod serverless whisper
Last synced: 08 Jan 2025
https://github.com/ekito-station/whisper-api-unity
UnityでOpenAI Whisper APIを使って文字起こしを行ったサンプル
Last synced: 20 Dec 2024
https://github.com/patryk-ku/sasayaki
A small CLI tool that simplifies and automates the process of installing and using AI models to transcribe and translate videos.
automation cli faster-whisper gemini-api transcription translation whisper whisper-cpp
Last synced: 05 Jan 2025
https://github.com/hydrol0x/retriever
A new aid for the visually impaired powered by AI
elevenlabs llm palm visual-impairment-aid whisper
Last synced: 14 Nov 2024
https://github.com/huuquyet/phowhisper-small
Converted clone of PhoWhisper: Automatic Speech Recognition for Vietnamese (2024)
onnx-models phowhisper speech-recognition transformersjs vietnamese vinai whisper
Last synced: 06 Dec 2024
https://github.com/sbadulin/obsidian-dictation-plugin
Obsidian dictation plugin
dictation gpt-35-turbo obsidian obsidian-plugin openai speech-to-text whisper
Last synced: 07 Dec 2024
https://github.com/rishabhmathur06/fine-tuning-whisper-small-for-asr-
This repository contains notebook that shows how to fine-tune OpenAI's Whisper model on custom Hindi dataset.
artificial-intelligence asr automatic-speech-recognition fine-tuning openai python whisper whisper-model
Last synced: 19 Dec 2024
https://github.com/jgw96/speech-to-text-web-toolkit
Making Speech-To-Text on the web easy, both local and in the cloud
ai lit transformersjs webcomponents whisper
Last synced: 06 Dec 2024
https://github.com/levysantiago/upload-ai
Este é um sistema que utiliza Whisper e ChatGPT da OpenAI para gerar títulos e descrições a partir da análise de vídeos submetidos.
ai artificial-intelligence axios chatgpt fastify ffmpeg nlw-13 node openai prisma react rocketseat tailwindcss typescript vite whisper zod
Last synced: 13 Nov 2024
https://github.com/tristan-mcinnis/simultaneous-interpretation
Simultaneous-Interpretation is an advanced tool for real-time simultaneous interpretation. It transcribes and translates spoken language from a microphone input instantaneously, continually refining translations for accuracy. Ideal for business meetings, educational settings, and live events, it enhances multilingual communication effortlessly.
agents asr faster-whisper openai pyaudio simultaneous-intepreting simultaneous-translation speech-recognition speech-to-text transcription translation whisper
Last synced: 16 Nov 2024
https://github.com/mai-reborn/mai-offline-transcriber
Offline audio/video transcriber using Whisper, saving to .txt or .srt. Ensures privacy, no external servers used.
asr audio-transcription offline-transcriber pyqt6 python speech-recognition video-transcription whisper
Last synced: 05 Jan 2025
https://github.com/akhkim/babel
Real-time Internal Audio Translate and Transcriber that uses Whisper model
ai internal-audio real-time transcription translation whisper
Last synced: 19 Dec 2024
https://github.com/fkiller/whispertranscript
Transcribe voice from mic input using OpenAI Whisper API.
llm openai transcribe transcript transcription webaudio whisper
Last synced: 06 Jan 2025
https://github.com/tomdewildt/whisper-experiment
Experiments using the Whisper model from Open AI
colab jupyter python transcribe transformers translate whisper
Last synced: 27 Dec 2024
https://github.com/cp3249/athena_project
Athena is an AI assistant project that utilizes voice recognition, text-to-speech, and tool-calling capabilities to provide a conversational and interactive experience. It uses LLMs available through Ollama and provides a basic framework for extending functionalities through a modular tool system.
Last synced: 03 Dec 2024
https://github.com/ashot72/answering-questions-about-images
You can upload images, ask questions about images using voice prompts, then listen to the responses in voice
answering-questions blip-2-ai-model gtts large-language-models llm replicate speech-to-text text-to-speech whisper
Last synced: 30 Dec 2024
https://github.com/goktugcy/noteai
An artificial intelligence supported NodeJS application that allows the audio file to be displayed as pdf after converting it to text with the Whisper tool.
adonisjs whisper whisper-ai whisper-api
Last synced: 15 Nov 2024
https://github.com/chinese-soup/cbot-telegram-whisper
Simple bot that transcribes Telegram voice messages. Powered by go-telegram-bot-api & whisper.cpp Go bindings.
bot cpu-inference golang openai speech-recognition speech-to-text whisper whisper-cpp whispercpp
Last synced: 16 Nov 2024
https://github.com/LarissaGuder/whisper-datastream
Transcription and NER in streaming environment
bert-ner python spark-streaming whisper
Last synced: 24 Oct 2024
https://github.com/youknow2509/real-time-speech-to-text
Speech To Text in Real-Time
blackhole speech-recognition speech-to-text whisper whisper-api
Last synced: 19 Dec 2024
https://github.com/werserk/techstormhack-1st-place
Решение соревнования ТехШторм от корпорации ТатНефть по анализу активности членов команды на ВКС
pyannote speaker-diarization speech-recognition streamlit whisper
Last synced: 11 Jan 2025
https://github.com/meain/raus
Record audio until silence (RAUS)
audio hammerspoon transcription whisper whisper-cpp
Last synced: 17 Nov 2024
https://github.com/yuxiang32/Audio-Transcription
Audio transcriber using OpenAI Whisper
Last synced: 24 Oct 2024
https://github.com/escarrie/transcriptaudio
This is a script that can be used to transcript audio file into text file using Whisper AI
Last synced: 17 Nov 2024
https://github.com/heyfoz/python-openai-whisper
This Python script provides a simple interface to transcribe audio files using the OpenAI API's speech-to-text functionality, powered by the Whisper model. The result is returned to the console as text or VTT (WebVTT) format.
ai api audio-transcription openai python speech-to-text whisper
Last synced: 19 Dec 2024
https://github.com/educa-ch/educa24-speech-to-summary
Demonstrator for an open-source speech-to-summary workflow
langchain ollama open-source open-weight speech-to-text summarization whisper
Last synced: 11 Oct 2024
https://github.com/neiltron/autocap
ALL CAPS
closedcaptions ml subtitles transcription whisper
Last synced: 19 Dec 2024
https://github.com/sudiptab2100/waku-user-chat
Waku Chat using Usernames
communication-protocol decentralised-application decentralized ethereum ipfs libp2p waku waku-connect web3 whisper zk-snarks zkp
Last synced: 20 Dec 2024
https://github.com/bilalhameed248/whisper-fine-tuning-for-pronunciation-learning
Fine Tuning of Whisper Speech To Text Base Model For Pronunciation Learning
deep-learning deep-neural-networks dnn fine-tuning openai pronunciation python seq2seq speech speech-recognition speech-synthesis speech-to-text whisper whisper-ai
Last synced: 15 Nov 2024
https://github.com/geo-y20/enhanced-learning-experience
IntelliLearn is a FastAPI-based application designed to process and transcribe audio and video files into text using the Whisper model. The application also supports processing PDF files to extract and summarize their content.
chat-application chatgpt educational-project fastapi groq-api huggingface lama llm pdf-files platform python speech-to-text text-summarization transformer whisper word2vec wordembedding
Last synced: 19 Dec 2024
https://github.com/televisionninja/chat
Chat with an AI Vtuber
ai chatbot llama llm tts vtube-studio vtuber whisper
Last synced: 20 Nov 2024
https://github.com/sixiaolong1117/whisperpythonscript
一个简单的 Whisper Python 脚本,可以将媒体文件的音频通过 whisper 识别成文字,并通过 pysrt 保存为字幕。
pysrt python python3 whisper whisper-ai
Last synced: 15 Nov 2024
https://github.com/waikato-llm/whisper
Docker images for the whisper audio transcription library and variants.
Last synced: 13 Nov 2024
https://github.com/luluw8071/whisper-tune
Finetuning Whisper on your own voice
Last synced: 14 Dec 2024
https://github.com/kristofferv98/whisper_turboapi
An optimized FastAPI server for OpenAI's Whisper whisper-large-v3-turbo model using MLX turbo optimization
ai api asynchronous audio audio-processing fastapi huggingface machine-learning macos mlx model-serving nlp openai optimization python speech-to-text synchronous transcription whisper whisper-turbo
Last synced: 14 Dec 2024
https://github.com/soenneker/soenneker.runners.whisper.ctranslate
Automatically updates the Soenneker.Whisper.CTranslate package
ai csharp ctranslate ctranslate2 dotnet faster library runner runners whisper whisperctranslate
Last synced: 28 Dec 2024
https://github.com/sivakumar-mahalingam/subtitle-generator
🎞️ Automatically generating subtitles for video files using Whisper ASR model in Python
ai audio-model audio-processing automatic-speech-recognition openai-whisper python speech-recognition speech-to-text subtitle-generator whisper
Last synced: 09 Oct 2024
https://github.com/aidayang/faster-whisper-oneclick
Faster-whisper一键启动整合包带GUI界面
deep-learning faster-whisper inference openai quantization speech-recognition speech-to-text transformer whisper
Last synced: 09 Jan 2025
https://github.com/aixerum/faster-whisper
faster-whisper is a reimplementation of OpenAI's Whisper model using CTranslate2, which is a fast inference engine for Transformer models. This implementation is up to 4 times faster than openai/whisper for the same accuracy while using less memory. The efficiency can be further improved with 8-bit quantization on both CPU and GPU.
ctranslate2 gpu transcription whisper
Last synced: 07 Jan 2025
https://github.com/dheison0/subcreator
A subtitle creator, translator and embeder tool made using AI
ai machine-learning ml python subtitles video-processing whisper
Last synced: 09 Oct 2024
https://github.com/homelab-00/longformstt
A python script that utilizes faster-whisper and pytorch for long form transcription. Uses silence detection with RMS/peak value. Has global hotkeys for easy use.
faster-whisper python speech-to-text whisper
Last synced: 09 Jan 2025
https://github.com/ts-azure-services/batch-transcription-examples
A repo to archive some code related to batch transcription for animation movies.
batch-transcription speech-to-text whisper
Last synced: 30 Nov 2024
https://github.com/deshwalmahesh/interview-help-cheat-live
As the name suggests, it helps you cheat in your live interviews or video calls. It transcribes your audio and provides answers to your query in real time. Supports equation rendering, custom prompts, text selection and editing. It's basically chatGPT for cheating in interviews
audio-transcription chatgpt fastapi huggingface interview interviews live openai pyaudio realtime transcription transformers whisper whisper-large
Last synced: 31 Dec 2024
https://github.com/man2dev/whisper-cpp
dev fork of https://src.fedoraproject.org/rpms/whisper-cpp
fedora fedora-repository linux whisper whisper-cpp whispercpp
Last synced: 09 Oct 2024
https://github.com/orhancavus/transcribe_video
Extract Subtitles from YouTube Videos with OpenAI Whisper and Insanely Fast Whisper
insanely-fast speach-to-text whisper
Last synced: 09 Jan 2025
https://github.com/yousofss/speechtotext
Speech-to-Text using OpenAI's Whisper model
audio-to-text openai openai-whisper speech-to-text transcription whisper whisper-ai
Last synced: 09 Oct 2024
https://github.com/a-iceberg/whisper-timestamped
Timestamped ASR microservice
asr audio-to-text automatic-speech-recognition data-analysis data-science deep-learning docker fastapi mlops monitoring mssqlserver openai prompt-engineering python resource-management timestamps uvicorn-gunicorn whisper
Last synced: 17 Nov 2024
https://github.com/datvm/openaiwhisperclient
A HTML page for using OpenAI Whisper API for transcripting, including making subtitles. JSON is also supported.
client-side openai subtitle timestamp transcript transcription whisper whisper-ai
Last synced: 15 Dec 2024
https://github.com/nexuslux/simultaneous-interpretation
Simultaneous-Interpretation is an advanced tool for real-time simultaneous interpretation. It transcribes and translates spoken language from a microphone input instantaneously, continually refining translations for accuracy. Ideal for business meetings, educational settings, and live events, it enhances multilingual communication effortlessly.
agents asr faster-whisper openai pyaudio simultaneous-intepreting simultaneous-translation speech-recognition speech-to-text transcription translation whisper
Last synced: 09 Oct 2024
https://github.com/lifeosm/whisper
🐳 Docker image with OpenAI Whisper.
docker octolab speech-to-text whisper
Last synced: 24 Oct 2024
https://github.com/deshwalmahesh/whisper-fastapi-realtime
It is Front + Backend app that uses openai/whisper-large-v3-turbo in your consumer grade system to provide real live audio transcription
audio-transcription fastapi huggingface live pyaudio realtime transcription transformers whisper whisper-large
Last synced: 25 Oct 2024
https://github.com/Franky1/AIAudioTranscriber
A minimalistic web app to generate transciption for audio built using Python
openai python streamlit transcription whisper
Last synced: 24 Oct 2024
https://github.com/evil0ctal/whisper-speech-to-text-api
An open source Speech-to-Text API. The project is based on OpenAI's Whisper model and uses the asynchronous features of FastAPI to efficiently wrap it and support more custom functions.
ai api fastapi openai-whisper speech-to-text speech-to-text-api whisper whisper-ai whisper-api
Last synced: 25 Oct 2024
https://github.com/ubos-tech/node-red-contrib-speech-to-text-ubos
Learn how to turn audio into text.
ai low-code lowcode node-red node-red-contrib node-red-flow openai openai-api openai-whisper speech-to-text whisper whisper-ai whisper-api
Last synced: 19 Nov 2024
https://github.com/deepbiolab/customer-complaint-classification
An GenAI-powered pipeline leveraging Whisper, DALL-E, and GPT to transform customer complaints into actionable insights with automated transcription, visualization, and classification.
Last synced: 23 Nov 2024
https://github.com/doctorpok42/pheere
Pheere is a simple virtual assistant
ai chatgpt elevenlabs ts virtual-assistant whisper
Last synced: 10 Jan 2025
https://github.com/ivanrj7j/transcription
This project transcribes audio using whisper and provides an api
ai api flask transcription whisper
Last synced: 09 Oct 2024
https://github.com/saamerm/whisperkit-ios15
iOS 15 - On-device Inference of Whisper Speech Recognition Models for Apple Silicon
ios ios15 swiftui whisper whisper-ai
Last synced: 26 Sep 2024
https://github.com/miosipof/asr_train
Fine-tuning OpenAI Whisper for ASR tasks on low-size datasets
asr machine-learning nlp whisper
Last synced: 07 Jan 2025
https://github.com/darienmt/radio-listener
Speech Recognition applied to transcribe amateur radio traffic experiments
python3 radio-amateurs speach-to-text speech-recognition whisper
Last synced: 21 Nov 2024
https://github.com/mottla/speech-to-text
Local and fast speech to text (STT) with speaker recognition. Transcibe your meetings confidentially.
huggingface speech-recognition stt teams transcription translation whisper zoom
Last synced: 21 Nov 2024
https://github.com/xi-rick/captains-log
Captain's Log is your personal AI-powered voice transcription logbook. This innovative web application allows you to transcribe spoken words into text, organize your thoughts, and manage important notes. Built with cutting-edge technology and creative design, Captain's Log sets sail to revolutionize how you capture and manage ideas.
audio-recorder audio-visualizer javascript mongodb mongodb-atlas nextjs once-ui openai react reactjs shadcn-ui tailwindcss typescript voice whisper
Last synced: 21 Nov 2024
https://github.com/mdbecker/whisper_cpp_macos_utils
Automated transcription workflow for macOS: Shell scripts to streamline audio recording, conversion, and transcription using whisper.cpp with macOS utilities like QuickTime Player and BlackHole-2ch.
audio-processing openai shell-scripts speech-to-text transcription whisper whisper-cpp
Last synced: 01 Dec 2024
https://github.com/jalvarezz13/summarai
SummarAI utilizes PyMovie and Whisper to transcribe videos, enabling you to ask questions about the content using Llama2 and Llama-index for insightful interaction.
llama-index llama2 pymovie whisper
Last synced: 22 Dec 2024
https://github.com/madh93/whisper
🎙️ My Whisper stuff
docker openai speech-recognition speech-to-text whisper whisper-cpp
Last synced: 01 Dec 2024
https://github.com/kolger/forty-two-transcribe
A Telegram bot that transcribes videos and audio messages to text via OpenAI Whisper API
openai self-hosted telegram whisper
Last synced: 25 Nov 2024
https://github.com/tobybenjaminclark/intermew
👨💻 Realistic, generative simulated interviews for Durhack 2024. Built using Webscraping, OpenCV, Deepface, Whisper, OpenAI and Gamemaker.
computer-vision openai-api whisper
Last synced: 25 Nov 2024
https://github.com/teemow/mnote
Generates meeting notes and summaries from video recordings
ai chatgpt google-meet kubeai kubernetes meeting-minutes transcription video-transcription whisper
Last synced: 07 Dec 2024