Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Whisper
Whisper is an autoregressive language model developed by OpenAI. It is trained on a large corpus of text using a transformer architecture and is capable of generating high-quality natural language text. Whisper can be used for tasks such as language modeling, text completion, and text generation. It has shown impressive performance on various benchmarks and has been released by OpenAI to encourage research in the field of language modeling. Whisper is not yet available for public use, but it has the potential to transform the field of natural language processing and generate new opportunities for language-based applications.
- GitHub: https://github.com/topics/whisper
- Repo: https://github.com/openai/whisper
- Created by: OpenAI
- Released: August 2021
- Related Topics: machine-learning, artificial-intelligence, language-modeling,
- Last updated: 2024-12-24 00:28:14 UTC
- JSON Representation
https://github.com/mooerslab/bash-whisper-transcription
Bash function to ease the transcription of audio files with OpenAI's whisper.
asr audio audio-file-trancription audio-messages automate-the-boring-stuff automatic-speech-recognition automation bash bash-function beginner-friendly speech-to-text stt whisper
Last synced: 14 Dec 2024
https://github.com/marquesafonso/multilang-asr-captioner
A multilingual automatic speech recognition and video captioning tool using faster whisper. Supports real-time translation to english. Runs on consumer grade cpu.
automatic-speech-recognition captioning-videos faster-whisper whisper
Last synced: 24 Oct 2024
https://github.com/huuquyet/phowhisper-next
Demo using PhoWhisper models of VinAI built with Transformers.js + Next.js
nextjs onnx-models phowhisper speech-recognition transformersjs vietnamese vinai whisper
Last synced: 19 Dec 2024
https://github.com/aitor-alvarez/large-speech-models
Fine-tuning Multilingual Large Speech Recognition Models: Wav2vec and Whisper
arabic-speech-recognition asr asr-model finetuning-wav2vec finetuning-whisper large-speech-models speech-recognition-model wav2vec2 whisper
Last synced: 25 Nov 2024
https://github.com/maawad/luna
Personal assistant
bot openai personal-assistant whisper
Last synced: 17 Dec 2024
https://github.com/nicknaskida/insanely-fast-whisper
Incredibly fast Whisper-large-v3 with speaker diarization
diarization speaker-diarization transfromers whisper whisper-ai whisper-faster whisper-large
Last synced: 26 Sep 2024
https://github.com/wtlow003/auto-subtitles
CLI tool to transcribe (+ translate) videos and embed subtitles automatically.
faster-whisper nllb subtitles subtitles-generator translation whisper whisper-cpp
Last synced: 15 Nov 2024
https://github.com/brentwong-kiel1997/ai_language_school_based_on_django_and_openai
Django and OpenAI API example use case
django gpt-4 openai openai-api whisper
Last synced: 09 Oct 2024
https://github.com/platput/pysubs
api to get audio transcription for video files from youtube, aws s3 and such. using OpenAI Whisper
Last synced: 24 Oct 2024
https://github.com/silentsoft/whiscribe
🎬 A tool with a UI that transcribes audio files into subtitles using OpenAI's Whisper and runs completely on your local machine.
audio-transcription openai-whisper srt subtitle whisper
Last synced: 11 Nov 2024
https://github.com/maylad31/colab-codes
some useful colab files
clip colab-notebook speech-recognition whisper zero-shot-classification
Last synced: 12 Nov 2024
https://github.com/Op27/meeting_minutes_generator
This Python application automates the process of generating meeting minutes from an audio recording. It uses the Whisper library for transcription and the OpenAI GPT models for summarizing content, then outputs the result in a Word document.
ai audio-processing document-automation meeting-minutes openai python speech-recognition text-summarization transcription whisper
Last synced: 24 Oct 2024
https://github.com/bhattbhavesh91/openai-whisper-benchmarking
Comparing the performance of OpenAI's Whisper model on a GPU vs OpenAI's API
gpu openai speech-to-text whisper
Last synced: 16 Nov 2024
https://github.com/jojasadventure/whisper-client
Very simple Python based client for Whisper compatible endpoint
desktop-app dictation faster-whisper macos productivity python speech-to-text stt whisper
Last synced: 09 Oct 2024
https://github.com/mikeesto/whispercpp-android
An Android app using whisper.cpp to do voice-to-text transcriptions
android kotlin speech-to-text whisper whisper-cpp
Last synced: 17 Dec 2024
https://github.com/aws-samples/amazon-ivs-webgpu-captions-demo
This repository contains an experimental demo application that shows how you can add client-side auto-generated captions to Amazon IVS Real-time and Low-latency streams using transformers.js and WebGPU.
ai amazon-ivs aws captions experimental ivs-lowlatency ivs-realtime lambda lowlatency lvl-300 realtime serverless transformersjs web webgpu webrtc whisper
Last synced: 09 Oct 2024
https://github.com/voqal/browser
Natural speech browsing for the software developers of tomorrow
cef jcef openai realtime-api voice voice-assistant voice-browser voice-commands voice-control whisper
Last synced: 20 Oct 2024
https://github.com/chaoticbyte/audio-summarize
An audio summarizer (faster-whisper and BART glued together)
ai ai-summarizer audio bart ctranslate2 faster-whisper nlp speech-to-text summarization whisper
Last synced: 09 Oct 2024
https://github.com/oov/aviutl_subtitler
AviUtl+拡張編集の環境で Whisper による文字起こしをするためのプラグイン
Last synced: 19 Dec 2024
https://github.com/adisol07/sharpspeech
SharpSpeech is free, local and open source way to speech and wake word recognition.
audio speech speech-recognition speech-to-text wake-word-detection wakeword whisper whisper-ai
Last synced: 19 Dec 2024
https://github.com/marty1885/useful-whisper-server
Whisper server based on useful-transformers for the RK3588
npu rk3588 rockchip useful-transformers whisper
Last synced: 05 Dec 2024
https://github.com/antoniosbarotsis/telegram-transcriber
A Telegram bot for transcribing voice messages
telegram transcribe voice whisper
Last synced: 31 Oct 2024
https://github.com/extrange/transcription-benchmarks
Speech to text model benchmarks
Last synced: 08 Dec 2024
https://github.com/rhysdg/whisper-onnx-python
A low-footprint GPU accelerated Speech to Text Python package for the Jetpack 5 era bolstered by an optimized graph
ai chatbot cuda machine-learning onnxruntime speech-to-text whisper
Last synced: 09 Oct 2024
https://github.com/bigyaa/transcription-system
This versatile tool is designed for anyone in need of a robust solution for transcribing and diarizing large volumes of audio files. Whether you are dealing with terabytes or even larger quantities, our tool ensures efficient and accurate processing. Ideal for researchers, content creators, and businesses.
accessibility diarization speech-to-text storytelling-with-data transcription whisper
Last synced: 19 Dec 2024
https://github.com/gamut73/quizinator
Generating quizzes, on Android, from YouTube videos.
kotlin-android llm python whisper
Last synced: 19 Dec 2024
https://github.com/i4ds/whisper-finetune
This repository contains code for fine-tuning the Whisper speech-to-text model.
fine-tuning nlp speech-to-text whisper
Last synced: 09 Oct 2024
https://github.com/TranBaVinhSon/eth-decentralized-chat
Decentralized chat app by Ethereum Whisper protocol + Vuejs
ethereum vue vuejs whisper whisper-protocol
Last synced: 24 Oct 2024
https://github.com/toLSC/tolsc-speech-to-text
Speech to text service for toLSC app implemented with OpenAI Whisper model
fastapi python speech-recognition speech-to-text tts whisper
Last synced: 24 Oct 2024
https://github.com/shani-sinojiya/sandalquest
AI/ML project for recognizing colloquial Kannada speech and building a speech-based Q&A system focused on sandalwood cultivation.
ai audio-processing data-augmentation deep-learning machine-learning mongodb nlp python pytorch question-answering speech-based-question-answering-system speech-recognition whisper
Last synced: 02 Dec 2024
https://github.com/slinusc/speaker_identification_evaluation
Evaluating the Effectiveness of Transformer Layers in Wav2Vec 2.0, XLS-R, and Whisper for Speaker Identification Tasks
Last synced: 09 Oct 2024
https://github.com/shtirmann/v2t
Telegram bot which automatically transcribes all voice and video messages to text.
ai aiogram faster-whisper python telegram-bot telegram-bot-python voice-to-text whisper
Last synced: 09 Oct 2024
https://github.com/tylim88/voicefu
Translate Speech Into Japanese
chatgpt speech-synthesis voicevox whisper
Last synced: 18 Dec 2024
https://github.com/baristikir/voice-typing
Simple Desktop Application with Voice Typing features. Runs locally, transcribes locally and works fully offline with support for real-time transcribing. Powered by OpenAI Whisper ASR-models and whisper.cpp inference engine
Last synced: 24 Dec 2024
https://github.com/aspadax/subtitlegenerator
Automatically generate a subtitle for your video.
gpt machine-learning openai rust streamlit subtitles-generator whisper
Last synced: 09 Oct 2024
https://github.com/niqifan007/openai-tts-stt-streamlit
A gui interface for tts (text-to-speech) and stt (speech-to-text) interfaces using the openai api developed by Streamlit, with a history function一个使用Streamlit开发的openai的api接口的tts(文字转语音)和stt(语音转文字)接口的gui界面,带有历史记录功能
openai openai-api streamlit stt-gui tts tts-gui whisper whisper-api
Last synced: 09 Oct 2024
https://github.com/aeronjl/transcribe
Python package for accurate audio transcription with speaker diarisation
audio-transcription gpt speaker-diarization whisper
Last synced: 09 Oct 2024
https://github.com/mikeesto/subber
A small CLI tool for converting video & audio to a text transcription
audio cli ffmpeg golang transcribe video whisper
Last synced: 19 Dec 2024
https://github.com/sumitesh9/localizedwhisper
A initiative to make OpenAI Whisper more localized by adding more languages.
albanian albanian-language huggingface openai speech speech-to-text whisper
Last synced: 08 Nov 2024
https://github.com/natanielf/lecsum
Automatically transcribe and summarize lecture recordings completely on-device using AI.
ollama ollama-python whisper whisper-ai
Last synced: 18 Dec 2024
https://github.com/volkansah/text-to-speech-pygui-for-whisper
This is a simple Python-based GUI application that allows users to generate speech from text using the OpenAI API. The application provides a user-friendly interface for inputting text and selecting from different voices to create personalized audio output.
openai openai-api python-gui-tkinter python3 whisper whisper-ai
Last synced: 28 Nov 2024
https://github.com/brentwong-kiel1997/brents_ai_language_school
Use AI such as ChatGPT and Whisper to learn foreign languages from YouTube videos
ai chatgpt foreign-language openai openai-api whisper whisper-ai youtube
Last synced: 08 Nov 2024
https://github.com/etienneab3d/srt-sync
Synchronize SRT timestamps over an existing accurate transcription
aligner asr nlp subtitles text-to-speech whisper
Last synced: 19 Dec 2024
https://github.com/utrechtuniversity/transcription-d-lucea
python utrecht-university whisper
Last synced: 22 Nov 2024
https://github.com/topdev0215/AudioMultifunctionChatbot
This app enabling users to either record or upload audio files. Then utilizing OpenAI API (Whisper, GPT4) generates transcriptions, summaries, fact checks, sentiment analysis, and text metrics. Users can also intelligently chat about their transcriptions with a GPT4 chatbot. Data is stored relationally in SQLite and also vectorized in Pinecone.
gpt4 langcha nltk openai python3 sqlite3 streamlit strean whisper
Last synced: 24 Oct 2024
https://github.com/nerdimite/meetsy-app
Frontend for the Workshop on Building an End-to-End AI Meeting Assistant
gpt-3 nextjs sentence-transformers tailwindcss whisper
Last synced: 24 Oct 2024
https://github.com/juanestban/whisper-tnode
cli ts typescript whisper whisper-cpp whisper-ia whisper-node whisper-node-ts
Last synced: 21 Dec 2024
https://github.com/egorsmkv/star-adapt-uk
Fork of https://github.com/YUCHEN005/STAR-Adapt with some modifications for Ukrainian.
asr speech-recognition ukrainian whisper
Last synced: 19 Dec 2024
https://github.com/stefanasandei/youtube-to-text
Speech to text for any YouTube video.
ai api flask openai python server speech-to-text web-server whisper youtube youtube-dl
Last synced: 09 Nov 2024
https://github.com/ayeshaaaaaaaaa/ai-powered-video-analysis-with-object-detection-and-detailed-scene-narratives
AI-driven video analysis system that extracts and transcribes audio with Whisper, detects objects using YOLO, and generates comprehensive scene descriptions with GPT-2. The project combines transcriptions and object detections to produce detailed, context-aware video narratives.
bart gpt2 video-analysis whisper yolov8
Last synced: 08 Nov 2024
https://github.com/pdcalado/waste
Whisper Audio Service for Transcription and Ergonomics
productivity rofi transcription tts whisper
Last synced: 20 Nov 2024
https://github.com/canaxs/whisper-core
An application where users can make rumor-based news and earn money in return.
mysql panel spring spring-boot whisper
Last synced: 19 Dec 2024
https://github.com/mickekring/top-of-mind-clara
Clara är en prototyp som möjliggör att anonymt kunna göra sin röst hörd. Medarbetaren kan prata eller skriva in det du vill säga och AI anonymiserar det. Medarbetaren har dessutom tillgång till en chatbot att rådfråga. Därefter analyseras och sammanställs alla medarbetares tankar i en dashboard.
ai chatbot feedback openai python streamlit transcription whisper
Last synced: 22 Dec 2024
https://github.com/notyusheng/transcribe-translate
Local web app for transcription and translation services for audio and video using Whisper models
docker full-stack nodejs react reactjs self-hosted speech-to-text transcribe translate whisper
Last synced: 11 Oct 2024
https://github.com/winstxnhdw/capgen
A fast CPU-first video/audio transcriber for generating caption files with Whisper and CTranslate2, hosted on Hugging Face Spaces.
asr automatic-speech-recognition caddy ctranslate2 docker fastapi huggingface huggingface-spaces uvicorn-gunicorn whisper
Last synced: 23 Oct 2024
https://github.com/gabriellopesdesouza2002/funcspy
Functions to help you develop any program or script you want
automation chatbot dall-e email email-library ocr openai-api openai-chatgpt openai-whisper pdf pdf-tools python regex selenium selenium-webdriver whisper
Last synced: 30 Oct 2024
https://github.com/bbc-esq/whisper-solo-with-gui
OpenAI's Whisper program with a simple lightweight GUI.
pyqt pyqt6 pyqt6-gui transcribe transcribe-audio-files translate whisper
Last synced: 12 Nov 2024
https://github.com/bbc-esq/batch-openai-whisper-ctranslate2
Batch process multiple files using the fasted ctranslate2 implementation of Open AI's Whisper
batch-processing batch-script openai openai-whisper pyside6 transcription translation whisper whisperx
Last synced: 12 Nov 2024
https://github.com/becomingbabyman/eunoia-desktop
local desktop transcription and search for apple voice memos and videos
search second-brain transcription videos voice-memos whisper
Last synced: 25 Dec 2024
https://github.com/schnoddelbotz/whisper-ui
Transcribe audio/video to text, locally on macOS, Linux and Windows. A simple whisper.cpp wrapper/UI built with Go/Fyne.
ffmpeg ffmpeg-wrapper fyne gui local privacy speech-to-text transcription whisper whisper-cpp
Last synced: 22 Dec 2024
https://github.com/nri12/filter_voice
Dự án lọc và tắt tiếng video những từ khóa mong muốn
Last synced: 19 Dec 2024
https://github.com/roman01la/sub-deep
Transcribe and translate audio with AI
deepl transcribe translate whisper
Last synced: 08 Nov 2024
https://github.com/julienvincent/whalker
Whisper talker
whisper whisper-ai whisper-cpp
Last synced: 07 Nov 2024
https://github.com/alancunningham/chatgpt-assistant
A ChatGPT assistant with voice activation and image generation, connected to a Raspberry Pi display.
chatgpt chatgpt-api dall-e dall-e-api porcupine python raspberry-pi whisper
Last synced: 10 Nov 2024
https://github.com/tranbavinhson/eth-decentralized-chat
Decentralized chat app by Ethereum Whisper protocol + Vuejs
ethereum vue vuejs whisper whisper-protocol
Last synced: 06 Nov 2024
https://github.com/ahmetoner/master-whisper
Master Whisper transcription with CTranslate2
deep-learning inference openai quantization speech-recognition speech-to-text transformer whisper
Last synced: 10 Nov 2024
https://github.com/ioriens/whisper-video
Generate subtitles for all the videos in a folder with OpenAI's Whisper privately in your computer.
subtitle-generator video-to-audio video-to-text whisper
Last synced: 17 Nov 2024
https://github.com/rokbenko/arctic-meet
ArcticMeet is an AI meeting assistant using Streamlit as a GUI and the Snowflake Arctic LLM via the Snowflake Cortex
ffmpeg pandas plotly python pytorch snowflake snowflake-arctic snowflake-cortex snowpark streamlit transformers whisper
Last synced: 12 Nov 2024
https://github.com/luizcalaca/transcricao-medica
Full Stack + Whisper Transcription + Node.js REST API + VITE + React.js + Railway deploy
full-stack nodejs openai openai-api railway reactjs sequelize sequelize-orm vite whisper whisper-ai
Last synced: 25 Nov 2024
https://github.com/rudrodip/kittyscribe
microservice for transcribing audio/video files to text and transcoding video
Last synced: 01 Dec 2024
https://github.com/jgw96/speech-to-text-web-toolkit
Making Speech-To-Text on the web easy, both local and in the cloud
ai lit transformersjs webcomponents whisper
Last synced: 06 Dec 2024
https://github.com/vlazic/json-verbose-to-vtt-converter
Transform `json_verbose` transcriptions from OpenAI, Groq, or command-line tools into VTT files with this Deno converter.
converter groq json json-verbose openai vtt webvtt whisper
Last synced: 26 Nov 2024
https://github.com/jplhughes/whisper_logit_lens
This Alignment Jam Hackathon project explores whether the concept of "logit lens" applies to the encoder and decoder layers in Whisper, an end-to-end speech recognition model.
alignment-jam asr interpretability interpretability-jam logitlens whisper
Last synced: 24 Oct 2024
https://github.com/tomdewildt/whisper-experiment
Experiments using the Whisper model from Open AI
colab jupyter python transcribe transformers translate whisper
Last synced: 07 Nov 2024
https://github.com/eva-kaushik/multilingual-transcription-with-openai_whisper
Whisper Automatic Speech Recognition (ASR) Model
openai openai-api transcription webapp whisper
Last synced: 22 Dec 2024
https://github.com/microsoft/azure-ai-foundry-whatsapp-bot
WhatsApp Bot built with Azure Functions and Azure AI Foundry, using Python.
azure-ai-foundry azure-functions azure-openai python whatsapp-api whatsapp-bot whisper
Last synced: 27 Nov 2024
https://github.com/nelzomal/videolens_ai
VideoLens AI is a powerful Chrome extension that enhances your YouTube viewing experience
ai chrome-ai gemini-nano transformers whisper wxt
Last synced: 02 Dec 2024
https://github.com/yui-mhcp/speech_to_text
Speech-To-Text (STT) project
audio-transcription deepspeech jasper speech-to-text stt stt-api tensorflow2 video-transcription whisper
Last synced: 24 Oct 2024
https://github.com/xawos/owt
🦙🗣️ Ollama and Whisper Telegram bot, with advanced configuration
ai-bots local-ai ollama telegram-aichatbot telegram-bots whisper
Last synced: 03 Dec 2024
https://github.com/pawelzeja098/whisper-video-transcription
Testing whisper Open-AI to transcribe videos
mp4 transcription whisper whisper-ai
Last synced: 28 Nov 2024
https://github.com/youknow2509/real-time-speech-to-text
Speech To Text in Real-Time
blackhole speech-recognition speech-to-text whisper whisper-api
Last synced: 19 Dec 2024
https://github.com/whisper-666/TikTok-Login
TikTok Login With No Captcha No Proxy (unlimited requests)
api combo combo-checker proxyless tiktok tiktok-api tiktok-followers tiktok-followers-generator tiktok-followers-software tiktok-login tiktok-views whisper
Last synced: 24 Oct 2024
https://github.com/concaption/containerized-transcription-api
Containerized Transcription API using Whisper Model and FastAPI
docker fastapi openai transcription whisper
Last synced: 16 Dec 2024
https://github.com/EvilFreelancer/whisper-tests
Collection of experiments on OpenAI Whisper models
api-server docker-compose testing transcription whisper
Last synced: 24 Oct 2024
https://github.com/seanvelasco/ai
Cloudflare AI challenge submission: Slater - your virtual foreign language friend
ai artificial-intelligence language-learning llama2 llm m2m100 machine-learning whisper
Last synced: 09 Dec 2024
https://github.com/breadrock1/audio-to-text
There is simple backend project to use whisper-rs.
actix-web audio-to-text rust swagger-ui whisper
Last synced: 11 Nov 2024
https://github.com/educa-ch/educa24-speech-to-summary
Demonstrator for an open-source speech-to-summary workflow
langchain ollama open-source open-weight speech-to-text summarization whisper
Last synced: 11 Oct 2024
https://github.com/velocitatem/dontlectureme
A program that pays attention to your lectures for you.
ai lectures university whisper
Last synced: 03 Dec 2024
https://github.com/neiltron/autocap
ALL CAPS
closedcaptions ml subtitles transcription whisper
Last synced: 19 Dec 2024
https://github.com/RingoMar/whisper-devcontainer
Openai whisper inside of vscode docker devcontainer using example files
ai devcontainer docker openapi python whisper
Last synced: 24 Oct 2024
https://github.com/MattCode64/Scriba
SCRIBA is a web application that transcribes audio files. It supports .mp3 files and provides the transcription results in a user-friendly interface.
fastapi python speech-to-text whisper
Last synced: 24 Oct 2024