Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Whisper
Whisper is an autoregressive language model developed by OpenAI. It is trained on a large corpus of text using a transformer architecture and is capable of generating high-quality natural language text. Whisper can be used for tasks such as language modeling, text completion, and text generation. It has shown impressive performance on various benchmarks and has been released by OpenAI to encourage research in the field of language modeling. Whisper is not yet available for public use, but it has the potential to transform the field of natural language processing and generate new opportunities for language-based applications.
- GitHub: https://github.com/topics/whisper
- Repo: https://github.com/openai/whisper
- Created by: OpenAI
- Released: August 2021
- Related Topics: machine-learning, artificial-intelligence, language-modeling,
- Last updated: 2025-02-04 00:30:59 UTC
- JSON Representation
https://github.com/alancunningham/chatgpt-assistant
A ChatGPT assistant with voice activation and image generation, connected to a Raspberry Pi display.
chatgpt chatgpt-api dall-e dall-e-api porcupine python raspberry-pi whisper
Last synced: 06 Jan 2025
https://github.com/saadkh1/docqa-textsummarization-app
A Streamlit app for document question answering and text summarization.
langchain llama-2 llamacpp pytesseract question-answering streamlit summarization whisper
Last synced: 07 Jan 2025
https://github.com/i4ds/whisper-prep
Data preparation utility for the finetuning of OpenAI's Whisper model.
fine-tuning nlp speech-to-text whisper
Last synced: 09 Nov 2024
https://github.com/stefanasandei/youtube-to-text
Speech to text for any YouTube video.
ai api flask openai python server speech-to-text web-server whisper youtube youtube-dl
Last synced: 04 Jan 2025
https://github.com/sonhm3029/realtime-vietnamese-asr-react-native-and-whisper
This project implement end to end realtime vietnamese speech recognition with PhoWhisper in Backend and frontend in React Native
asr phowhiper react-native realtime realtime-speech-recognition speech-recognition speech-to-text vietnamese whisper
Last synced: 16 Nov 2024
https://github.com/water25234/ChatREP
Summary on Youtube By ChatGPT & whisper
chatgpt-api openai python python3 video whisper youtube
Last synced: 24 Oct 2024
https://github.com/knot-inc/john
John is a web app that records video, analyzes audio with AI, and identifies the speaker's native language from their English accent, simplifying language assessment.
audio-analysis machine-learning whisper
Last synced: 17 Nov 2024
https://github.com/flyingfathead/youwhisper-cli
A streamlined CLI tool combining `yt-dlp` and `whisperx` (or `openai-whisper`) for quick and efficient audio transcription from various video platforms.
cli cli-app python transcribe transcriber transcription whisper whisper-ai whisperx youtube-downloader yt-dlp yt-dlp-wrapper
Last synced: 11 Jan 2025
https://github.com/jemtaly/whispering
A real-time transcription and translation tool implemented in Python based on the fast-whisper library.
live-caption python real-time-transcription real-time-translation tkinter transcription translation whisper
Last synced: 09 Jan 2025
https://github.com/gurpreetkaurjethra/multimodal-ai-app-using-llava-7b
Multimodal AI App using Llava 7B and Gradio
ai generative-ai gradio large-language-models llava llavacpp llm multimodal voice-assistant whisper
Last synced: 22 Nov 2024
https://github.com/t-h-chung/note-taker
Note-taking app for online/local video/audio using Whisper transcription, ChatGPT, and Notion
chatgpt notes notion transcription whisper youtube
Last synced: 09 Oct 2024
https://github.com/daisyyedda/whisper-large-v2-atcosim_corpus
A fine-tuned Whisper model (whisper-large-v2) for aviation audio transcription. WER < 5%.
asr-model nlp whisper whisper-ai
Last synced: 09 Oct 2024
https://github.com/williamwa/mssmith
A Telegram bot that utilizes the ChatGPT API and can communicate through voice.
chatpgt-api telegram-bot tts whisper
Last synced: 31 Dec 2024
https://github.com/sanket-poojary-03/fine-tuning-whisper
Fine tuning Whisper-Small LLM for Hinglish Audio dataset
audio-dataset audio-to-text deep-learning fine-tuning huggingface-transformers python speech-recognition speech-to-text whisper whisper-ai
Last synced: 09 Oct 2024
https://github.com/tonywu71/distilling-and-forgetting-in-large-pre-trained-models
Code for my dissertation on "Distilling and Forgetting in Large Pre-Trained Models" for the MPhil in Machine Learning and Machine Intelligence (MLMI) at the University of Cambridge.
continual-learning distillation speech-recognition whisper
Last synced: 04 Dec 2024
https://github.com/amir-mohseni/voicebridge
This repository provides a dockerized Speech-to-Speech application that supports text-to-audio conversion, audio-to-text transcription, and interactive voice-based conversations. It is easy to set up and use, offering a versatile platform for speech and text processing.
docker huggingface python transformer tts whisper
Last synced: 17 Jan 2025
https://github.com/amgawishx/voiceworker
A Web App UI for OpenAI's Whisper model for audio transcription and translation.
ai audio-processing python streamlit transcription translation webapp whisper
Last synced: 17 Jan 2025
https://github.com/jacoblincool/wft
Run Whisper fine-tuning with easeโit works on MPS, CUDA, and CPU without code changes.
Last synced: 11 Dec 2024
https://github.com/kazkozdev/video-analyser
โก The YouTube Video Analyzer Pro brings AI-powered analysis capabilities to your fingertips, offering deep insights for content creators and marketers.
ai content-analytics fastapi llama3 llm ollama-api python3 video-analysis video-analysis-client whisper youtube youtube-analytics youtube-api youtube-subscribers
Last synced: 13 Jan 2025
https://github.com/bhattbhavesh91/neo4j-palm2-makersuite
Explore how to build a Q&A system on Neo4j using Google's Palm2 model with MakerSuite in this repository.
google google-api google-palm maker-suite neo4j-driver neo4j-python-scripts palm2 python table-qa voice-assistant whisper
Last synced: 17 Jan 2025
https://github.com/phidlarkson/whisper-stt-api
Easy setup for the whisper speech to text
api flask speech-to-text whisper
Last synced: 01 Jan 2025
https://github.com/ndjenkins85/afkode
Personal voice command interface for iPhone on pythonista powered by Whisper and ChatGPT.
chatgpt openai python-packaging quick-start whisper
Last synced: 12 Oct 2024
https://github.com/my-north-ai/semantic_audio_filtering
Synthetic data augmentation technique via LLM for Automatic Speech Recognition fine tuning.
automatic-speech-recognition fine-tuning synthetic-dataset-generation text-to-speech whisper
Last synced: 24 Oct 2024
https://github.com/imsanjoykb/speech-nlp-bootcamp
Speech NLP Bootcamp
asr audio-analysis audio-applications bangla-nlp huggingface-transformers seq2seq speech speech-recognition tts wav2vec2 whisper
Last synced: 18 Jan 2025
https://github.com/JoSuru/speeka
Speeaka is an open-source project that uses the Whisper model of OpenAI to transcribe audio into text. Its intuitive web interface makes it easy to use. Contributions are welcome.
open-source python python3 speech-to-text streamlit whisper
Last synced: 24 Oct 2024
https://github.com/limdongjin/ignkafasr
Real-Time In-memory Speaker Verification and Speech Recognition Project using apache ignite, apache kafka, speechbrain, whisper, stomp, spring webflux, kubernetes(k8s)
apache-ignite apache-kafka asr audio-recorder google-kubernetes-engine k8s kubernetes speaker-recognition speaker-verification speech-recognition speechbrain springframework stomp stompwebsocket webflux whisper
Last synced: 24 Oct 2024
https://github.com/astrologos/py-speakeasy
Speakeasy GPT is a Jupyter notebook that utilizes several natural language processing utilities to provide a seamless and low-latency speech interface to ChatGPT and other large language models.
automatic-speech-recognition chat-gpt coqui-ai coqui-tts elevenlabs-api mimic mycroftai text-to-speech whisper
Last synced: 24 Oct 2024
https://github.com/Lord-Haji/ChatAudio
chatbot gpt-3-5-turbo gpt-4 langchain langchain-python speech-recognition whisper whisper-api
Last synced: 24 Oct 2024
https://github.com/firefly55lm/bisbigliatorev2
Automatic audio transcriber notebook based on Whisper
colab-notebook speech-to-text whisper
Last synced: 25 Jan 2025
https://github.com/seitzquest/RavenWhisperer
Listens to your voice and queries a language model for answers when a question is detected
Last synced: 22 Nov 2024
https://github.com/marketcalls/openalgo-voice-based-orders
OpenAlgo Voice Based Orders
flask groq openai python speech-to-text whisper
Last synced: 19 Dec 2024
https://github.com/sakurajimamai-1202/stream-translator-gpt-webui
A web ui application that utilizes the stream-translator-gpt
faster-whisper gemini gpt transcribe translate translation translator webui whisper yt-dlp
Last synced: 11 Oct 2024
https://github.com/aitor-alvarez/large-speech-models
Fine-tuning Multilingual Large Speech Recognition Models: Wav2vec and Whisper
arabic-speech-recognition asr asr-model finetuning-wav2vec finetuning-whisper large-speech-models speech-recognition-model wav2vec2 whisper
Last synced: 25 Jan 2025
https://github.com/mooerslab/bash-whisper-transcription
Bash function to ease the transcription of audio files with OpenAI's whisper.
asr audio audio-file-trancription audio-messages automate-the-boring-stuff automatic-speech-recognition automation bash bash-function beginner-friendly speech-to-text stt whisper
Last synced: 14 Dec 2024
https://github.com/breadrock1/audio-to-text
There is simple backend project to use whisper-rs.
actix-web audio-to-text rust swagger-ui whisper
Last synced: 10 Jan 2025
https://github.com/doctorpok42/pheere-app
Pheere is a simple virtual assistant
ai chatgpt desktop-app elevenlabs nextjs scss tauri ts virtual-assistant whisper
Last synced: 10 Jan 2025
https://github.com/bigyaa/transcription-system
This versatile tool is designed for anyone in need of a robust solution for transcribing and diarizing large volumes of audio files. Whether you are dealing with terabytes or even larger quantities, our tool ensures efficient and accurate processing. Ideal for researchers, content creators, and businesses.
accessibility diarization speech-to-text storytelling-with-data transcription whisper
Last synced: 19 Dec 2024
https://github.com/szilvia-csernus/openai-audio-api-calls
Speech-to-text and text-to-speech API call examples, using OpenAI's whisper-1 and tts-1 models.
jupyter-notebook openai openai-api tts-1 whisper
Last synced: 09 Oct 2024
https://github.com/rhysdg/whisper-onnx-python
A low-footprint GPU accelerated Speech to Text Python package for the Jetpack 5 era bolstered by an optimized graph
ai chatbot cuda machine-learning onnxruntime speech-to-text whisper
Last synced: 09 Oct 2024
https://github.com/brentwong-kiel1997/brents_ai_language_school
Use AI such as ChatGPT and Whisper to learn foreign languages from YouTube videos
ai chatgpt foreign-language openai openai-api whisper whisper-ai youtube
Last synced: 31 Dec 2024
https://github.com/toomore/whisper
๐๐ฆ๐๐๐ Write some notes by using the GPG encrypts.
gpg notes pgp quickstart whisper
Last synced: 23 Jan 2025
https://github.com/adisol07/sharpspeech
SharpSpeech is free, local and open source way to speech and wake word recognition.
audio speech speech-recognition speech-to-text wake-word-detection wakeword whisper whisper-ai
Last synced: 19 Dec 2024
https://github.com/tranbavinhson/eth-decentralized-chat
Decentralized chat app by Ethereum Whisper protocol + Vuejs
ethereum vue vuejs whisper whisper-protocol
Last synced: 26 Dec 2024
https://github.com/roman01la/sub-deep
Transcribe and translate audio with AI
deepl transcribe translate whisper
Last synced: 30 Dec 2024
https://github.com/ayeshaaaaaaaaa/ai-powered-video-analysis-with-object-detection-and-detailed-scene-narratives
AI-driven video analysis system that extracts and transcribes audio with Whisper, detects objects using YOLO, and generates comprehensive scene descriptions with GPT-2. The project combines transcriptions and object detections to produce detailed, context-aware video narratives.
bart gpt2 video-analysis whisper yolov8
Last synced: 02 Jan 2025
https://github.com/chaoticbyte/audio-summarize
An audio summarizer (faster-whisper and BART glued together)
ai ai-summarizer audio bart ctranslate2 faster-whisper nlp speech-to-text summarization whisper
Last synced: 09 Oct 2024
https://github.com/jesse-c/local-audio-toolkit
Some handy tools to do with audio locally.
large-language-models lm-studio macos side-project whisper
Last synced: 29 Jan 2025
https://github.com/aws-samples/amazon-ivs-webgpu-captions-demo
This repository contains an experimental demo application that shows how you can add client-side auto-generated captions to Amazon IVS Real-time and Low-latency streams using transformers.js and WebGPU.
ai amazon-ivs aws captions experimental ivs-lowlatency ivs-realtime lambda lowlatency lvl-300 realtime serverless transformersjs web webgpu webrtc whisper
Last synced: 09 Oct 2024
https://github.com/saamerm/whisperkit-ios15
iOS 15 - On-device Inference of Whisper Speech Recognition Models for Apple Silicon
ios ios15 swiftui whisper whisper-ai
Last synced: 19 Jan 2025
https://github.com/voqal/browser
Natural speech browsing for the software developers of tomorrow
cef jcef openai realtime-api voice voice-assistant voice-browser voice-commands voice-control whisper
Last synced: 20 Oct 2024
https://github.com/fukuro-kun/wortweber
Wortweber ist ein sich in der Entwicklung befindendes Open-Source-Projekt, das Echtzeit-Sprachtranskription mit KI-Technologie erforscht. Es dient als Lern- und Experimentierplattform fรผr Spracherkennung in Deutsch und Englisch.
Last synced: 17 Jan 2025
https://github.com/cris-m/langgraph_examples
duckduckgo kokoro langgraph llama3-2 whisper
Last synced: 18 Jan 2025
https://github.com/niqifan007/openai-tts-stt-streamlit
A gui interface for tts (text-to-speech) and stt (speech-to-text) interfaces using the openai api developed by Streamlit, with a history functionไธไธชไฝฟ็จStreamlitๅผๅ็openai็apiๆฅๅฃ็tts๏ผๆๅญ่ฝฌ่ฏญ้ณ๏ผๅstt๏ผ่ฏญ้ณ่ฝฌๆๅญ๏ผๆฅๅฃ็gui็้ข๏ผๅธฆๆๅๅฒ่ฎฐๅฝๅ่ฝ
openai openai-api streamlit stt-gui tts tts-gui whisper whisper-api
Last synced: 09 Oct 2024
https://github.com/jgw96/speech-to-text-web-toolkit
Making Speech-To-Text on the web easy, both local and in the cloud
ai lit transformersjs webcomponents whisper
Last synced: 01 Feb 2025
https://github.com/marquesafonso/multilang-asr-captioner
A multilingual automatic speech recognition and video captioning tool using faster whisper. Supports real-time translation to english. Runs on consumer grade cpu.
automatic-speech-recognition captioning-videos faster-whisper whisper
Last synced: 24 Oct 2024
https://github.com/seanvelasco/ai
Cloudflare AI challenge submission: Slater - your virtual foreign language friend
ai artificial-intelligence language-learning llama2 llm m2m100 machine-learning whisper
Last synced: 03 Feb 2025
https://github.com/toLSC/tolsc-speech-to-text
Speech to text service for toLSC app implemented with OpenAI Whisper model
fastapi python speech-recognition speech-to-text tts whisper
Last synced: 24 Oct 2024
https://github.com/valiantlynx/custom-whisper-api
This project provides a custom API wrapper for the open-source Whisper model using FastAPI. It allows you to integrate Whisper into your applications for automatic speech recognition (ASR) tasks.
ai docker-compose fastapi python whisper
Last synced: 10 Jan 2025
https://github.com/mickekring/top-of-mind-clara
Clara รคr en prototyp som mรถjliggรถr att anonymt kunna gรถra sin rรถst hรถrd. Medarbetaren kan prata eller skriva in det du vill sรคga och AI anonymiserar det. Medarbetaren har dessutom tillgรฅng till en chatbot att rรฅdfrรฅga. Dรคrefter analyseras och sammanstรคlls alla medarbetares tankar i en dashboard.
ai chatbot feedback openai python streamlit transcription whisper
Last synced: 22 Dec 2024
https://github.com/Op27/meeting_minutes_generator
This Python application automates the process of generating meeting minutes from an audio recording. It uses the Whisper library for transcription and the OpenAI GPT models for summarizing content, then outputs the result in a Word document.
ai audio-processing document-automation meeting-minutes openai python speech-recognition text-summarization transcription whisper
Last synced: 24 Oct 2024
https://github.com/utrechtuniversity/transcription-d-lucea
python utrecht-university whisper
Last synced: 23 Jan 2025
https://github.com/bbc-esq/whisper-solo-with-gui
OpenAI's Whisper program with a simple lightweight GUI.
pyqt pyqt6 pyqt6-gui transcribe transcribe-audio-files translate whisper
Last synced: 11 Jan 2025
https://github.com/maylad31/colab-codes
some useful colab files
clip colab-notebook speech-recognition whisper zero-shot-classification
Last synced: 11 Jan 2025
https://github.com/jowadev/interview
Interview is an interactive application crafted to empower both students and professionals in honing their skills for job interviews.
interview-preparation job-interviews nextjs professional students whisper
Last synced: 14 Dec 2024
https://github.com/crone-ai/force-align-wordstamps
Takes audio (mp3) and text input (string) and force aligns the text to the audio. Uses stable-ts and whisperx.
captions faster-whisper force-alignment stable-ts whisper
Last synced: 17 Jan 2025
https://github.com/TranBaVinhSon/eth-decentralized-chat
Decentralized chat app by Ethereum Whisper protocol + Vuejs
ethereum vue vuejs whisper whisper-protocol
Last synced: 24 Oct 2024
https://github.com/volkansah/text-to-speech-pygui-for-whisper
This is a simple Python-based GUI application that allows users to generate speech from text using the OpenAI API. The application provides a user-friendly interface for inputting text and selecting from different voices to create personalized audio output.
openai openai-api python-gui-tkinter python3 whisper whisper-ai
Last synced: 27 Jan 2025
https://github.com/xaionaro-go/speech
A Speech-To-Text (with translation) library for Go; currently uses Whisper (runs locally if needed; no need in any API keys)
ai converter go golang library module package speech speech-recognition speech-to-text text whisper
Last synced: 13 Jan 2025
https://github.com/lazauk/aoai-entraidauth-sdkv1
Authenticating with Entra ID (former Azure AD) to access Azure OpenAI models in Python SDK v1.x
ai authentication azure azure-active-directory dall-e embeddings entra-id gpt openai whisper
Last synced: 12 Jan 2025
https://github.com/yc-w-cn/s-wave
S-WAVE is a browser-based podcast reading app with AI transcription. User data is stored locally. MIT License.
podcast pouchdb typescript wasm whisper whisper-cpp
Last synced: 28 Dec 2024
https://github.com/nerdimite/meetsy-app
Frontend for the Workshop on Building an End-to-End AI Meeting Assistant
gpt-3 nextjs sentence-transformers tailwindcss whisper
Last synced: 24 Oct 2024
https://github.com/jojasadventure/whisper-client
Very simple Python based client for Whisper compatible endpoint
desktop-app dictation faster-whisper macos productivity python speech-to-text stt whisper
Last synced: 09 Oct 2024
https://github.com/pdcalado/waste
Whisper Audio Service for Transcription and Ergonomics
productivity rofi transcription tts whisper
Last synced: 21 Jan 2025
https://github.com/pkarpovich/kira-client
An AI-powered voice automation tool for IoT, integrating voice-triggered commands, OpenAI-driven intent recognition, and HTTP server management for seamless control of smart devices
ai-assistant intent-classification porcupine trigger-word-detection whisper
Last synced: 13 Jan 2025
https://github.com/adamelkholyy/whisper-yt
Toolkit for using Whisper to transcribe YouTube videos. Includes Whisper transcription of YouTube videos, conversion of YouTube video into HuggingFace dataset (using audio and subtitles) and evaluation of Whisper transcription against YouTube subtitles
asr diarization huggingface-datasets pyannote transcription whisper word-error-rate youtube
Last synced: 10 Dec 2024
https://github.com/pawelzeja098/whisper-video-transcription
Testing whisper Open-AI to transcribe videos
audio mp3 mp4 transcription video whisper whisper-ai
Last synced: 27 Jan 2025
https://github.com/gangula-karthik/memo-mate
๐ Discord meetings redefined with Memo Mate: Transcribe, summarize, and automate minutes seamlessly! โจ
discord-bot huggingface mistral py-cord speech-to-text transcribe whisper
Last synced: 22 Dec 2024
https://github.com/gabriellopesdesouza2002/funcspy
Functions to help you develop any program or script you want
automation chatbot dall-e email email-library ocr openai-api openai-chatgpt openai-whisper pdf pdf-tools python regex selenium selenium-webdriver whisper
Last synced: 30 Oct 2024
https://github.com/aspadax/subtitlegenerator
Automatically generate a subtitle for your video.
gpt machine-learning openai rust streamlit subtitles-generator whisper
Last synced: 09 Oct 2024
https://github.com/shtirmann/v2t
Telegram bot which automatically transcribes all voice and video messages to text.
ai aiogram faster-whisper python telegram-bot telegram-bot-python voice-to-text whisper
Last synced: 09 Oct 2024
https://github.com/i4ds/whisper-finetune
This repository contains code for fine-tuning the Whisper speech-to-text model.
fine-tuning nlp speech-to-text whisper
Last synced: 09 Oct 2024
https://github.com/nicknaskida/insanely-fast-whisper
Incredibly fast Whisper-large-v3 with speaker diarization
diarization speaker-diarization transfromers whisper whisper-ai whisper-faster whisper-large
Last synced: 19 Jan 2025
https://github.com/markshawn2020/2025-02-03_lex-fridman-deepseek
Transcription and translation scripts for Lex Fridman podcast about DeepSeek, at 2025-02-03
assemblyai deepl deepseek lexfridman whisper xunfei
Last synced: 04 Feb 2025
https://github.com/nerdimite/meetsy-backend
AI Backend for the Workshop on Building an End-to-End AI Meeting Assistant
gpt-3 nextjs sentence-transformers tailwindcss whisper
Last synced: 24 Oct 2024
https://github.com/sbadulin/obsidian-dictation-plugin
Obsidian dictation plugin
dictation gpt-35-turbo obsidian obsidian-plugin openai speech-to-text whisper
Last synced: 02 Feb 2025
https://github.com/tracywong117/ai-learning-material-from-video
Support subtitling, translating, RAG to generate language learning material from video.
ai auto-subtitle gpt-translate groq groq-api rag subtitles-generator translate whisper
Last synced: 19 Jan 2025
https://github.com/slinusc/speaker_identification_evaluation
Evaluating the Effectiveness of Transformer Layers in Wav2Vec 2.0, XLS-R, and Whisper for Speaker Identification Tasks
Last synced: 09 Oct 2024
https://github.com/mikeesto/whispercpp-android
An Android app using whisper.cpp to do voice-to-text transcriptions
android kotlin speech-to-text whisper whisper-cpp
Last synced: 17 Dec 2024
https://github.com/thewh1teagle/whisper.zig
Transcribe audio with whisper in zig
Last synced: 24 Jan 2025
https://github.com/notyusheng/transcribe-translate
Local web app for transcription and translation services for audio and video using Whisper models
docker full-stack nodejs react reactjs self-hosted speech-to-text transcribe translate whisper
Last synced: 11 Oct 2024
https://github.com/lidedongsn/cut.ai
cut.ai ๆฏไธไธชAI้ณ่ง้ขๅช่พๅทฅๅ ท๏ผ่ฏญ้ณ่ฝฌๅๅบไบwhisper
Last synced: 17 Jan 2025
https://github.com/wtlow003/auto-subtitles
CLI tool to transcribe (+ translate) videos and embed subtitles automatically.
faster-whisper nllb subtitles subtitles-generator translation whisper whisper-cpp
Last synced: 15 Nov 2024
https://github.com/Shtirmann/V2T
Telegram bot which automatically transcribes all voice and video messages to text.
ai aiogram faster-whisper python telegram-bot telegram-bot-python voice-to-text whisper
Last synced: 24 Oct 2024