Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Whisper
Whisper is an automatic speech recognition (ASR) model developed by OpenAI. It is an encoder-decoder transformer trained on roughly 680,000 hours of multilingual, multitask audio data collected from the web. Whisper can be used for multilingual speech transcription, speech translation into English, and spoken-language identification. OpenAI released the model code and weights as open source in September 2022 to encourage research on robust speech processing, and both are publicly available from the repository listed below.
- GitHub: https://github.com/topics/whisper
- Repo: https://github.com/openai/whisper
- Created by: OpenAI
- Released: September 2022
- Related Topics: machine-learning, artificial-intelligence, language-modeling
- Last updated: 2024-12-24 00:28:14 UTC
https://github.com/i4ds/whisper-finetune
This repository contains code for fine-tuning the Whisper speech-to-text model.
fine-tuning nlp speech-to-text whisper
Last synced: 09 Oct 2024
https://github.com/extrange/transcription-benchmarks
Speech to text model benchmarks
Last synced: 08 Dec 2024
https://github.com/shani-sinojiya/sandalquest
AI/ML project for recognizing colloquial Kannada speech and building a speech-based Q&A system focused on sandalwood cultivation.
ai audio-processing data-augmentation deep-learning machine-learning mongodb nlp python pytorch question-answering speech-based-question-answering-system speech-recognition whisper
Last synced: 02 Dec 2024
https://github.com/nicknaskida/insanely-fast-whisper
Incredibly fast Whisper-large-v3 with speaker diarization
diarization speaker-diarization transfromers whisper whisper-ai whisper-faster whisper-large
Last synced: 26 Sep 2024
https://github.com/volkansah/text-to-speech-pygui-for-whisper
This is a simple Python-based GUI application that allows users to generate speech from text using the OpenAI API. The application provides a user-friendly interface for inputting text and selecting from different voices to create personalized audio output.
openai openai-api python-gui-tkinter python3 whisper whisper-ai
Last synced: 28 Nov 2024
https://github.com/etienneab3d/srt-sync
Synchronize SRT timestamps over an existing accurate transcription
aligner asr nlp subtitles text-to-speech whisper
Last synced: 19 Dec 2024
https://github.com/Shtirmann/V2T
Telegram bot which automatically transcribes all voice and video messages to text.
ai aiogram faster-whisper python telegram-bot telegram-bot-python voice-to-text whisper
Last synced: 24 Oct 2024
https://github.com/stefanasandei/youtube-to-text
Speech to text for any YouTube video.
ai api flask openai python server speech-to-text web-server whisper youtube youtube-dl
Last synced: 09 Nov 2024
https://github.com/maawad/luna
Personal assistant
bot openai personal-assistant whisper
Last synced: 17 Dec 2024
https://github.com/adamelkholyy/whisper-yt
Toolkit for using Whisper to transcribe YouTube videos. Includes Whisper transcription of YouTube videos, conversion of YouTube video into HuggingFace dataset (using audio and subtitles) and evaluation of Whisper transcription against YouTube subtitles
asr diarization huggingface-datasets pyannote transcription whisper word-error-rate youtube
Last synced: 10 Dec 2024
https://github.com/becomingbabyman/eunoia-desktop
Local desktop transcription and search for Apple Voice Memos and videos
search second-brain transcription videos voice-memos whisper
Last synced: 25 Dec 2024
https://github.com/bbc-esq/whisper-solo-with-gui
OpenAI's Whisper program with a simple lightweight GUI.
pyqt pyqt6 pyqt6-gui transcribe transcribe-audio-files translate whisper
Last synced: 12 Nov 2024
https://github.com/bbc-esq/batch-openai-whisper-ctranslate2
Batch process multiple files using the faster CTranslate2 implementation of OpenAI's Whisper
batch-processing batch-script openai openai-whisper pyside6 transcription translation whisper whisperx
Last synced: 12 Nov 2024
https://github.com/szilvia-csernus/openai-audio-api-calls
Speech-to-text and text-to-speech API call examples, using OpenAI's whisper-1 and tts-1 models.
jupyter-notebook openai openai-api tts-1 whisper
Last synced: 09 Oct 2024
https://github.com/doctorpok42/pheere-app
Pheere is a simple virtual assistant
ai chatgpt desktop-app elevenlabs nextjs scss tauri ts virtual-assistant whisper
Last synced: 11 Nov 2024
https://github.com/mooerslab/bash-whisper-transcription
Bash function to ease the transcription of audio files with OpenAI's whisper.
asr audio audio-file-trancription audio-messages automate-the-boring-stuff automatic-speech-recognition automation bash bash-function beginner-friendly speech-to-text stt whisper
Last synced: 14 Dec 2024
https://github.com/marty1885/useful-whisper-server
Whisper server based on useful-transformers for the RK3588
npu rk3588 rockchip useful-transformers whisper
Last synced: 05 Dec 2024
https://github.com/mikeesto/whispercpp-android
An Android app using whisper.cpp to do voice-to-text transcriptions
android kotlin speech-to-text whisper whisper-cpp
Last synced: 17 Dec 2024
https://github.com/topdev0215/AudioMultifunctionChatbot
This app enables users to either record or upload audio files. It then uses the OpenAI API (Whisper, GPT-4) to generate transcriptions, summaries, fact checks, sentiment analysis, and text metrics. Users can also intelligently chat about their transcriptions with a GPT-4 chatbot. Data is stored relationally in SQLite and also vectorized in Pinecone.
gpt4 langcha nltk openai python3 sqlite3 streamlit strean whisper
Last synced: 24 Oct 2024
https://github.com/ahmetoner/master-whisper
Master Whisper transcription with CTranslate2
deep-learning inference openai quantization speech-recognition speech-to-text transformer whisper
Last synced: 10 Nov 2024
https://github.com/rokbenko/arctic-meet
ArcticMeet is an AI meeting assistant using Streamlit as a GUI and the Snowflake Arctic LLM via the Snowflake Cortex
ffmpeg pandas plotly python pytorch snowflake snowflake-arctic snowflake-cortex snowpark streamlit transformers whisper
Last synced: 12 Nov 2024
https://github.com/maylad31/colab-codes
some useful colab files
clip colab-notebook speech-recognition whisper zero-shot-classification
Last synced: 12 Nov 2024
https://github.com/h3yn3s/tl-dl
A selfhostable webapp which helps you read those uselessly long (by nature) voice messages with the power of AI.
Last synced: 24 Oct 2024
https://github.com/schnoddelbotz/whisper-ui
Transcribe audio/video to text, locally on macOS, Linux and Windows. A simple whisper.cpp wrapper/UI built with Go/Fyne.
ffmpeg ffmpeg-wrapper fyne gui local privacy speech-to-text transcription whisper whisper-cpp
Last synced: 22 Dec 2024
https://github.com/tylim88/voicefu
Translate Speech Into Japanese
chatgpt speech-synthesis voicevox whisper
Last synced: 18 Dec 2024
https://github.com/elmiraghorbani/gpt-speaker-diarization
Conversational speaker diarization using OpenAI language models (GPT-4) and OpenAI Whisper.
asr diarization gpt-4 openai speaker-diarization speech-recognition speech-to-text voice-activity-detection whisper youtube-dl
Last synced: 29 Nov 2024
https://github.com/baristikir/voice-typing
Simple desktop application with voice typing features. Runs locally, transcribes locally, and works fully offline with support for real-time transcription. Powered by OpenAI Whisper ASR models and the whisper.cpp inference engine.
Last synced: 24 Dec 2024
https://github.com/pdcalado/waste
Whisper Audio Service for Transcription and Ergonomics
productivity rofi transcription tts whisper
Last synced: 20 Nov 2024
https://github.com/notyusheng/transcribe-translate
Local web app for transcription and translation services for audio and video using Whisper models
docker full-stack nodejs react reactjs self-hosted speech-to-text transcribe translate whisper
Last synced: 11 Oct 2024
https://github.com/natanielf/lecsum
Automatically transcribe and summarize lecture recordings completely on-device using AI.
ollama ollama-python whisper whisper-ai
Last synced: 18 Dec 2024
https://github.com/brentwong-kiel1997/brents_ai_language_school
Use AI such as ChatGPT and Whisper to learn foreign languages from YouTube videos
ai chatgpt foreign-language openai openai-api whisper whisper-ai youtube
Last synced: 08 Nov 2024
https://github.com/nerdimite/meetsy-backend
AI Backend for the Workshop on Building an End-to-End AI Meeting Assistant
gpt-3 nextjs sentence-transformers tailwindcss whisper
Last synced: 24 Oct 2024
https://github.com/utrechtuniversity/transcription-d-lucea
python utrecht-university whisper
Last synced: 22 Nov 2024
https://github.com/silentsoft/whiscribe
🎬 A tool with a UI that transcribes audio files into subtitles using OpenAI's Whisper and runs completely on your local machine.
audio-transcription openai-whisper srt subtitle whisper
Last synced: 11 Nov 2024
https://github.com/egorsmkv/star-adapt-uk
Fork of https://github.com/YUCHEN005/STAR-Adapt with some modifications for Ukrainian.
asr speech-recognition ukrainian whisper
Last synced: 19 Dec 2024
https://github.com/winstxnhdw/capgen
A fast CPU-first video/audio transcriber for generating caption files with Whisper and CTranslate2, hosted on Hugging Face Spaces.
asr automatic-speech-recognition caddy ctranslate2 docker fastapi huggingface huggingface-spaces uvicorn-gunicorn whisper
Last synced: 23 Oct 2024
https://github.com/nri12/filter_voice
A project for filtering videos and muting chosen keywords.
Last synced: 19 Dec 2024
https://github.com/oov/aviutl_subtitler
A plugin for transcribing with Whisper in an AviUtl + Extended Editing plugin environment.
Last synced: 19 Dec 2024
https://github.com/team-mansumugang/mansumugang-backend
The Spring Boot application for the Mansumugang service.
aws github-actions jpa jpa-hibernate spring-boot whisper
Last synced: 09 Oct 2024
https://github.com/canaxs/whisper-core
An application where users can make rumor-based news and earn money in return.
mysql panel spring spring-boot whisper
Last synced: 19 Dec 2024
https://github.com/gabriellopesdesouza2002/funcspy
Functions to help you develop any program or script you want
automation chatbot dall-e email email-library ocr openai-api openai-chatgpt openai-whisper pdf pdf-tools python regex selenium selenium-webdriver whisper
Last synced: 30 Oct 2024
https://github.com/toLSC/tolsc-speech-to-text
Speech to text service for toLSC app implemented with OpenAI Whisper model
fastapi python speech-recognition speech-to-text tts whisper
Last synced: 24 Oct 2024
https://github.com/mickekring/top-of-mind-clara
Clara is a prototype that lets employees make their voices heard anonymously. An employee can speak or type what they want to say, and AI anonymizes it. The employee also has access to a chatbot to consult. All employees' thoughts are then analyzed and compiled in a dashboard.
ai chatbot feedback openai python streamlit transcription whisper
Last synced: 22 Dec 2024
https://github.com/jowadev/interview
Interview is an interactive application crafted to empower both students and professionals in honing their skills for job interviews.
interview-preparation job-interviews nextjs professional students whisper
Last synced: 14 Dec 2024
https://github.com/aspadax/subtitlegenerator
Automatically generate a subtitle for your video.
gpt machine-learning openai rust streamlit subtitles-generator whisper
Last synced: 09 Oct 2024
https://github.com/troyanovsky/llm_summarizer
Use LLM and Whisper to summarize long text and audio/video
Last synced: 13 Nov 2024
https://github.com/bhattbhavesh91/openai-whisper-benchmarking
Comparing the performance of OpenAI's Whisper model on a GPU vs OpenAI's API
gpu openai speech-to-text whisper
Last synced: 16 Nov 2024
https://github.com/abdnh/anki-asr
Anki add-on for speech recognition
anki anki-addon deepgram speech-recognition whisper
Last synced: 24 Nov 2024
https://github.com/voqal/browser
Natural speech browsing for the software developers of tomorrow
cef jcef openai realtime-api voice voice-assistant voice-browser voice-commands voice-control whisper
Last synced: 20 Oct 2024
https://github.com/ayeshaaaaaaaaa/ai-powered-video-analysis-with-object-detection-and-detailed-scene-narratives
AI-driven video analysis system that extracts and transcribes audio with Whisper, detects objects using YOLO, and generates comprehensive scene descriptions with GPT-2. The project combines transcriptions and object detections to produce detailed, context-aware video narratives.
bart gpt2 video-analysis whisper yolov8
Last synced: 08 Nov 2024
https://github.com/sumitesh9/localizedwhisper
An initiative to make OpenAI Whisper more localized by adding more languages.
albanian albanian-language huggingface openai speech speech-to-text whisper
Last synced: 08 Nov 2024
https://github.com/brentwong-kiel1997/ai_language_school_based_on_django_and_openai
Django and OpenAI API example use case
django gpt-4 openai openai-api whisper
Last synced: 09 Oct 2024
https://github.com/fukuro-kun/wortweber
Wortweber is an open-source project under development that explores real-time speech transcription with AI technology. It serves as a learning and experimentation platform for speech recognition in German and English.
Last synced: 17 Nov 2024
https://github.com/valiantlynx/custom-whisper-api
This project provides a custom API wrapper for the open-source Whisper model using FastAPI. It allows you to integrate Whisper into your applications for automatic speech recognition (ASR) tasks.
ai docker-compose fastapi python whisper
Last synced: 22 Dec 2024
https://github.com/wtlow003/auto-subtitles
CLI tool to transcribe (+ translate) videos and embed subtitles automatically.
faster-whisper nllb subtitles subtitles-generator translation whisper whisper-cpp
Last synced: 15 Nov 2024
https://github.com/yc-w-cn/s-wave
S-WAVE is a browser-based podcast reading app with AI transcription. User data is stored locally. MIT License.
podcast pouchdb typescript wasm whisper whisper-cpp
Last synced: 07 Nov 2024
https://github.com/julienvincent/whalker
Whisper talker
whisper whisper-ai whisper-cpp
Last synced: 07 Nov 2024
https://github.com/antoniosbarotsis/telegram-transcriber
A Telegram bot for transcribing voice messages
telegram transcribe voice whisper
Last synced: 31 Oct 2024
https://github.com/tranbavinhson/eth-decentralized-chat
Decentralized chat app built on the Ethereum Whisper protocol and Vue.js
ethereum vue vuejs whisper whisper-protocol
Last synced: 06 Nov 2024
https://github.com/jojasadventure/whisper-client
A very simple Python-based client for a Whisper-compatible endpoint
desktop-app dictation faster-whisper macos productivity python speech-to-text stt whisper
Last synced: 09 Oct 2024
https://github.com/alancunningham/chatgpt-assistant
A ChatGPT assistant with voice activation and image generation, connected to a Raspberry Pi display.
chatgpt chatgpt-api dall-e dall-e-api porcupine python raspberry-pi whisper
Last synced: 10 Nov 2024
https://github.com/marquesafonso/multilang-asr-captioner
A multilingual automatic speech recognition and video captioning tool using faster-whisper. Supports real-time translation to English. Runs on consumer-grade CPUs.
automatic-speech-recognition captioning-videos faster-whisper whisper
Last synced: 24 Oct 2024
https://github.com/aws-samples/amazon-ivs-webgpu-captions-demo
This repository contains an experimental demo application that shows how you can add client-side auto-generated captions to Amazon IVS Real-time and Low-latency streams using transformers.js and WebGPU.
ai amazon-ivs aws captions experimental ivs-lowlatency ivs-realtime lambda lowlatency lvl-300 realtime serverless transformersjs web webgpu webrtc whisper
Last synced: 09 Oct 2024
https://github.com/chaoticbyte/audio-summarize
An audio summarizer (faster-whisper and BART glued together)
ai ai-summarizer audio bart ctranslate2 faster-whisper nlp speech-to-text summarization whisper
Last synced: 09 Oct 2024
https://github.com/ioriens/whisper-video
Generate subtitles for all the videos in a folder with OpenAI's Whisper, privately on your computer.
subtitle-generator video-to-audio video-to-text whisper
Last synced: 17 Nov 2024
https://github.com/juanestban/whisper-tnode
cli ts typescript whisper whisper-cpp whisper-ia whisper-node whisper-node-ts
Last synced: 21 Dec 2024
https://github.com/adisol07/sharpspeech
SharpSpeech is a free, local, and open source way to do speech and wake word recognition.
audio speech speech-recognition speech-to-text wake-word-detection wakeword whisper whisper-ai
Last synced: 19 Dec 2024
https://github.com/bigyaa/transcription-system
This versatile tool is designed for anyone in need of a robust solution for transcribing and diarizing large volumes of audio files. Whether you are dealing with terabytes or even larger quantities, our tool ensures efficient and accurate processing. Ideal for researchers, content creators, and businesses.
accessibility diarization speech-to-text storytelling-with-data transcription whisper
Last synced: 19 Dec 2024
https://github.com/armaggheddon/whisper2me
whisper2me is a Telegram bot written with pyTelegramBotAPI that uses OpenAI's Whisper to perform speech-to-text so you no longer have to listen to voice messages 🤫🔇
docker openia pytelegrambotapi python whisper
Last synced: 25 Nov 2024
https://github.com/deepbiolab/customer-complaint-classification
A GenAI-powered pipeline leveraging Whisper, DALL-E, and GPT to transform customer complaints into actionable insights with automated transcription, visualization, and classification.
Last synced: 23 Nov 2024
https://github.com/huuquyet/phowhisper-tiny
Converted clone of PhoWhisper: Automatic Speech Recognition for Vietnamese (2024)
onnx-models phowhisper speech-recognition transformersjs vietnamese vinai whisper
Last synced: 06 Dec 2024
https://github.com/sbadulin/obsidian-dictation-plugin
Obsidian dictation plugin
dictation gpt-35-turbo obsidian obsidian-plugin openai speech-to-text whisper
Last synced: 07 Dec 2024
https://github.com/bilelouahmed/vocal-assistant
Python voice assistant (based on SpeechRecognition, Whisper and XTTS models) designed to transcribe speech to text, translate across languages, engage in chat mode, and ultimately respond vocally.
chatbot llm mistral-7b neo4j python rag speech-recognition text-to-speech transcription whisper xtts
Last synced: 21 Dec 2024
https://github.com/mariatepei/vt_thesis_mtepei
This repository accompanies my MSc Thesis for the degree Voice Technology, storing all referenced data and other relevant resources.
data-augmentation fastspeech2 speech-recognition whisper
Last synced: 09 Oct 2024
https://github.com/huuquyet/phowhisper-small
Converted clone of PhoWhisper: Automatic Speech Recognition for Vietnamese (2024)
onnx-models phowhisper speech-recognition transformersjs vietnamese vinai whisper
Last synced: 06 Dec 2024
https://github.com/hsiehbocheng/yt-gen-caption
This is a project for generating captions for YouTube videos using Faster Whisper & yt_dlp.
Last synced: 19 Dec 2024
https://github.com/jgw96/speech-to-text-web-toolkit
Making Speech-To-Text on the web easy, both local and in the cloud
ai lit transformersjs webcomponents whisper
Last synced: 06 Dec 2024
https://github.com/yjg30737/pyqt-simple-whisper-gui
Whisper text-to-speech, speech-to-text example in PyQt5 GUI
openai pyqt pyqt-ai pyqt5 pyqt5-desktop-application pyqt5-examples pyqt5-gui whisper
Last synced: 06 Dec 2024
https://github.com/vifill/audio-recorder-and-summarizer
This project is a Python script that records system audio on macOS using BlackHole, transcribes the audio using OpenAI's Whisper API, and summarizes the transcription using OpenAI's GPT models
ai audio blackhole gpt openai records summarize system whisper
Last synced: 20 Dec 2024
https://github.com/ekito-station/whisper-api-unity
A sample that performs transcription in Unity using the OpenAI Whisper API.
Last synced: 20 Dec 2024
https://github.com/arkapravo-ghosh/speech-to-text
Speech to Text Transcription using OpenAI Whisper v3 and FastAPI
ai fastapi huggingface machine-learning openai python3 speech-to-text transformers whisper
Last synced: 21 Dec 2024
https://github.com/theaussiepom/wyoming-openai
OpenAI STT and TTS support for the Wyoming protocol
home-assistant home-assistant-assist openai sst tts whisper wyoming
Last synced: 21 Dec 2024
https://github.com/electroneum/electroneum-web3.js
Electroneum SmartChain JavaScript API
api electroneum ethereum etn-sc javascript swarm typescript whisper
Last synced: 26 Sep 2024
https://github.com/tomdewildt/whisper-experiment
Experiments using the Whisper model from OpenAI
colab jupyter python transcribe transformers translate whisper
Last synced: 07 Nov 2024
https://github.com/dheison0/subcreator
A subtitle creator, translator, and embedder tool made using AI
ai machine-learning ml python subtitles video-processing whisper
Last synced: 09 Oct 2024
https://github.com/ivanrj7j/transcription
This project transcribes audio using Whisper and provides an API
ai api flask transcription whisper
Last synced: 09 Oct 2024
https://github.com/julrog/jokes-on-you
Storyteller
ggj2024 global-game-jam openai unity whisper
Last synced: 17 Dec 2024
https://github.com/xawos/owt
🦙🗣️ Ollama and Whisper Telegram bot, with advanced configuration
ai-bots local-ai ollama telegram-aichatbot telegram-bots whisper
Last synced: 03 Dec 2024