Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Whisper
Whisper is an autoregressive language model developed by OpenAI. It is trained on a large corpus of text using a transformer architecture and is capable of generating high-quality natural language text. Whisper can be used for tasks such as language modeling, text completion, and text generation. It has shown impressive performance on various benchmarks and has been released by OpenAI to encourage research in the field of language modeling. Whisper is not yet available for public use, but it has the potential to transform the field of natural language processing and generate new opportunities for language-based applications.
- GitHub: https://github.com/topics/whisper
- Repo: https://github.com/openai/whisper
- Created by: OpenAI
- Released: August 2021
- Related Topics: machine-learning, artificial-intelligence, language-modeling,
- Last updated: 2024-12-23 00:26:54 UTC
- JSON Representation
https://github.com/ognisty321/whisper-transcription-ui
Whisper Transcription UI is a user-friendly graphical interface for whisper-standalone-win. Transcribe and translate audio/video files effortlessly with customizable settings and saved preferences.
gui python transcription ui whisper whisper-standalone-win
Last synced: 09 Oct 2024
https://github.com/kristofferv98/voiceprocessingtoolkit
The VoiceProcessingToolkit is an all-encompassing suite designed for sophisticated voice detection, wake word recognition, text-to-speech synthesis, and advanced audio processing. It offers intuitive interfaces to streamline the integration of voice processing capabilities into your applications
api audio automation elevenlabs gpt-4 multithreading openai picovoice python speech text-to-speech transcription utility voice voice-processing wake-word whisper whisper-api
Last synced: 02 Nov 2024
https://github.com/lhr0909/live-subtitles-rokid-ar
通过Rokid AR眼镜和OpenAI Whisper实现现实生活中的字幕
augmented-reality openai real-time rokid subtitles whisper
Last synced: 11 Oct 2024
https://github.com/erkara/Rise-of-Transfer-Learning
you will find brief code implementations of some of the latest developments in AI, including Stable Diffusion, Whisper, YOLO and HuggigFace Transformers
gpt-3 huggingface openai stable-diffusion transfer-learning whisper yolov5
Last synced: 24 Oct 2024
https://github.com/semyon-dev/whissage
the backend of blockchain-based messenger
blockchain blockchain-messenger ethereum geth messenger whisper whisper-protocol
Last synced: 30 Oct 2024
https://github.com/tristan-mcinnis/realtime-whisper-console-transcriber
A real-time speech-to-text transcriber using the Whisper model, designed for efficiency and ease of use in the console. This tool leverages the faster_whisper library and Rich to provide a seamless user experience for transcribing audio inputs on the fly.
asr console python real-time speech-recognition speech-to-text terminal transcription whisper
Last synced: 12 Oct 2024
https://github.com/botisan-ai/whisper-aws-stack
Deplay Whisper on AWS Scalably
aws cdk ecs fargate fastapi openai silero-vad whisper
Last synced: 24 Oct 2024
https://github.com/lissettecarlr/auto-subtitle
使用faster-whisper本地模型提取音频,生成srt和ass字幕文件。支持gpt等在线翻译,生成翻译后字幕文件。(Use the faster-whisper local model to extract audio and generate srt and ass subtitle files. Support online translation such as gpt to generate translated subtitle files.)
faster-whisper streamlit subtitles whisper
Last synced: 19 Nov 2024
https://github.com/umerarif01/ai-translator
AI Translator: Fast and Accurate Translations with Next.js and OpenAI's Whisper and GPT-3 APIs
Last synced: 24 Oct 2024
https://github.com/0x20f/listen-wise
Save the last 30 seconds of audio to text using ai. Send that text to a notion page, readwise, obsidian, or just save it locally in a text file.
ai notion openai speech-to-text transcription whisper
Last synced: 30 Oct 2024
https://github.com/manucabral/quick-subtitles
An easy way to generate SRT subtitles from a video in Windows.
audio-to-text srt srt-subtitles subtitles subtitles-generator transcription whisper whisper-ai windows
Last synced: 03 Nov 2024
https://github.com/yanivhaliwa/linux-stuff
ai arp automation bash cyber device-discovery gpt linux monitoring openai package-manager python scanning scripts subtitle utilities whisper
Last synced: 09 Dec 2024
https://github.com/driftingruby/395-transcribing-with-artificial-intelligence
In this episode, we look at creating an audio transcription service which allows files uploaded from Active Storage to be transcribed with Artificial Intelligence. However, there are a lot of considerations around the approach from both a performance and thread safety perspectives.
artificial-intelligence openai ruby ruby-on-rails whisper
Last synced: 23 Dec 2024
https://github.com/zaneh/heybilly
🗣️ It's like Alexa, but for your computer. Highly modular, real-time voice assistant. Built using self-assembling graphs.
contributions-welcome graph python3 rabbitmq self-hosted tts voice-assistant whisper
Last synced: 08 Dec 2024
https://github.com/achraf-oujjir/profgpt-smart-vr-professor
👨🏫🤖 ProfGPT: AI-powered VR professor with electrical circuits lab table ⚡💡 Built with Unity 🎮 GPT and Whisper APIs 🧠 and AWS Polly 🦜🗣️
ai-education aws-polly chatgpt-api csharp education oculus-quest-2 openai-api openai-whisper speech-to-text text-to-speech unity3d virtual-reality vr whisper
Last synced: 03 Nov 2024
https://github.com/hrehfeld/archlinux-whisper.cpp-model
PKGBUILD generation for whisper.cpp models
archlinux aur pkgbuild whisper whisper-cpp
Last synced: 14 Dec 2024
https://github.com/patbqc/thoughtforgeai
Forge your thoughts through an AI powered brainstorming session !
ai anthropic brainstorm brainstorming brainstorms mobile openai reactnative whisper
Last synced: 31 Oct 2024
https://github.com/lbrndnr/nutshell-macos
An AI-powered note-taking app for your meetings. Built for macOS using SwiftUI.
Last synced: 13 Nov 2024
https://github.com/chriamue/whisper-example
Docker compose environment and example for whisper.
docker-compose geth p2p-network shh web3js whisper
Last synced: 15 Dec 2024
https://github.com/sandy1990418/chinesetaiwanesewhisper
This repository focuses on leveraging OpenAI's Whisper model for speech recognition in Chinese (Mandarin) and Taiwanese Hokkien languages. It includes tools and scripts for data preprocessing, model training, and evaluation, tailored to improve speech recognition accuracy for these languages.
asr chinese gradio realtime speech-to-text streaming-audio taiwanese whisper
Last synced: 09 Oct 2024
https://github.com/awaisoem/interview-lingo
(Aug 2024) AI assistant which help with interviews, hiring, personality development and communication skills
ai ai71 drizzle-orm falcon neondb nextjs postgresql tailwindcss whisper
Last synced: 09 Oct 2024
https://github.com/walkswithaswagger/whisperforge
WhisperForge is a Python tool that leverages OpenAI's Whisper model to transcribe large audio files. It automatically splits files into manageable chunks, processes them, and combines the transcriptions into a single document. Ideal for handling lengthy recordings and generating clear, organized transcriptions.
audio-transcription openai python whisper
Last synced: 21 Nov 2024
https://github.com/micartey/karl-the-voice-assistant
Voice Assistant with the power of OpenAI's ChatGPT
ai chatgpt home-assistant karl openai raspberry-pi voice-assistant whisper
Last synced: 16 Nov 2024
https://github.com/weihanchen/google-colab-python-learn
📚 Learn Google Colab、Python、ML、OpenAI、Whisper、spaCy、NLP、HuggingFace
colab-notebook huggingface matplotlib natural-language-processing nlp openai pandas python spacy whisper
Last synced: 11 Nov 2024
https://github.com/gustavz/audio-to-text
streamlit app to transcript audio to text using openai's whisper library
audio-to-text streamlit whisper
Last synced: 19 Nov 2024
https://github.com/cgbur/whisp
A lightweight and minimal desktop speech-to-text tool.
accessibility speech-to-text whisper
Last synced: 07 Dec 2024
https://github.com/jorgeandrespadilla/avtools
AV Tools - A collection of CLI tools for audio and video processing (powered by AI). Audio transcription, Video to Audio conversion, YouTube downloader.
Last synced: 13 Nov 2024
https://github.com/redocrepus/arkode
Code in VS Code, using your voice, fmedia, WhisperAI and ChatGPT
accessibility chatgpt chatgpt-api code-assistant coding-assistant coding-by-voice developer-tools openai openai-api programming-assistant programming-by-voice visual-studio-code visual-studio-code-extension visualstudiocode voice-coding voicecode voicecoding vscode-extension whisper whisper-api
Last synced: 24 Oct 2024
https://github.com/egorsmkv/whisper-ukrainian
Trainer and Evaluation scripts for fine-tuning Whisper models for the Ukrainian language
asr automatic-speech-recognition openai speech-recognition ukrainian whisper
Last synced: 18 Oct 2024
https://github.com/schibsted/sum
Sum, a powerful tool for enhancing your articles with the help of ChatGPT.
chatgpt nextjs nrk openai tailwindcss vg whisper
Last synced: 13 Nov 2024
https://github.com/ribartra/call-listener_bot
A bot that downloads, transcribes and analyzes calls to find insights for sales advisors.
api audio-analyser call-bot call-listener drive gcp openai python whisper
Last synced: 09 Oct 2024
https://github.com/t0mer/telessist
Telessist allows you to contact GPT3 directly from WhatsApp and not only that. Telessist also allows you to save your own personal data and later search and retrieve it using GPT3 to generate a response. In the examples folder, you can see several examples of how to use this bot so you don't have to remember anything ever again.
assistant chatgpt dall-e docker openapi python3 telegram telegram-bot weather whisper
Last synced: 06 Dec 2024
https://github.com/CrabAss/dCollab
Decentralized e-Learning Collaboration Platform as a Capstone Project (COMP4913, PolyU)
comp4913 dapp ethereum javascript react whisper
Last synced: 24 Oct 2024
https://github.com/nexuslux/realtime-whisper-console-transcriber
A real-time speech-to-text transcriber using the Whisper model, designed for efficiency and ease of use in the console. This tool leverages the faster_whisper library and Rich to provide a seamless user experience for transcribing audio inputs on the fly.
asr console python real-time speech-recognition speech-to-text terminal transcription whisper
Last synced: 09 Oct 2024
https://github.com/fabio-garavini/ha-groq-whisper-stt-api
HACS custom integration for using GroqCloud speech-to-text (Whisper) API in the Assist pipeline, reducing the workload on the Home Assistant server.
groq-api home-assistant stt whisper
Last synced: 29 Sep 2024
https://github.com/200ok-ch/voice_vault
voice_vault enables you to record and archive all your meetings and conversations with ease. Later, search through them with lightning speed using full-text search.
Last synced: 19 Nov 2024
https://github.com/sslava/ai-voice-chat
AI Voice Chat
nodejs openai tts voice-recognition whisper
Last synced: 20 Oct 2024
https://github.com/jech/galene-stt
Speech-to-text support for Galene
galene stt videoconference webrtc whisper whisper-cpp
Last synced: 09 Oct 2024
https://github.com/ckaznable/yt-cli-live
Youtube Text Live Streaming in CLI
asr cli rust silero-vad whisper whisper-cpp youtube
Last synced: 12 Nov 2024
https://github.com/fly-apps/cog-whisper
Run OpenAI Whisper as a Cog model on Fly GPUs
Last synced: 17 Nov 2024
https://github.com/phineas-pta/fine-tune-whisper-vi
jupyter notebooks to fine tune whisper models on Vietnamese using Colab and/or Kaggle and/or AWS EC2
aws docker fine-tuning lora multi-gpu-training speech-recognition speech-to-text vietnamese whisper
Last synced: 14 Oct 2024
https://github.com/danomation/Voice-Website
Talk back and forth to GPT over browser. Customize to have your own interactive voice assistant!
elevenlabs gpt stt tts whisper
Last synced: 24 Oct 2024
https://github.com/limdongjin/ignkafasr
Real-Time In-memory Speaker Verification and Speech Recognition Project using apache ignite, apache kafka, speechbrain, whisper, stomp, spring webflux, kubernetes(k8s)
apache-ignite apache-kafka asr audio-recorder google-kubernetes-engine k8s kubernetes speaker-recognition speaker-verification speech-recognition speechbrain springframework stomp stompwebsocket webflux whisper
Last synced: 24 Oct 2024
https://github.com/vimwei/whispertranscriber
Whisper Transcribe and srt Resegment
speech-to-text subtitle whisper
Last synced: 17 Oct 2024
https://github.com/upes-open/osoc-24-the-content-forge
The Content Hub Is a online platform which acts as a all in one solution helping content creators develop and generate short form video image content utilising genai models and cloud to maximize their efficiency and benefit from the ever-growing developments in ai models
aws docker fastapi genai microservices nodejs react whisper
Last synced: 09 Oct 2024
https://github.com/sovit-123/sam_molmo_whisper
An integration of Segment Anything Model, Molmo, and, Whisper to segment objects using voice and natural language.
molmo segment-anything-model segmentanythingmodel vlm whisper
Last synced: 18 Oct 2024
https://github.com/alessioborgi/stylealigned_multireference-multimodal
Novel framework for Zero-Shot Style Alignment in Text-to-Image generation, incorporating Multi-Modal Context-Awareness and Multi-Reference Style Alignment, using minimal attention sharing, ensuring consistent style transfer without fine-tuning.
adain blip clap context-awareness multi-modal multi-style-transfer no-fine-tuning shared-attention-heads style-aligned text-to-image-generation whisper zero-shot-learning
Last synced: 18 Oct 2024
https://github.com/andreabak/whispersubs
Generate subtitles for your video or audio files using the power of AI
ai cuda deep-learning gpu-acceleration machine-learning srt subtitles transcribe transcription translate whisper
Last synced: 16 Nov 2024
https://github.com/tensoraws/yuisub
Auto translation of new anime episodes based on Yui-MHCP001
anime chatgpt llm openai pysubs2 subtitle translation whisper
Last synced: 09 Oct 2024
https://github.com/datarabbit-ai/transcription_service
System/service with REST API for extracting text transcriptions from movies and audio recordings in most popular video formats.
containers datarabbit rest-api speech-to-text stt transcription transcription-services whisper
Last synced: 09 Oct 2024
https://github.com/otonomee/mic2transcript
CLI tool that continuously transcribes audio from the device's built-in microphone to a text file. Runs in the background, providing an ongoing log of ambient audio as text.
audio cli cli-tool openai speech speech-transcription transcription whisper
Last synced: 09 Oct 2024
https://github.com/water25234/ChatREP
Summary on Youtube By ChatGPT & whisper
chatgpt-api openai python python3 video whisper youtube
Last synced: 24 Oct 2024
https://github.com/paulocoutinhox/py-transcriptor-ai
PyTranscriptorAi - Transcript videos to text with Ai and add subtitles - OpenAi
ai openai subtitles transcript video whisper
Last synced: 09 Nov 2024
https://github.com/williamwa/mssmith
A Telegram bot that utilizes the ChatGPT API and can communicate through voice.
chatpgt-api telegram-bot tts whisper
Last synced: 08 Nov 2024
https://github.com/i4ds/whisper-prep
Data preparation utility for the finetuning of OpenAI's Whisper model.
fine-tuning nlp speech-to-text whisper
Last synced: 09 Nov 2024
https://github.com/sonhm3029/realtime-vietnamese-asr-react-native-and-whisper
This project implement end to end realtime vietnamese speech recognition with PhoWhisper in Backend and frontend in React Native
asr phowhiper react-native realtime realtime-speech-recognition speech-recognition speech-to-text vietnamese whisper
Last synced: 16 Nov 2024
https://github.com/jemtaly/whispering
A real-time transcription and translation tool implemented in Python based on the fast-whisper library.
live-caption python real-time-transcription real-time-translation tkinter transcription translation whisper
Last synced: 11 Nov 2024
https://github.com/bhattbhavesh91/neo4j-palm2-makersuite
Explore how to build a Q&A system on Neo4j using Google's Palm2 model with MakerSuite in this repository.
google google-api google-palm maker-suite neo4j-driver neo4j-python-scripts palm2 python table-qa voice-assistant whisper
Last synced: 16 Nov 2024
https://github.com/knot-inc/john
John is a web app that records video, analyzes audio with AI, and identifies the speaker's native language from their English accent, simplifying language assessment.
audio-analysis machine-learning whisper
Last synced: 17 Nov 2024
https://github.com/imsanjoykb/speech-nlp-bootcamp
Speech NLP Bootcamp
asr audio-analysis audio-applications bangla-nlp huggingface-transformers seq2seq speech speech-recognition tts wav2vec2 whisper
Last synced: 17 Nov 2024
https://github.com/gurpreetkaurjethra/multimodal-ai-app-using-llava-7b
Multimodal AI App using Llava 7B and Gradio
ai generative-ai gradio large-language-models llava llavacpp llm multimodal voice-assistant whisper
Last synced: 22 Nov 2024
https://github.com/t-h-chung/note-taker
Note-taking app for online/local video/audio using Whisper transcription, ChatGPT, and Notion
chatgpt notes notion transcription whisper youtube
Last synced: 09 Oct 2024
https://github.com/daisyyedda/whisper-large-v2-atcosim_corpus
A fine-tuned Whisper model (whisper-large-v2) for aviation audio transcription. WER < 5%.
asr-model nlp whisper whisper-ai
Last synced: 09 Oct 2024
https://github.com/sanket-poojary-03/fine-tuning-whisper
Fine tuning Whisper-Small LLM for Hinglish Audio dataset
audio-dataset audio-to-text deep-learning fine-tuning huggingface-transformers python speech-recognition speech-to-text whisper whisper-ai
Last synced: 09 Oct 2024
https://github.com/ndjenkins85/afkode
Personal voice command interface for iPhone on pythonista powered by Whisper and ChatGPT.
chatgpt openai python-packaging quick-start whisper
Last synced: 12 Oct 2024
https://github.com/kazkozdev/video-analyser
⚡ The YouTube Video Analyzer Pro brings AI-powered analysis capabilities to your fingertips, offering deep insights for content creators and marketers.
ai content-analytics fastapi llama3 llm ollama-api python3 video-analysis video-analysis-client whisper youtube youtube-analytics youtube-api youtube-subscribers
Last synced: 22 Dec 2024
https://github.com/tonywu71/distilling-and-forgetting-in-large-pre-trained-models
Code for my dissertation on "Distilling and Forgetting in Large Pre-Trained Models" for the MPhil in Machine Learning and Machine Intelligence (MLMI) at the University of Cambridge.
continual-learning distillation speech-recognition whisper
Last synced: 04 Dec 2024
https://github.com/my-north-ai/semantic_audio_filtering
Synthetic data augmentation technique via LLM for Automatic Speech Recognition fine tuning.
automatic-speech-recognition fine-tuning synthetic-dataset-generation text-to-speech whisper
Last synced: 24 Oct 2024
https://github.com/JoSuru/speeka
Speeaka is an open-source project that uses the Whisper model of OpenAI to transcribe audio into text. Its intuitive web interface makes it easy to use. Contributions are welcome.
open-source python python3 speech-to-text streamlit whisper
Last synced: 24 Oct 2024
https://github.com/jacoblincool/wft
Run Whisper fine-tuning with ease—it works on MPS, CUDA, and CPU without code changes.
Last synced: 11 Dec 2024
https://github.com/astrologos/py-speakeasy
Speakeasy GPT is a Jupyter notebook that utilizes several natural language processing utilities to provide a seamless and low-latency speech interface to ChatGPT and other large language models.
automatic-speech-recognition chat-gpt coqui-ai coqui-tts elevenlabs-api mimic mycroftai text-to-speech whisper
Last synced: 24 Oct 2024
https://github.com/Lord-Haji/ChatAudio
chatbot gpt-3-5-turbo gpt-4 langchain langchain-python speech-recognition whisper whisper-api
Last synced: 24 Oct 2024
https://github.com/seitzquest/RavenWhisperer
Listens to your voice and queries a language model for answers when a question is detected
Last synced: 22 Nov 2024
https://github.com/marketcalls/openalgo-voice-based-orders
OpenAlgo Voice Based Orders
flask groq openai python speech-to-text whisper
Last synced: 19 Dec 2024
https://github.com/sakurajimamai-1202/stream-translator-gpt-webui
A web ui application that utilizes the stream-translator-gpt
faster-whisper gemini gpt transcribe translate translation translator webui whisper yt-dlp
Last synced: 11 Oct 2024
https://github.com/toLSC/tolsc-speech-to-text
Speech to text service for toLSC app implemented with OpenAI Whisper model
fastapi python speech-recognition speech-to-text tts whisper
Last synced: 24 Oct 2024
https://github.com/winstxnhdw/capgen
A fast CPU-first video/audio transcriber for generating caption files with Whisper and CTranslate2, hosted on Hugging Face Spaces.
asr automatic-speech-recognition caddy ctranslate2 docker fastapi huggingface huggingface-spaces uvicorn-gunicorn whisper
Last synced: 23 Oct 2024
https://github.com/maylad31/colab-codes
some useful colab files
clip colab-notebook speech-recognition whisper zero-shot-classification
Last synced: 12 Nov 2024
https://github.com/abdnh/anki-asr
Anki add-on for speech recognition
anki anki-addon deepgram speech-recognition whisper
Last synced: 24 Nov 2024
https://github.com/aspadax/subtitlegenerator
Automatically generate a subtitle for your video.
gpt machine-learning openai rust streamlit subtitles-generator whisper
Last synced: 09 Oct 2024
https://github.com/shtirmann/v2t
Telegram bot which automatically transcribes all voice and video messages to text.
ai aiogram faster-whisper python telegram-bot telegram-bot-python voice-to-text whisper
Last synced: 09 Oct 2024
https://github.com/alancunningham/chatgpt-assistant
A ChatGPT assistant with voice activation and image generation, connected to a Raspberry Pi display.
chatgpt chatgpt-api dall-e dall-e-api porcupine python raspberry-pi whisper
Last synced: 10 Nov 2024
https://github.com/tranbavinhson/eth-decentralized-chat
Decentralized chat app by Ethereum Whisper protocol + Vuejs
ethereum vue vuejs whisper whisper-protocol
Last synced: 06 Nov 2024
https://github.com/slinusc/speaker_identification_evaluation
Evaluating the Effectiveness of Transformer Layers in Wav2Vec 2.0, XLS-R, and Whisper for Speaker Identification Tasks
Last synced: 09 Oct 2024
https://github.com/marquesafonso/multilang-asr-captioner
A multilingual automatic speech recognition and video captioning tool using faster whisper. Supports real-time translation to english. Runs on consumer grade cpu.
automatic-speech-recognition captioning-videos faster-whisper whisper
Last synced: 24 Oct 2024
https://github.com/jowadev/interview
Interview is an interactive application crafted to empower both students and professionals in honing their skills for job interviews.
interview-preparation job-interviews nextjs professional students whisper
Last synced: 14 Dec 2024
https://github.com/maawad/luna
Personal assistant
bot openai personal-assistant whisper
Last synced: 17 Dec 2024
https://github.com/egorsmkv/star-adapt-uk
Fork of https://github.com/YUCHEN005/STAR-Adapt with some modifications for Ukrainian.
asr speech-recognition ukrainian whisper
Last synced: 19 Dec 2024
https://github.com/Op27/meeting_minutes_generator
This Python application automates the process of generating meeting minutes from an audio recording. It uses the Whisper library for transcription and the OpenAI GPT models for summarizing content, then outputs the result in a Word document.
ai audio-processing document-automation meeting-minutes openai python speech-recognition text-summarization transcription whisper
Last synced: 24 Oct 2024
https://github.com/i4ds/whisper-finetune
This repository contains code for fine-tuning the Whisper speech-to-text model.
fine-tuning nlp speech-to-text whisper
Last synced: 09 Oct 2024
https://github.com/adamelkholyy/whisper-yt
Toolkit for using Whisper to transcribe YouTube videos. Includes Whisper transcription of YouTube videos, conversion of YouTube video into HuggingFace dataset (using audio and subtitles) and evaluation of Whisper transcription against YouTube subtitles
asr diarization huggingface-datasets pyannote transcription whisper word-error-rate youtube
Last synced: 10 Dec 2024