Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Whisper
Whisper is an automatic speech recognition (ASR) model developed by OpenAI. It is an encoder-decoder Transformer trained on about 680,000 hours of multilingual, multitask audio collected from the web. It can transcribe speech in many languages, translate speech into English, and identify the spoken language. OpenAI released the code and model weights as open source in September 2022, and they are publicly available from the repository linked below. The same topic name is also used by projects built on the unrelated Ethereum Whisper messaging protocol, so some repositories listed here are not speech-related.
- GitHub: https://github.com/topics/whisper
- Repo: https://github.com/openai/whisper
- Created by: OpenAI
- Released: September 2022
- Related Topics: machine-learning, artificial-intelligence, language-modeling
- Last updated: 2024-11-14 00:26:52 UTC
- JSON Representation
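Many of the projects listed here turn Whisper transcripts into subtitle files. As a minimal sketch of that step, the snippet below renders transcript segments as SRT text. It assumes each segment is a dict with `start` and `end` times in seconds and a `text` string, which is the shape the `openai-whisper` Python package returns under the `segments` key of a `transcribe()` result. The helper names `format_timestamp` and `segments_to_srt` are hypothetical and not part of any listed project, and the sample segments are hand-written rather than real model output.

```python
def format_timestamp(seconds: float) -> str:
    """Convert a time in seconds to the SRT form HH:MM:SS,mmm."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments) -> str:
    """Render an iterable of {'start', 'end', 'text'} dicts as SRT text."""
    blocks = []
    for index, seg in enumerate(segments, start=1):
        start = format_timestamp(seg["start"])
        end = format_timestamp(seg["end"])
        # SRT cue: counter, time range, text, then a blank line between cues.
        blocks.append(f"{index}\n{start} --> {end}\n{seg['text'].strip()}\n")
    return "\n".join(blocks)


# Hand-written sample segments (assumption: same keys Whisper produces).
# With openai-whisper installed, segments could instead come from:
#   import whisper
#   segments = whisper.load_model("base").transcribe("audio.mp3")["segments"]
sample_segments = [
    {"start": 0.0, "end": 2.5, "text": " Hello world."},
    {"start": 2.5, "end": 5.0, "text": " This is a test."},
]

if __name__ == "__main__":
    print(segments_to_srt(sample_segments))
```

Keeping the formatter separate from the transcription call means the same function can be reused with other backends that produce start, end, and text fields.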
https://github.com/status-im/status-js-api
Status Javascript Client (WIP)
ethereum javascript shh status-im web3 web3js whisper
Last synced: 01 Nov 2024
https://github.com/nssharmaofficial/reddit-hole
Automated reddit scraper and video creator
amazon-polly amazon-polly-api automation aws captioning openai openai-whisper reddit reddit-bot reddit-crawler reddit-scraper tts whisper
Last synced: 09 Oct 2024
https://github.com/natehouk/flow-ai-hackathon-2023
YASS.ai - Team Orange's entry to the Flow AI Hackathon 2023
ai chatgpt chatgpt-api django gpt-3-5-turbo gpt-4 marketaux-api newsapi openai openai-api python3 whisper whisper-ai whisper-api
Last synced: 14 Oct 2024
https://github.com/hisano/openai-whisper-on-docker
OpenAI Whisper on Docker
Last synced: 10 Nov 2024
https://github.com/jim60105/aichatassistant
Stream YouTube live to OpenAI, get AI-generated summaries and real-time reply options. (Chrome Extension)
chrome-extension openai typescript whisper youtube
Last synced: 23 Oct 2024
https://github.com/evilfreelancer/docker-whisper-server
whisper.cpp HTTP transcription server with OpenAI-like API in Docker
api api-server asr cuda docker docker-compose dockerfile nvidia openai openai-api whisper whisper-cpp
Last synced: 09 Oct 2024
https://github.com/redocrepus/Whisper-Paste
Chrome extension that allows dictating anywhere using OpenAI Whisper
chrome-extension dictation openai openai-api text-to-speech voice-recognition voice-typing whisper whisper-ai
Last synced: 24 Oct 2024
https://github.com/voidful/whisper-live-asr-demo
run whisper on CPU/GPU server
Last synced: 24 Oct 2024
https://github.com/luquedaniel/whisper2subs
A CLI tool that transcribes audio using openai-whisper and translates it using DeepL.
audio cli deepl subtitle transcribe translate video weekend-project whisper
Last synced: 11 Oct 2024
https://github.com/m0rf30/shisper
A quick & dirty script to generate and view subtitles and transcriptions for your multimedia files using ggerganov/whisper.cpp
asr bash shisper whisper whispercpp
Last synced: 14 Oct 2024
https://github.com/kurianbenoy/malayalam_asr_benchmarking
A study to benchmark whisper based ASRs in Malayalam
asr benchmarking speech transformers-library whisper
Last synced: 14 Oct 2024
https://github.com/qqxufo/whisper-nodejs
whisper-nodejs is an npm package for using OpenAI's Whisper API to transcribe and translate audio. With whisper-nodejs, you can easily convert audio files into text and translate them into English or other supported languages.
nodejs openai whisper whisper-nodejs
Last synced: 13 Nov 2024
https://github.com/hoangv97/ai-chatbot
Integrate ChatGPT, Dall-E, Whisper and other AI models in Replicate into Messenger and Telegram bot
bottender chatbot chatgpt dall-e2 messenger-bot replicate telegram-bot typescript whisper
Last synced: 24 Oct 2024
https://github.com/oddlama/whisper-overlay
A wayland overlay providing speech-to-text functionality for any application via a global push-to-talk hotkey
faster-whisper hyprland realtime speech-recognition speech-to-text wayland whisper wlroots
Last synced: 09 Oct 2024
https://github.com/hiradary/simplewhisper
A simple speech-to-text transcription interface using OpenAI's Whisper API.
openai speech-to-text whisper whisper-ai
Last synced: 09 Oct 2024
https://github.com/egorsmkv/optimized-whisper
Use quantized versions of Whisper to speed up inference
faster-whisper hqq quantization whisper
Last synced: 18 Oct 2024
https://github.com/SrinadhVura/OpenAI-Stack-Hack
Our Medifix is an AI powered assistant powered on gpt-3.5 turbo (chatGPT). Medifix is designed to help people by providing preventive measures based on the symptoms mentioned.
chatgpt gtts streamlit whisper
Last synced: 24 Oct 2024
https://github.com/legendsort/openAISpeechToDatabase
AI automation to save formatted text with proper title from speech
automation chatgpt dropbox notion openai whisper zapier
Last synced: 05 Aug 2024
https://github.com/neka-nat/stenocaptioner
CLI tool for automatic subtitling using whisper.
python subtitles subtitles-generator whisper
Last synced: 14 Oct 2024
https://github.com/mharrvic/redhorse-ai-transcriber
Audio transcriber using OpenAI Whisper ML deployed to Banana.dev
Last synced: 07 Aug 2024
https://github.com/mbotsu/mlx_speech2text
Audio transcription using mlx whisper and vad silence processing
Last synced: 09 Oct 2024
https://github.com/princejoogie/chunktube
It's YouTube.. but text!
gpt-3 openai react typescript whisper
Last synced: 09 Nov 2024
https://github.com/navalnica/whisper-finetuning-be
Finetuning Whisper ASR model for Belarusian language
asr belarus belarusian belarusian-language speech-recognition speech-to-text stt wfte whisper whisper-event
Last synced: 13 Nov 2024
https://github.com/paddy41601/faster-whisper-cli
A command-line interface wrapper for Faster Whisper
faster-whisper openai quantization speech-recognition speech-to-text transformer whisper
Last synced: 24 Oct 2024
https://github.com/gabrielrf/voice2text
Automatic description of voice messages in private Telegram conversations
automation openai openai-whisper pyrogram telegram transcription whisper
Last synced: 11 Nov 2024
https://github.com/BatuhanYilmaz26/Youtube-Transcriber
Input a YouTube video link and get a transcription as a .txt, .vtt or .srt file.
automatic-speech-recognition huggingface openai python speech-recognition streamlit whisper
Last synced: 24 Oct 2024
https://github.com/moebiussurfing/ofxsurfingtextsubtitle
Draws subtitles from an .SRT (or plain text) into a formatted styled paragraph with fading opacity and more.
openframeworks openframeworks-addon whisper whisper-cpp
Last synced: 27 Oct 2024
https://github.com/jxxe/murmur
A proof-of-concept transcription app
journalism mac macos transcribe transcription whisper
Last synced: 24 Oct 2024
https://github.com/flyingfathead/whisper-transcriber-telegram-bot
Python-based Whisper transcriber bot for Telegram
openai-whisper python telegram telegram-bot telegram-bot-api transcribe transcriber transcription whisper yt-dlp yt-dlp-wrapper
Last synced: 12 Nov 2024
https://github.com/cansik/speech-to-text-osc
Speech to text with OSC output.
Last synced: 23 Oct 2024
https://github.com/ignabelitzky/easy-subber
A Python-based tool that takes video files and generates .srt subtitle files using Whisper for speech recognition, FFmpeg for audio processing, and a simple Tkinter GUI
ffmpeg gui python speech-recognition srt subtitles tkinter transcription video-processing whisper
Last synced: 22 Oct 2024
https://github.com/coderscreative/faster-whisper-rs
a rust crate for easily implementing faster-whisper stt into your rust programs.
ai faster-whisper rust speech-recognition speech-to-text stt whisper
Last synced: 09 Oct 2024
https://github.com/drakerossman/state-of-art-ai
State of Art AI models you can run locally.
ai chatgpt deep-learning gpt large-language-models llm machine-learning stable-diffusion transformers whisper
Last synced: 06 Nov 2024
https://github.com/VoXera/VoXera
An Open-Source Persian Language Techs Toolkit with Python
deep-learning deep-neural-networks keyword-extraction machine-learning natural-language-processing nlp openai persian persian-language speech-recognition speech-to-text text-processing vosk vosk-api whisper
Last synced: 04 Aug 2024
https://github.com/shonharsh/horizonforbiddenwest-shardmacro-logitechghub
A macro to get free metal shards in Horizon Forbidden West using Logitech G Hub
automation bow commands config forbidden free game ghub hack horizon horizon-forbidden-west hunter macro pc script sell shards west whisper windows
Last synced: 11 Oct 2024
https://github.com/gorkemkaramolla/whisper-run
Faster Whisper with Speaker Diarization
distil-whisper faster-whisper openai pyannote speaker-diarization speech-recognition transcription whisper whisper-large
Last synced: 09 Oct 2024
https://github.com/detektor777/colab_list
colab list for video
ai colab-notebook colorization dain deblur enhance instcolorization nafnet real-esrgan transcribe upscaling video whisper
Last synced: 24 Oct 2024
https://github.com/lissettecarlr/AutomaticSpeechRecognition
Python wrappers for various speech-to-text implementations (paraformer, whisper_online, whisper_offline, funasr), used to serve the kuon repository
ai asr audio audio-processing deepl paraformer python speech-to-text text whisper
Last synced: 24 Oct 2024
https://github.com/gcoter/extract-keywords-from-youtube-videos
This project combines youtube-dl, whisper, LangChain and ChatGPT to extract keywords from YouTube videos. It was intended as a tool for Lyon Data Science to better reference its videos.
chatgpt langchain whisper youtube-dl
Last synced: 24 Oct 2024
https://github.com/erkara/Rise-of-Transfer-Learning
You will find brief code implementations of some of the latest developments in AI, including Stable Diffusion, Whisper, YOLO and Hugging Face Transformers
gpt-3 huggingface openai stable-diffusion transfer-learning whisper yolov5
Last synced: 24 Oct 2024
https://github.com/prathamesh-mandavkar/AutoTalker
The project focuses on leveraging technology to create new courses, personalize existing ones, and enhance the assessment process, ultimately contributing to the development of 21st-century skills in students.
ai bark gdsc gdsc-dypsn gemini-api gemini-pro gen-ai ngo python solution-challenge-2024 stt subtitles tts video-creation whisper
Last synced: 24 Oct 2024
https://github.com/ognisty321/whisper-transcription-ui
Whisper Transcription UI is a user-friendly graphical interface for whisper-standalone-win. Transcribe and translate audio/video files effortlessly with customizable settings and saved preferences.
gui python transcription ui whisper whisper-standalone-win
Last synced: 09 Oct 2024
https://github.com/m0wer/aibot
Telegram bot powered by Ollama, capable of handling text and voice messages, with configurable language models and system prompts.
ai assistant llama3 ollama telegram telegram-bot tts whisper
Last synced: 10 Oct 2024
https://github.com/lhr0909/live-subtitles-rokid-ar
Real-life subtitles using Rokid AR glasses and OpenAI Whisper
augmented-reality openai real-time rokid subtitles whisper
Last synced: 11 Oct 2024
https://github.com/botisan-ai/whisper-aws-stack
Deploy Whisper on AWS Scalably
aws cdk ecs fargate fastapi openai silero-vad whisper
Last synced: 24 Oct 2024
https://github.com/0x20f/listen-wise
Save the last 30 seconds of audio to text using ai. Send that text to a notion page, readwise, obsidian, or just save it locally in a text file.
ai notion openai speech-to-text transcription whisper
Last synced: 30 Oct 2024
https://github.com/semyon-dev/whissage
the backend of blockchain-based messenger
blockchain blockchain-messenger ethereum geth messenger whisper whisper-protocol
Last synced: 30 Oct 2024
https://github.com/yjg30737/whisper_transcribe_youtube_video_example_gui
GUI Showcase of using Whisper to transcribe and analyze Youtube video
audio-to-text pyqt pyqt5 pyqt5-desktop-application python pytube qt whisper
Last synced: 07 Nov 2024
https://github.com/kristofferv98/voiceprocessingtoolkit
The VoiceProcessingToolkit is an all-encompassing suite designed for sophisticated voice detection, wake word recognition, text-to-speech synthesis, and advanced audio processing. It offers intuitive interfaces to streamline the integration of voice processing capabilities into your applications
api audio automation elevenlabs gpt-4 multithreading openai picovoice python speech text-to-speech transcription utility voice voice-processing wake-word whisper whisper-api
Last synced: 02 Nov 2024
https://github.com/tristan-mcinnis/realtime-whisper-console-transcriber
A real-time speech-to-text transcriber using the Whisper model, designed for efficiency and ease of use in the console. This tool leverages the faster_whisper library and Rich to provide a seamless user experience for transcribing audio inputs on the fly.
asr console python real-time speech-recognition speech-to-text terminal transcription whisper
Last synced: 12 Oct 2024
https://github.com/umerarif01/ai-translator
AI Translator: Fast and Accurate Translations with Next.js and OpenAI's Whisper and GPT-3 APIs
Last synced: 24 Oct 2024
https://github.com/jech/galene-stt
Speech-to-text support for Galene
galene stt videoconference webrtc whisper whisper-cpp
Last synced: 09 Oct 2024
https://github.com/ribartra/call-listener_bot
A bot that downloads, transcribes and analyzes calls to find insights for sales advisors.
api audio-analyser call-bot call-listener drive gcp openai python whisper
Last synced: 09 Oct 2024
https://github.com/nexuslux/realtime-whisper-console-transcriber
A real-time speech-to-text transcriber using the Whisper model, designed for efficiency and ease of use in the console. This tool leverages the faster_whisper library and Rich to provide a seamless user experience for transcribing audio inputs on the fly.
asr console python real-time speech-recognition speech-to-text terminal transcription whisper
Last synced: 09 Oct 2024
https://github.com/abus-aikorea/voice-pro
The best gradio web-ui for ai transcription, translation and TTS. Automatic subtitle creation using faster whisper. Easy one click installation. Fully portable.
asr faster-whisper translation tts whisper
Last synced: 09 Oct 2024
https://github.com/fabio-garavini/ha-groq-whisper-stt-api
HACS custom integration for using GroqCloud speech-to-text (Whisper) API in the Assist pipeline, reducing the workload on the Home Assistant server.
groq-api home-assistant stt whisper
Last synced: 29 Sep 2024
https://github.com/ckaznable/yt-cli-live
Youtube Text Live Streaming in CLI
asr cli rust silero-vad whisper whisper-cpp youtube
Last synced: 12 Nov 2024
https://github.com/CrabAss/dCollab
Decentralized e-Learning Collaboration Platform as a Capstone Project (COMP4913, PolyU)
comp4913 dapp ethereum javascript react whisper
Last synced: 24 Oct 2024
https://github.com/awaisoem/interview-lingo
(Aug 2024) AI assistant that helps with interviews, hiring, personality development and communication skills
ai ai71 drizzle-orm falcon neondb nextjs postgresql tailwindcss whisper
Last synced: 09 Oct 2024
https://github.com/lbrndnr/nutshell-macos
An AI-powered note-taking app for your meetings. Built for macOS using SwiftUI.
Last synced: 13 Nov 2024
https://github.com/weihanchen/google-colab-python-learn
Learn Google Colab, Python, ML, OpenAI, Whisper, spaCy, NLP, HuggingFace
colab-notebook huggingface matplotlib natural-language-processing nlp openai pandas python spacy whisper
Last synced: 11 Nov 2024
https://github.com/danomation/Voice-Website
Talk back and forth to GPT over browser. Customize to have your own interactive voice assistant!
elevenlabs gpt stt tts whisper
Last synced: 24 Oct 2024
https://github.com/sandy1990418/chinesetaiwanesewhisper
This repository focuses on leveraging OpenAI's Whisper model for speech recognition in Chinese (Mandarin) and Taiwanese Hokkien languages. It includes tools and scripts for data preprocessing, model training, and evaluation, tailored to improve speech recognition accuracy for these languages.
asr chinese gradio realtime speech-to-text streaming-audio taiwanese whisper
Last synced: 09 Oct 2024
https://github.com/patbqc/thoughtforgeai
Forge your thoughts through an AI-powered brainstorming session!
ai anthropic brainstorm brainstorming brainstorms mobile openai reactnative whisper
Last synced: 31 Oct 2024
https://github.com/chriamue/whisper-example
Docker compose environment and example for whisper.
docker-compose geth p2p-network shh web3js whisper
Last synced: 24 Oct 2024
https://github.com/schibsted/sum
Sum, a powerful tool for enhancing your articles with the help of ChatGPT.
chatgpt nextjs nrk openai tailwindcss vg whisper
Last synced: 13 Nov 2024
https://github.com/sslava/ai-voice-chat
AI Voice Chat
nodejs openai tts voice-recognition whisper
Last synced: 20 Oct 2024
https://github.com/jorgeandrespadilla/avtools
AV Tools - A collection of CLI tools for audio and video processing (powered by AI). Audio transcription, Video to Audio conversion, YouTube downloader.
Last synced: 13 Nov 2024
https://github.com/achraf-oujjir/profgpt-smart-vr-professor
ProfGPT: AI-powered VR professor with electrical circuits lab table. Built with Unity, GPT and Whisper APIs, and AWS Polly.
ai-education aws-polly chatgpt-api csharp education oculus-quest-2 openai-api openai-whisper speech-to-text text-to-speech unity3d virtual-reality vr whisper
Last synced: 03 Nov 2024
https://github.com/t0mer/telessist
Telessist allows you to contact GPT3 directly from WhatsApp and not only that. Telessist also allows you to save your own personal data and later search and retrieve it using GPT3 to generate a response. In the examples folder, you can see several examples of how to use this bot so you don't have to remember anything ever again.
assistant chatgpt dall-e docker openapi python3 telegram telegram-bot weather whisper
Last synced: 15 Oct 2024
https://github.com/redocrepus/arkode
Code in VS Code, using your voice, fmedia, WhisperAI and ChatGPT
accessibility chatgpt chatgpt-api code-assistant coding-assistant coding-by-voice developer-tools openai openai-api programming-assistant programming-by-voice visual-studio-code visual-studio-code-extension visualstudiocode voice-coding voicecode voicecoding vscode-extension whisper whisper-api
Last synced: 24 Oct 2024
https://github.com/gustavz/audio-to-text
Streamlit app to transcribe audio to text using OpenAI's whisper library
audio-to-text streamlit whisper
Last synced: 24 Oct 2024
https://github.com/phineas-pta/fine-tune-whisper-vi
jupyter notebooks to fine tune whisper models on Vietnamese using Colab and/or Kaggle and/or AWS EC2
aws docker fine-tuning lora multi-gpu-training speech-recognition speech-to-text vietnamese whisper
Last synced: 14 Oct 2024
https://github.com/egorsmkv/whisper-ukrainian
Trainer and Evaluation scripts for fine-tuning Whisper models for the Ukrainian language
asr automatic-speech-recognition openai speech-recognition ukrainian whisper
Last synced: 18 Oct 2024
https://github.com/manucabral/quick-subtitles
An easy way to generate SRT subtitles from a video in Windows.
audio-to-text srt srt-subtitles subtitles subtitles-generator transcription whisper whisper-ai windows
Last synced: 03 Nov 2024
https://github.com/limdongjin/ignkafasr
Real-Time In-memory Speaker Verification and Speech Recognition Project using apache ignite, apache kafka, speechbrain, whisper, stomp, spring webflux, kubernetes(k8s)
apache-ignite apache-kafka asr audio-recorder google-kubernetes-engine k8s kubernetes speaker-recognition speaker-verification speech-recognition speechbrain springframework stomp stompwebsocket webflux whisper
Last synced: 24 Oct 2024
https://github.com/upes-open/osoc-24-the-content-forge
The Content Hub is an online platform that acts as an all-in-one solution, helping content creators develop and generate short-form video and image content using GenAI models and the cloud, so they can maximize their efficiency and benefit from ongoing developments in AI models
aws docker fastapi genai microservices nodejs react whisper
Last synced: 09 Oct 2024
https://github.com/seitzquest/RavenWhisperer
Listens to your voice and queries a language model for answers when a question is detected
Last synced: 05 Aug 2024
https://github.com/JoSuru/speeka
Speeka is an open-source project that uses OpenAI's Whisper model to transcribe audio into text. Its intuitive web interface makes it easy to use. Contributions are welcome.
open-source python python3 speech-to-text streamlit whisper
Last synced: 24 Oct 2024
https://github.com/water25234/ChatREP
Summary on Youtube By ChatGPT & whisper
chatgpt-api openai python python3 video whisper youtube
Last synced: 24 Oct 2024
https://github.com/sovit-123/sam_molmo_whisper
An integration of Segment Anything Model, Molmo, and, Whisper to segment objects using voice and natural language.
molmo segment-anything-model segmentanythingmodel vlm whisper
Last synced: 18 Oct 2024
https://github.com/sanket-poojary-03/fine-tuning-whisper
Fine tuning Whisper-Small LLM for Hinglish Audio dataset
audio-dataset audio-to-text deep-learning fine-tuning huggingface-transformers python speech-recognition speech-to-text whisper whisper-ai
Last synced: 09 Oct 2024
https://github.com/tensoraws/yuisub
Auto translation of new anime episodes based on Yui-MHCP001
anime chatgpt llm openai pysubs2 subtitle translation whisper
Last synced: 09 Oct 2024
https://github.com/zaneh/heybilly
It's like Alexa, but for your computer. Highly modular, real-time voice assistant. Built using self-assembling graphs.
contributions-welcome graph python3 rabbitmq self-hosted tts voice-assistant whisper
Last synced: 19 Oct 2024
https://github.com/my-north-ai/semantic_audio_filtering
Synthetic data augmentation technique via LLM for Automatic Speech Recognition fine tuning.
automatic-speech-recognition fine-tuning synthetic-dataset-generation text-to-speech whisper
Last synced: 24 Oct 2024
https://github.com/sakurajimamai-1202/stream-translator-gpt-webui
A web ui application that utilizes the stream-translator-gpt
faster-whisper gemini gpt transcribe translate translation translator webui whisper yt-dlp
Last synced: 11 Oct 2024
https://github.com/marty1885/useful-whisper-server
Whisper server based on useful-transformers for the RK3588
npu rk3588 rockchip useful-transformers whisper
Last synced: 15 Oct 2024
https://github.com/driftingruby/395-transcribing-with-artificial-intelligence
In this episode, we look at creating an audio transcription service that allows files uploaded through Active Storage to be transcribed with Artificial Intelligence. However, there are many considerations around the approach from both performance and thread-safety perspectives.
artificial-intelligence openai ruby ruby-on-rails whisper
Last synced: 05 Nov 2024
https://github.com/paulocoutinhox/py-transcriptor-ai
PyTranscriptorAi - Transcribe videos to text with AI and add subtitles - OpenAI
ai openai subtitles transcript video whisper
Last synced: 09 Nov 2024
https://github.com/otonomee/mic2transcript
CLI tool that continuously transcribes audio from the device's built-in microphone to a text file. Runs in the background, providing an ongoing log of ambient audio as text.
audio cli cli-tool openai speech speech-transcription transcription whisper
Last synced: 09 Oct 2024
https://github.com/fly-apps/cog-whisper
Run OpenAI Whisper as a Cog model on Fly GPUs
Last synced: 24 Oct 2024