Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Whisper
Whisper is an autoregressive language model developed by OpenAI. It is trained on a large corpus of text using a transformer architecture and is capable of generating high-quality natural language text. Whisper can be used for tasks such as language modeling, text completion, and text generation. It has shown impressive performance on various benchmarks and has been released by OpenAI to encourage research in the field of language modeling. Whisper is not yet available for public use, but it has the potential to transform the field of natural language processing and generate new opportunities for language-based applications.
- GitHub: https://github.com/topics/whisper
- Repo: https://github.com/openai/whisper
- Created by: OpenAI
- Released: August 2021
- Related Topics: machine-learning, artificial-intelligence, language-modeling,
- Last updated: 2024-12-26 00:29:40 UTC
- JSON Representation
https://github.com/neiltron/autocap
ALL CAPS
closedcaptions ml subtitles transcription whisper
Last synced: 19 Dec 2024
https://github.com/sudiptab2100/waku-user-chat
Waku Chat using Usernames
communication-protocol decentralised-application decentralized ethereum ipfs libp2p waku waku-connect web3 whisper zk-snarks zkp
Last synced: 20 Dec 2024
https://github.com/khushijtrivedi/speech
The Assistive Speech Technology System is designed to enhance communication by analyzing and processing various speech and audio inputs.
ajax bigru-crf bootstrap flask flask-server html-css-javascript librosa python restapi-framework voice-recognition whisper
Last synced: 09 Oct 2024
https://github.com/malexandersalazar/casey
Casey is a Voice-Activated AI Companion for Mental Wellbeing & Content Creation #BuildWithAI
agentic-ai content-creation groq large-language-models python wellbeing whisper
Last synced: 18 Dec 2024
https://github.com/nexuslux/simultaneous-interpretation
Simultaneous-Interpretation is an advanced tool for real-time simultaneous interpretation. It transcribes and translates spoken language from a microphone input instantaneously, continually refining translations for accuracy. Ideal for business meetings, educational settings, and live events, it enhances multilingual communication effortlessly.
agents asr faster-whisper openai pyaudio simultaneous-intepreting simultaneous-translation speech-recognition speech-to-text transcription translation whisper
Last synced: 09 Oct 2024
https://github.com/evil0ctal/whisper-speech-to-text-api
An open source Speech-to-Text API. The project is based on OpenAI's Whisper model and uses the asynchronous features of FastAPI to efficiently wrap it and support more custom functions.
ai api fastapi openai-whisper speech-to-text speech-to-text-api whisper whisper-ai whisper-api
Last synced: 25 Oct 2024
https://github.com/LarissaGuder/whisper-datastream
Transcription and NER in streaming environment
bert-ner python spark-streaming whisper
Last synced: 24 Oct 2024
https://github.com/yuxiang32/Audio-Transcription
Audio transcriber using OpenAI Whisper
Last synced: 24 Oct 2024
https://github.com/meain/raus
Record audio until silence (RAUS)
audio hammerspoon transcription whisper whisper-cpp
Last synced: 17 Nov 2024
https://github.com/escarrie/transcriptaudio
This is a script that can be used to transcript audio file into text file using Whisper AI
Last synced: 17 Nov 2024
https://github.com/deshwalmahesh/whisper-fastapi-realtime
It is Front + Backend app that uses openai/whisper-large-v3-turbo in your consumer grade system to provide real live audio transcription
audio-transcription fastapi huggingface live pyaudio realtime transcription transformers whisper whisper-large
Last synced: 25 Oct 2024
https://github.com/sbadulin/obsidian-dictation-plugin
Obsidian dictation plugin
dictation gpt-35-turbo obsidian obsidian-plugin openai speech-to-text whisper
Last synced: 07 Dec 2024
https://github.com/doctorpok42/pheere
Pheere is a simple virtual assistant
ai chatgpt elevenlabs ts virtual-assistant whisper
Last synced: 11 Nov 2024
https://github.com/huuquyet/phowhisper-small
Converted clone of PhoWhisper: Automatic Speech Recognition for Vietnamese (2024)
onnx-models phowhisper speech-recognition transformersjs vietnamese vinai whisper
Last synced: 06 Dec 2024
https://github.com/chinese-soup/cbot-telegram-whisper
Simple bot that transcribes Telegram voice messages. Powered by go-telegram-bot-api & whisper.cpp Go bindings.
bot cpu-inference golang openai speech-recognition speech-to-text whisper whisper-cpp whispercpp
Last synced: 16 Nov 2024
https://github.com/bilalhameed248/whisper-fine-tuning-for-pronunciation-learning
Fine Tuning of Whisper Speech To Text Base Model For Pronunciation Learning
deep-learning deep-neural-networks dnn fine-tuning openai pronunciation python seq2seq speech speech-recognition speech-synthesis speech-to-text whisper whisper-ai
Last synced: 15 Nov 2024
https://github.com/kristofferv98/whisper_turboapi
An optimized FastAPI server for OpenAI's Whisper whisper-large-v3-turbo model using MLX turbo optimization
ai api asynchronous audio audio-processing fastapi huggingface machine-learning macos mlx model-serving nlp openai optimization python speech-to-text synchronous transcription whisper whisper-turbo
Last synced: 14 Dec 2024
https://github.com/televisionninja/chat
Chat with an AI Vtuber
ai chatbot llama llm tts vtube-studio vtuber whisper
Last synced: 20 Nov 2024
https://github.com/sixiaolong1117/whisperpythonscript
一个简单的 Whisper Python 脚本,可以将媒体文件的音频通过 whisper 识别成文字,并通过 pysrt 保存为字幕。
pysrt python python3 whisper whisper-ai
Last synced: 15 Nov 2024
https://github.com/goktugcy/noteai
An artificial intelligence supported NodeJS application that allows the audio file to be displayed as pdf after converting it to text with the Whisper tool.
adonisjs whisper whisper-ai whisper-api
Last synced: 15 Nov 2024
https://github.com/lifeosm/whisper
🐳 Docker image with OpenAI Whisper.
docker octolab speech-to-text whisper
Last synced: 24 Oct 2024
https://github.com/cp3249/athena_project
Athena is an AI assistant project that utilizes voice recognition, text-to-speech, and tool-calling capabilities to provide a conversational and interactive experience. It uses LLMs available through Ollama and provides a basic framework for extending functionalities through a modular tool system.
Last synced: 03 Dec 2024
https://github.com/tristan-mcinnis/simultaneous-interpretation
Simultaneous-Interpretation is an advanced tool for real-time simultaneous interpretation. It transcribes and translates spoken language from a microphone input instantaneously, continually refining translations for accuracy. Ideal for business meetings, educational settings, and live events, it enhances multilingual communication effortlessly.
agents asr faster-whisper openai pyaudio simultaneous-intepreting simultaneous-translation speech-recognition speech-to-text transcription translation whisper
Last synced: 16 Nov 2024
https://github.com/levysantiago/upload-ai
Este é um sistema que utiliza Whisper e ChatGPT da OpenAI para gerar títulos e descrições a partir da análise de vídeos submetidos.
ai artificial-intelligence axios chatgpt fastify ffmpeg nlw-13 node openai prisma react rocketseat tailwindcss typescript vite whisper zod
Last synced: 13 Nov 2024
https://github.com/waikato-llm/whisper
Docker images for the whisper audio transcription library and variants.
Last synced: 13 Nov 2024
https://github.com/electroneum/electroneum-web3.js
Electroneum SmartChain JavaScript API
api electroneum ethereum etn-sc javascript swarm typescript whisper
Last synced: 26 Sep 2024
https://github.com/lazauk/aoai-entraidauth-sdkv1
Authenticating with Entra ID (former Azure AD) to access Azure OpenAI models in Python SDK v1.x
ai authentication azure azure-active-directory dall-e embeddings entra-id gpt openai whisper
Last synced: 13 Nov 2024
https://github.com/ts-azure-services/batch-transcription-examples
A repo to archive some code related to batch transcription for animation movies.
batch-transcription speech-to-text whisper
Last synced: 30 Nov 2024
https://github.com/charlot-dedjinou/hackathon-ia-multimodal-multilingue
Lors de ce hackathon, nous avons développé la solution Smart VT, une application web basée sur l'IA conçue pour sous-titrer et doubler n'importe quelle vidéo d'une langue à une autre (selon votre choix). Le projet s'appuie sur un frontend en React, des API Python pour le traitement des vidéos, et Node.js pour la gestion des sous-titres vidéo.
api dubble fastapi ffmpeg googletranslator mongodb moviepy nodejs openia reactjs subtitles whisper
Last synced: 12 Nov 2024
https://github.com/ainoya/chrome-extension-web-transcriptor-ai
Privacy-focused Chrome extension that transcribes audio from browser tabs locally using transformers.js and the TabCapture API. All processing occurs within the browser, ensuring that audio data is never sent to external servers.
chrome-extension chrome-extensions transformersjs whisper
Last synced: 11 Nov 2024
https://github.com/kitschpatrol/ambient-novel
An interface for nonlinear interactive exploration of a novel.
ambient book fiction interactive novel svelte whisper
Last synced: 19 Nov 2024
https://github.com/a-iceberg/whisper-timestamped
Timestamped ASR microservice
asr audio-to-text automatic-speech-recognition data-analysis data-science deep-learning docker fastapi mlops monitoring mssqlserver openai prompt-engineering python resource-management timestamps uvicorn-gunicorn whisper
Last synced: 17 Nov 2024
https://github.com/fer14/videoseek
Intelligent video search tool powered by AI
bert timestamp video whisper youtube-api
Last synced: 14 Nov 2024
https://github.com/Franky1/AIAudioTranscriber
A minimalistic web app to generate transciption for audio built using Python
openai python streamlit transcription whisper
Last synced: 24 Oct 2024
https://github.com/datvm/openaiwhisperclient
A HTML page for using OpenAI Whisper API for transcripting, including making subtitles. JSON is also supported.
client-side openai subtitle timestamp transcript transcription whisper whisper-ai
Last synced: 15 Dec 2024
https://github.com/mariatepei/vt_thesis_mtepei
This repository accompanies my MSc Thesis for the degree Voice Technology, storing all referenced data and other relevant resources.
data-augmentation fastspeech2 speech-recognition whisper
Last synced: 09 Oct 2024
https://github.com/njorogemaurice/speech-recognition-openai-whisper
This project is a web-based application that utilizes OpenAI's Whisper for speech-to-text conversion. The application allows users to upload audio files or record audio directly from their browser, and then converts the speech in these audio files to text using the Whisper model.
openai speech-recognition speech-to-text whisper
Last synced: 14 Nov 2024
https://github.com/kristofferv98/semanthavoiceassistant
A comprehensive AI companion leveraging advanced semantic analysis, sentiment detection, and voice processing to provide personalized and context-aware interactions using Autogen, semantic-router, and VoiceProcessingToolkit.
ai-companion autogen elevenlabs intent-detection local-llm natural-language-processing openai personalized-interactions picovoice python rag semantic-routing sentiment-analysis text-to-speech voice-activity-detection voice-assistant voice-processing voice-recognition websearch whisper
Last synced: 11 Nov 2024
https://github.com/pkarpovich/kira-client
An AI-powered voice automation tool for IoT, integrating voice-triggered commands, OpenAI-driven intent recognition, and HTTP server management for seamless control of smart devices
ai-assistant intent-classification porcupine trigger-word-detection whisper
Last synced: 14 Nov 2024
https://github.com/flyingfathead/youwhisper-cli
A streamlined CLI tool combining `yt-dlp` and `whisperx` (or `openai-whisper`) for quick and efficient audio transcription from various video platforms.
cli cli-app python transcribe transcriber transcription whisper whisper-ai whisperx youtube-downloader yt-dlp yt-dlp-wrapper
Last synced: 12 Nov 2024
https://github.com/status-im/infra-role-status-go
Ansible role for status-go
ansible-role infra waku whisper
Last synced: 09 Nov 2024
https://github.com/studiowebux/tommygotchi
whisper, piper, llama-gpt, python, fun .. so much fun !
llama-gpt piper python3 whisper whisper-ai
Last synced: 09 Nov 2024
https://github.com/arslanex/whisperdemo
A scalable Python module for robust audio transcription using OpenAI's Whisper model. Supports multiple languages, batch processing, and output formats like JSON and SRT.
audio-processing openai openai-whisper python whisper
Last synced: 23 Nov 2024
https://github.com/jgw96/speech-to-text-web-toolkit
Making Speech-To-Text on the web easy, both local and in the cloud
ai lit transformersjs webcomponents whisper
Last synced: 06 Dec 2024
https://github.com/yousofss/speechtotext
Speech-to-Text using OpenAI's Whisper model
audio-to-text openai openai-whisper speech-to-text transcription whisper whisper-ai
Last synced: 09 Oct 2024