Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Whisper
Whisper is an automatic speech recognition (ASR) model developed by OpenAI. It is an encoder-decoder transformer trained on a large corpus of multilingual audio paired with transcripts, and it can transcribe speech, translate speech into English, and identify the spoken language. OpenAI released the code and model weights as open source, and the model is widely used as the basis for transcription, subtitling, and voice-assistant applications.
- GitHub: https://github.com/topics/whisper
- Repo: https://github.com/openai/whisper
- Created by: OpenAI
- Released: September 2022
- Related Topics: machine-learning, artificial-intelligence, language-modeling
- Last updated: 2025-02-10 00:33:03 UTC
https://github.com/deshwalmahesh/whisper-fastapi-realtime
A frontend and backend app that runs openai/whisper-large-v3-turbo on consumer-grade hardware to provide real-time live audio transcription.
audio-transcription fastapi huggingface live pyaudio realtime transcription transformers whisper whisper-large
Last synced: 25 Oct 2024
https://github.com/heng30/vtbox
An offline voice-to-text tool that uses the Whisper model for transcription.
rust slint-ui voice2text whisper
Last synced: 21 Nov 2024
https://github.com/tobybenjaminclark/intermew
👨‍💻 Realistic, generative simulated interviews for Durhack 2024. Built using Webscraping, OpenCV, Deepface, Whisper, OpenAI and Gamemaker.
computer-vision openai-api whisper
Last synced: 25 Jan 2025
https://github.com/paszkoo/real_time_whisper_iot
Real-time voice transcription from the default audio input using faster-whisper
ai iot-application iot-device smart-home voice-assistant voice-recognition whisper
Last synced: 17 Jan 2025
https://github.com/mario-huang/whisper-desktop
A desktop app for easy subtitle generation using the Whisper model.
ai desktop gradio open-source python pytorch tauri web-ui whisper
Last synced: 17 Jan 2025
https://github.com/iamarunbrahma/smart-voice-assistant
A simple voice assistant that takes your queries as speech and generates answers in both text and audio using the ChatGPT API.
Last synced: 02 Feb 2025
https://github.com/brucewind/localwhisperapiservice
openai-whisper transcribe whisper
Last synced: 20 Jan 2025
https://github.com/ajxv/rtstt
Real-time speech-to-text transcription using OpenAI Whisper
live-transcription openai openai-whisper python3 transcription whisper
Last synced: 22 Dec 2024
https://github.com/huuquyet/phowhisper-tiny
Converted clone of PhoWhisper: Automatic Speech Recognition for Vietnamese (2024)
onnx-models phowhisper speech-recognition transformersjs vietnamese vinai whisper
Last synced: 01 Feb 2025
https://github.com/evilfreelancer/whisper-tests
Collection of experiments on OpenAI Whisper models
api-server docker-compose testing transcription whisper
Last synced: 09 Feb 2025
https://github.com/leafyeexyz/counselorleaf
An AI psychological counselor that keeps you company at any time
cloudflare-api cloudflare-pages cloudflare-workers counselling counselor javascript psychology qwen react reactjs whisper
Last synced: 11 Dec 2024
https://github.com/zahidhasann88/video-summarizer
Summarizes videos by extracting audio and generating summaries based on the audio content.
nodejs openai typescript whisper
Last synced: 07 Jan 2025
https://github.com/bluebirdback/groq-subtitles
Batch video subtitle generation using Groq Whisper API
groq speech-to-text subtitles video whisper
Last synced: 21 Dec 2024
https://github.com/flo-bit/youtube-speaker-separation
A simple Python script that outputs separate audio files for each speaker in a YouTube video, using Whisper on Replicate
speaker-diarization speech-to-text text-to-speech voice-cloning whisper youtube
Last synced: 19 Dec 2024
https://github.com/s-emanuilov/whispercpp_kit
A wrapper on whisper.cpp with additional helper features like model management capabilities.
Last synced: 13 Dec 2024
https://github.com/yui-mhcp/speech_to_text
Speech-To-Text (STT) project
audio-transcription deepspeech jasper speech-to-text stt stt-api tensorflow2 video-transcription whisper
Last synced: 24 Oct 2024
https://github.com/obay-ismaeel/post-generator
An API that generates social media posts by implementing RAG with Llama-3
ai api fastapi llama llm python retrieval-augmented-generation social-media whisper
Last synced: 12 Oct 2024
https://github.com/cnseniorious000/dl-a2t
download, audio-to-text PyPI: https://pypi.org/p/dl-a2t
audio transcription whisper youtube
Last synced: 02 Jan 2025
https://github.com/codewithdark-git/talktube
A powerful Streamlit application that allows users to analyze and interact with YouTube video content through natural language questions.
agents genai genai-domain groq groq-api langchain langchain-python llm lvlm lvlms pyhton3 python rag streamlit webapp whisper youtube youtube-bot
Last synced: 10 Feb 2025
https://github.com/thealphamerc/audio-to-text
Transcribe multilingual audio clips using the Whisper model
Last synced: 02 Feb 2025
https://github.com/hanpham32/react-native-whisper
A simple text transcription web/mobile app
flask ngrok react-native transcribe whisper
Last synced: 24 Dec 2024
https://github.com/crucials/twaddle
A speech analysis app that collects statistics such as word frequencies and transcribed text
ai audio python python-eel speech-to-text vue whisper
Last synced: 24 Oct 2024
https://github.com/tylim88/voicefu-back-end
Translate Speech Into Japanese
chatgpt speech-synthesis voicevox whisper
Last synced: 18 Dec 2024
https://github.com/userpjm/whisper-youtube
Generate a SubRip subtitle file (srt) using Whisper for the audio of a YouTube video.
faster-whisper openai speech-to-text whisper
Last synced: 24 Oct 2024
https://github.com/cp3249/athena_project
Athena is an AI assistant project that utilizes voice recognition, text-to-speech, and tool-calling capabilities to provide a conversational and interactive experience. It uses LLMs available through Ollama and provides a basic framework for extending functionalities through a modular tool system.
Last synced: 15 Jan 2025
https://github.com/samliebl/ai-whisper
Simple Node.js app: speech-to-text via OpenAI's Whisper, with file download.
nodejs openai speect-to-text transcription whisper whisper-ai
Last synced: 19 Dec 2024
https://github.com/arslanex/whisperdemo
A scalable Python module for robust audio transcription using OpenAI's Whisper model. Supports multiple languages, batch processing, and output formats like JSON and SRT.
audio-processing openai openai-whisper python whisper
Last synced: 23 Nov 2024
https://github.com/malexandersalazar/casey
Casey is a Voice-Activated AI Companion for Mental Wellbeing & Content Creation #BuildWithAI
agentic-ai content-creation groq large-language-models python wellbeing whisper
Last synced: 18 Dec 2024