Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Whisper

Whisper is an autoregressive language model developed by OpenAI. It is trained on a large corpus of text using a transformer architecture and is capable of generating high-quality natural language text. Whisper can be used for tasks such as language modeling, text completion, and text generation. It has shown impressive performance on various benchmarks and has been released by OpenAI to encourage research in the field of language modeling. Whisper is not yet available for public use, but it has the potential to transform the field of natural language processing and generate new opportunities for language-based applications.

https://github.com/sivakumar-mahalingam/subtitle-generator

🎞️ Automatically generating subtitles for video files using Whisper ASR model in Python

ai audio-model audio-processing automatic-speech-recognition openai-whisper python speech-recognition speech-to-text subtitle-generator whisper

Last synced: 08 Feb 2025

https://github.com/zuplyx/subtitle-creator

Add english subtitles to videos using openai/whisper-large-v3

open-ai poetry-python python3 subtitles-generator whisper

Last synced: 09 Dec 2024

https://github.com/sixiaolong1117/whisperpythonscript

一个简单的 Whisper Python 脚本,可以将媒体文件的音频通过 whisper 识别成文字,并通过 pysrt 保存为字幕。

pysrt python python3 whisper whisper-ai

Last synced: 16 Jan 2025

https://github.com/zdwolfe/transcription-tools

Docker video transcriber, wrapper around OpenAI

openai transcription whisper whisper-ai

Last synced: 02 Jan 2025

https://gitlab.com/ifrz/asr-multi-lite

Testing of the main ASR frameworks with reduced models for low-resource languages speech recognition

distilhubert wav2vec2 whisper

Last synced: 24 Oct 2024

https://github.com/aixerum/faster-whisper

faster-whisper is a reimplementation of OpenAI's Whisper model using CTranslate2, which is a fast inference engine for Transformer models. This implementation is up to 4 times faster than openai/whisper for the same accuracy while using less memory. The efficiency can be further improved with 8-bit quantization on both CPU and GPU.

ctranslate2 gpu transcription whisper

Last synced: 07 Jan 2025

https://github.com/pjarbas/azure-ai

Examples using Azure AI services (DALLE3, Text to Speech, Whisper)

azure-openai dalle-3 image-generation-ai speech-synthesis text-to-speech whisper

Last synced: 21 Jan 2025

https://github.com/arslanex/whisperdemo

A scalable Python module for robust audio transcription using OpenAI's Whisper model. Supports multiple languages, batch processing, and output formats like JSON and SRT.

audio-processing openai openai-whisper python whisper

Last synced: 23 Nov 2024

https://github.com/paszkoo/real_time_whisper_iot

Real time voice transcription from default audio input using faster-whisper

ai iot-application iot-device smart-home voice-assistant voice-recognition whisper

Last synced: 17 Jan 2025

https://github.com/mario-huang/whisper-desktop

A desktop app for easy subtitle using whisper model.

ai desktop gradio open-source python pytorch tauri web-ui whisper

Last synced: 17 Jan 2025

https://github.com/studiowebux/tommygotchi

whisper, piper, llama-gpt, python, fun .. so much fun !

llama-gpt piper python3 whisper whisper-ai

Last synced: 05 Jan 2025

https://github.com/umlx5h/llplayer

The media player for language learning, with dual subtitles, AI-generated subtitles, realtime-OCR, translation, word lookup, and more!

asr csharp flyleaf language-learning media-player ocr player tesseract video video-player whisper wpf yt-dlp

Last synced: 01 Feb 2025

https://github.com/tylim88/Voicefu-back-end

Translate Speech Into Japanese

chatgpt speech-synthesis voicevox whisper

Last synced: 24 Oct 2024

https://github.com/philogicae/docker-faster-whisper-fr-api

Docker - Faster Whisper FR - RunPod Serverless API

ctranslate2 docker faster-whisper french runpod serverless whisper

Last synced: 08 Jan 2025

https://github.com/kristofferv98/whisper_turboapi

An optimized FastAPI server for OpenAI's Whisper whisper-large-v3-turbo model using MLX turbo optimization

ai api asynchronous audio audio-processing fastapi huggingface machine-learning macos mlx model-serving nlp openai optimization python speech-to-text synchronous transcription whisper whisper-turbo

Last synced: 14 Dec 2024

https://github.com/levysantiago/upload-ai

Este é um sistema que utiliza Whisper e ChatGPT da OpenAI para gerar títulos e descrições a partir da análise de vídeos submetidos.

ai artificial-intelligence axios chatgpt fastify ffmpeg nlw-13 node openai prisma react rocketseat tailwindcss typescript vite whisper zod

Last synced: 12 Jan 2025

https://github.com/theaussiepom/wyoming-openai

OpenAI SST and TTS support for the Wyoming protocol

home-assistant home-assistant-assist openai sst tts whisper wyoming

Last synced: 13 Feb 2025

https://github.com/javi-cc/python-openai-generator-srt

Application that works offline written in python that transcribes and translates either audio or video files into text to generate a subtitle file (.srt) using deep learning libraries such as openai-whisper and argos-translate.

argos-translate docker docker-compose dockerfile offline openai openai-whisper python whisper

Last synced: 10 Feb 2025

https://github.com/RingoMar/whisper-devcontainer

Openai whisper inside of vscode docker devcontainer using example files

ai devcontainer docker openapi python whisper

Last synced: 24 Oct 2024

https://github.com/tristan-mcinnis/simultaneous-interpretation

Simultaneous-Interpretation is an advanced tool for real-time simultaneous interpretation. It transcribes and translates spoken language from a microphone input instantaneously, continually refining translations for accuracy. Ideal for business meetings, educational settings, and live events, it enhances multilingual communication effortlessly.

agents asr faster-whisper openai pyaudio simultaneous-intepreting simultaneous-translation speech-recognition speech-to-text transcription translation whisper

Last synced: 17 Jan 2025

https://github.com/bluebirdback/groq-subtitles

Batch video subtitle generation using Groq Whisper API

groq speech-to-text subtitles video whisper

Last synced: 21 Dec 2024

https://github.com/mdbecker/whisper_cpp_macos_utils

Automated transcription workflow for macOS: Shell scripts to streamline audio recording, conversion, and transcription using whisper.cpp with macOS utilities like QuickTime Player and BlackHole-2ch.

audio-processing openai shell-scripts speech-to-text transcription whisper whisper-cpp

Last synced: 29 Jan 2025

https://github.com/huuquyet/phowhisper-tiny

Converted clone of PhoWhisper: Automatic Speech Recognition for Vietnamese (2024)

onnx-models phowhisper speech-recognition transformersjs vietnamese vinai whisper

Last synced: 01 Feb 2025

https://github.com/jplhughes/whisper_logit_lens

This Alignment Jam Hackathon project explores whether the concept of "logit lens" applies to the encoder and decoder layers in Whisper, an end-to-end speech recognition model.

alignment-jam asr interpretability interpretability-jam logitlens whisper

Last synced: 24 Oct 2024

https://github.com/josemarcosrf/Lexicap-QA

QA retrieval for Lex Fridman's podcast transcriptions

lexicap qa search whisper

Last synced: 24 Oct 2024

https://github.com/fkiller/whispertranscript

Transcribe voice from mic input using OpenAI Whisper API.

llm openai transcribe transcript transcription webaudio whisper

Last synced: 06 Jan 2025

https://github.com/s-emanuilov/whispercpp_kit

A wrapper on whisper.cpp with additional helper features like model management capabilities.

asr whisper

Last synced: 13 Dec 2024

https://github.com/MattCode64/Scriba

SCRIBA is a web application that transcribes audio files. It supports .mp3 files and provides the transcription results in a user-friendly interface.

fastapi python speech-to-text whisper

Last synced: 24 Oct 2024

https://github.com/suchith-2002/whisperwave

Transcribe any Audio to Text.

openai whisper

Last synced: 03 Feb 2025

https://github.com/403errors/tubequery

TubeQuery is a LLM based model, fetching all the queries related to your video. Just input the video link and all the qestiones are welcomed!

huggingface-transformers langchain nlp-machine-learning pipeline python3 tiktoken whisper yt-dlp

Last synced: 14 Feb 2025

https://github.com/cybergen49/ai-note-taker

A clientside-only webapp that uses OpenAI's whisper and GPT models to transcribe audio and convert the transcript to notes, summaries, or other more concise content.

ai api gpt note-taking openai productivity summarizer whisper

Last synced: 14 Feb 2025

https://github.com/stefanangelovski/voice_to_tweet

Tweet with your Voice using Whisper STT from OpenAI and Twitter4J flow to connect and talk with any account.

ai frontend openai twitter website whisper x

Last synced: 15 Dec 2024