Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Whisper

Whisper is an autoregressive language model developed by OpenAI. It is trained on a large corpus of text using a transformer architecture and is capable of generating high-quality natural language text. Whisper can be used for tasks such as language modeling, text completion, and text generation. It has shown impressive performance on various benchmarks and has been released by OpenAI to encourage research in the field of language modeling. Whisper is not yet available for public use, but it has the potential to transform the field of natural language processing and generate new opportunities for language-based applications.

https://github.com/Franky1/AIAudioTranscriber

A minimalistic web app to generate transciption for audio built using Python

openai python streamlit transcription whisper

Last synced: 24 Oct 2024

https://github.com/antosser/whisper-ui-web

Web App for interacting with the OpenAI Whisper API visually, written in Svelte

app english svelte text voice voice-recognition voice-to-text web whisper

Last synced: 07 Feb 2025

https://github.com/educa-ch/educa24-speech-to-summary

Demonstrator for an open-source speech-to-summary workflow

langchain ollama open-source open-weight speech-to-text summarization whisper

Last synced: 11 Oct 2024

https://github.com/deshwalmahesh/whisper-fastapi-realtime

It is Front + Backend app that uses openai/whisper-large-v3-turbo in your consumer grade system to provide real live audio transcription

audio-transcription fastapi huggingface live pyaudio realtime transcription transformers whisper whisper-large

Last synced: 25 Oct 2024

https://github.com/umlx5h/llplayer

The media player for language learning, with dual subtitles, AI-generated subtitles, realtime-OCR, translation, word lookup, and more!

asr csharp flyleaf language-learning media-player ocr player tesseract video video-player whisper wpf yt-dlp

Last synced: 01 Feb 2025

https://github.com/huuquyet/phowhisper-tiny

Converted clone of PhoWhisper: Automatic Speech Recognition for Vietnamese (2024)

onnx-models phowhisper speech-recognition transformersjs vietnamese vinai whisper

Last synced: 01 Feb 2025

https://github.com/samliebl/ai-whisper

Simple Node.js app: speech-to-text via whisper by OpenAI with file download.

nodejs openai speect-to-text transcription whisper whisper-ai

Last synced: 19 Dec 2024

https://github.com/diegoseg15/ia-tesis-backend

About Proyecto de tesis - Asistente Robot DORIS - Frontend

artificial-intelligence express gpt nodejs openai tts whisper

Last synced: 08 Feb 2025

https://github.com/luluw8071/whisper-tune

Finetuning Whisper on your own voice

whisper

Last synced: 07 Feb 2025

https://github.com/huuquyet/phowhisper-small

Converted clone of PhoWhisper: Automatic Speech Recognition for Vietnamese (2024)

onnx-models phowhisper speech-recognition transformersjs vietnamese vinai whisper

Last synced: 01 Feb 2025

https://github.com/ifeech/subtitler

Creating subtitles from video

subtitles whisper

Last synced: 08 Feb 2025

https://github.com/senkita/gabriel

视频总结工具。

summarizer whisper

Last synced: 08 Feb 2025

https://github.com/pjarbas/azure-ai

Examples using Azure AI services (DALLE3, Text to Speech, Whisper)

azure-openai dalle-3 image-generation-ai speech-synthesis text-to-speech whisper

Last synced: 21 Jan 2025

https://github.com/teemow/mnote

Generates meeting notes and summaries from video recordings

ai chatgpt google-meet kubeai kubernetes meeting-minutes transcription video-transcription whisper

Last synced: 02 Feb 2025

https://github.com/dheison0/subcreator

A subtitle creator, translator and embeder tool made using AI

ai machine-learning ml python subtitles video-processing whisper

Last synced: 08 Feb 2025

https://github.com/tristan-mcinnis/Simultaneous-Interpretation

Simultaneous-Interpretation is an advanced tool for real-time simultaneous interpretation. It transcribes and translates spoken language from a microphone input instantaneously, continually refining translations for accuracy. Ideal for business meetings, educational settings, and live events, it enhances multilingual communication effortlessly.

agents asr faster-whisper openai pyaudio simultaneous-intepreting simultaneous-translation speech-recognition speech-to-text transcription translation whisper

Last synced: 08 Feb 2025

https://github.com/man2dev/whisper-cpp

dev fork of https://src.fedoraproject.org/rpms/whisper-cpp

fedora fedora-repository linux whisper whisper-cpp whispercpp

Last synced: 08 Feb 2025

https://github.com/datvm/openaiwhisperclient

A HTML page for using OpenAI Whisper API for transcripting, including making subtitles. JSON is also supported.

client-side openai subtitle timestamp transcript transcription whisper whisper-ai

Last synced: 08 Feb 2025

https://github.com/fkiller/whispertranscript

Transcribe voice from mic input using OpenAI Whisper API.

llm openai transcribe transcript transcription webaudio whisper

Last synced: 06 Jan 2025

https://github.com/suchith-2002/whisperwave

Transcribe any Audio to Text.

openai whisper

Last synced: 03 Feb 2025

https://github.com/tomdewildt/whisper-experiment

Experiments using the Whisper model from Open AI

colab jupyter python transcribe transformers translate whisper

Last synced: 27 Dec 2024

https://github.com/thealphamerc/audio-to-text

Transcribe multi-lingual audio clips using whisper model

openai whisper

Last synced: 02 Feb 2025

https://github.com/mariatepei/vt_thesis_mtepei

This repository accompanies my MSc Thesis for the degree Voice Technology, storing all referenced data and other relevant resources.

data-augmentation fastspeech2 speech-recognition whisper

Last synced: 08 Feb 2025

https://github.com/patryk-ku/sasayaki

A small CLI tool that simplifies and automates the process of installing and using AI models to transcribe and translate videos.

automation cli faster-whisper gemini-api transcription translation whisper whisper-cpp

Last synced: 05 Jan 2025

https://github.com/mai-reborn/mai-offline-transcriber

Offline audio/video transcriber using Whisper, saving to .txt or .srt. Ensures privacy, no external servers used.

asr audio-transcription offline-transcriber pyqt6 python speech-recognition video-transcription whisper

Last synced: 05 Jan 2025

https://github.com/iamarunbrahma/smart-voice-assistant

A simple voice assistant to get your queries in speech format and generate answers using ChatGPT API in both text and audio format.

chatgpt tts whisper

Last synced: 02 Feb 2025

https://github.com/orhancavus/transcribe_video

Extract Subtitles from YouTube Videos with OpenAI Whisper and Insanely Fast Whisper

insanely-fast speach-to-text whisper

Last synced: 09 Jan 2025

https://github.com/homelab-00/longformstt

A python script that utilizes faster-whisper and pytorch for long form transcription. Uses silence detection with RMS/peak value. Has global hotkeys for easy use.

faster-whisper python speech-to-text whisper

Last synced: 09 Jan 2025