Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Whisper

Whisper is an automatic speech recognition (ASR) model developed by OpenAI. It is an encoder-decoder Transformer trained on roughly 680,000 hours of multilingual, multitask audio collected from the web. Whisper can be used for multilingual speech transcription, speech translation into English, and language identification. OpenAI released the model code and weights as open source under the MIT license, and they are publicly available in the openai/whisper repository.

GitHub: https://github.com/topics/whisper
Repo: https://github.com/openai/whisper
Created by: OpenAI
Released: September 2022
Related Topics: machine-learning, artificial-intelligence, language-modeling
Last updated: 2025-02-19 00:30:04 UTC

https://github.com/sivakumar-mahalingam/subtitle-generator

🎞️ Automatically generates subtitles for video files using the Whisper ASR model in Python

ai audio-model audio-processing automatic-speech-recognition openai-whisper python speech-recognition speech-to-text subtitle-generator whisper

Last synced: 08 Feb 2025
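
A subtitle generator of this kind typically turns timed transcript segments into an SRT file. The sketch below is not taken from the repository above; it assumes each segment is a dict with `start`, `end` (seconds) and `text` keys, which is the shape of the `segments` list returned by openai-whisper's `transcribe`. The helper names `srt_timestamp` and `segments_to_srt` are hypothetical.

```python
def srt_timestamp(seconds):
    """Format a time in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments):
    """Render segments (dicts with 'start', 'end', 'text') as SRT text."""
    blocks = []
    for index, seg in enumerate(segments, start=1):
        start = srt_timestamp(seg["start"])
        end = srt_timestamp(seg["end"])
        # Each SRT cue: running index, time range, then the subtitle text.
        blocks.append(f"{index}\n{start} --> {end}\n{seg['text'].strip()}\n")
    # Cues are separated by a blank line.
    return "\n".join(blocks)
```

Writing the returned string to a `.srt` file next to the video is usually enough for media players to pick up the subtitles.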

https://github.com/paszkoo/real_time_whisper_iot

Real time voice transcription from default audio input using faster-whisper

ai iot-application iot-device smart-home voice-assistant voice-recognition whisper

Last synced: 17 Jan 2025

https://github.com/mario-huang/whisper-desktop

A desktop app for easy subtitling using the Whisper model.

ai desktop gradio open-source python pytorch tauri web-ui whisper

Last synced: 17 Jan 2025

https://github.com/umlx5h/llplayer

The media player for language learning, with dual subtitles, AI-generated subtitles, realtime-OCR, translation, word lookup, and more!

asr csharp flyleaf language-learning media-player ocr player tesseract video video-player whisper wpf yt-dlp

Last synced: 01 Feb 2025

https://github.com/pratikpakhale/terravis

Voice guided GIS system

genai gis lam llm voice whisper

Last synced: 08 Feb 2025

https://github.com/ty-martz/audiologic

Python module to process and predict music attributes

machine-learning music python whisper

Last synced: 24 Oct 2024

https://github.com/hanpham32/react-native-whisper

A simple text transcription web/mobile app

flask ngrok react-native transcribe whisper

Last synced: 15 Feb 2025

https://github.com/soenneker/soenneker.runners.whisper.ctranslate

Automatically updates the Soenneker.Whisper.CTranslate package

ai csharp ctranslate ctranslate2 dotnet faster library runner runners whisper whisperctranslate

Last synced: 18 Feb 2025

https://github.com/huuquyet/phowhisper-tiny

Converted clone of PhoWhisper: Automatic Speech Recognition for Vietnamese (2024)

onnx-models phowhisper speech-recognition transformersjs vietnamese vinai whisper

Last synced: 01 Feb 2025

https://github.com/xi-rick/captains-log

Captain's Log is your personal AI-powered voice transcription logbook. This innovative web application allows you to transcribe spoken words into text, organize your thoughts, and manage important notes. Built with cutting-edge technology and creative design, Captain's Log sets sail to revolutionize how you capture and manage ideas.

audio-recorder audio-visualizer javascript mongodb mongodb-atlas nextjs once-ui openai react reactjs shadcn-ui tailwindcss typescript voice whisper

Last synced: 21 Jan 2025

https://github.com/simongino/whisper-fastapi

A FastAPI-based application integrating Whisper for efficient speech recognition and processing.

ai docker fastapi python whisper

Last synced: 08 Feb 2025

https://github.com/theaussiepom/wyoming-openai

OpenAI STT and TTS support for the Wyoming protocol

home-assistant home-assistant-assist openai sst tts whisper wyoming

Last synced: 13 Feb 2025

https://github.com/ashot72/answering-questions-about-images

You can upload images, ask questions about images using voice prompts, then listen to the responses in voice

answering-questions blip-2-ai-model gtts large-language-models llm replicate speech-to-text text-to-speech whisper

Last synced: 20 Feb 2025

https://github.com/mottla/speech-to-text

Local and fast speech to text (STT) with speaker recognition. Transcribe your meetings confidentially.

huggingface speech-recognition stt teams transcription translation whisper zoom

Last synced: 21 Jan 2025

https://github.com/ubos-tech/node-red-contrib-speech-to-text-ubos

Learn how to turn audio into text.

ai low-code lowcode node-red node-red-contrib node-red-flow openai openai-api openai-whisper speech-to-text whisper whisper-ai whisper-api

Last synced: 20 Jan 2025

https://github.com/leafyeexyz/counselorleaf

An AI psychological counselor that keeps you company at any time

cloudflare-api cloudflare-pages cloudflare-workers counselling counselor javascript psychology qwen react reactjs whisper

Last synced: 11 Dec 2024

https://github.com/cp3249/athena_project

Athena is an AI assistant project that utilizes voice recognition, text-to-speech, and tool-calling capabilities to provide a conversational and interactive experience. It uses LLMs available through Ollama and provides a basic framework for extending functionalities through a modular tool system.

coqui-tts llm ollama whisper

Last synced: 15 Jan 2025

https://github.com/LarissaGuder/whisper-datastream

Transcription and NER in streaming environment

bert-ner python spark-streaming whisper

Last synced: 24 Oct 2024

https://github.com/cnseniorious000/dl-a2t

download, audio-to-text PyPI: https://pypi.org/p/dl-a2t

audio transcription whisper youtube

Last synced: 02 Jan 2025

https://github.com/zahidhasann88/video-summarizer

Summarizes videos by extracting the audio and generating summaries based on the audio content.

nodejs openai typescript whisper

Last synced: 07 Jan 2025

https://github.com/samliebl/ai-whisper

Simple Node.js app: speech-to-text via whisper by OpenAI with file download.

nodejs openai speect-to-text transcription whisper whisper-ai

Last synced: 12 Feb 2025

https://github.com/sskorol/home-assistant-voice

Home Assistant Voice PE Setup Guide

docker home-assistant home-automation piper smart-home speech-recognition speech-synthesis voice-assistant whisper

Last synced: 04 Feb 2025

https://github.com/khushijtrivedi/speech

The Assistive Speech Technology System is designed to enhance communication by analyzing and processing various speech and audio inputs.

ajax bigru-crf bootstrap flask flask-server html-css-javascript librosa python restapi-framework voice-recognition whisper

Last synced: 08 Feb 2025

https://github.com/educa-ch/educa24-speech-to-summary

Demonstrator for an open-source speech-to-summary workflow

langchain ollama open-source open-weight speech-to-text summarization whisper

Last synced: 11 Feb 2025

https://github.com/julrog/jokes-on-you

Storyteller

ggj2024 global-game-jam openai unity whisper

Last synced: 09 Feb 2025

https://github.com/s-emanuilov/whispercpp_kit

A wrapper on whisper.cpp with additional helper features like model management capabilities.

asr whisper

Last synced: 13 Dec 2024

https://github.com/a-iceberg/whisper_model_evaluator

Compares the WER, MER, and WIL of Whisper, Vosk, and Google transcribers

asr audio-to-text automatic-speech-recognition data-analysis evaluation google-speech-recognition python tuning-parameters visualization vosk whisper

Last synced: 24 Oct 2024
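
For context on the metrics named in this entry, word error rate (WER) is the word-level edit distance between a reference transcript and a hypothesis, divided by the number of reference words. The sketch below is a minimal illustration and not the repository's code; it assumes whitespace tokenization with no text normalization, and the function names are hypothetical. MER and WIL are not covered.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (substitute/insert/delete cost 1)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def word_error_rate(reference, hypothesis):
    """WER = word-level edit distance / number of reference words."""
    ref_words = reference.split()
    hyp_words = hypothesis.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hyp_words) / len(ref_words)
```

In practice, both transcripts are usually lowercased and stripped of punctuation before scoring, since otherwise formatting differences count as errors.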

https://github.com/diegoseg15/ia-tesis-backend

Thesis project - DORIS robot assistant - Frontend

artificial-intelligence express gpt nodejs openai tts whisper

Last synced: 08 Feb 2025

https://github.com/403errors/tubequery

TubeQuery is an LLM-based tool that answers queries about your video. Just input the video link, and all questions are welcome!

huggingface-transformers langchain nlp-machine-learning pipeline python3 tiktoken whisper yt-dlp

Last synced: 14 Feb 2025

https://github.com/cybergen49/ai-note-taker

A clientside-only webapp that uses OpenAI's whisper and GPT models to transcribe audio and convert the transcript to notes, summaries, or other more concise content.

ai api gpt note-taking openai productivity summarizer whisper

Last synced: 14 Feb 2025

https://github.com/chloelavrat/speech-to-text-app

Speech-to-text web app based on Streamlit and Whisper that extracts a transcript from audio or a YouTube video.

audio-processing machine-learning machinelearning speech-to-text streamlit streamlit-webapp stt whisper whisper-ai

Last synced: 02 Jan 2025

https://github.com/yuxiang32/Audio-Transcription

Audio transcriber using OpenAI Whisper

openai whisper

Last synced: 24 Oct 2024

https://github.com/codewithdark-git/talktube

A powerful Streamlit application that allows users to analyze and interact with YouTube video content through natural language questions.

agents genai genai-domain groq groq-api langchain langchain-python llm lvlm lvlms pyhton3 python rag streamlit webapp whisper youtube youtube-bot

Last synced: 10 Feb 2025

https://github.com/deshwalmahesh/interview-help-cheat-live

As the name suggests, it helps you cheat in your live interviews or video calls. It transcribes your audio and provides answers to your query in real time. Supports equation rendering, custom prompts, text selection and editing. It's basically ChatGPT for cheating in interviews.

audio-transcription chatgpt fastapi huggingface interview interviews live openai pyaudio realtime transcription transformers whisper whisper-large

Last synced: 20 Feb 2025

https://github.com/josemarcosrf/Lexicap-QA

QA retrieval for Lex Fridman's podcast transcriptions

lexicap qa search whisper

Last synced: 24 Oct 2024

https://github.com/zdwolfe/transcription-tools

Docker video transcriber, wrapper around OpenAI

openai transcription whisper whisper-ai

Last synced: 02 Jan 2025

https://github.com/webmural/rewind

rewind mural

mural whisper wind

Last synced: 29 Jan 2025

https://github.com/tylim88/voicefu-back-end

Translate Speech Into Japanese

chatgpt speech-synthesis voicevox whisper

Last synced: 10 Feb 2025

https://gitlab.com/ifrz/asr-multi-lite

Testing of the main ASR frameworks with reduced models for low-resource languages speech recognition

distilhubert wav2vec2 whisper

Last synced: 24 Oct 2024

https://github.com/aixerum/faster-whisper

faster-whisper is a reimplementation of OpenAI's Whisper model using CTranslate2, which is a fast inference engine for Transformer models. This implementation is up to 4 times faster than openai/whisper for the same accuracy while using less memory. The efficiency can be further improved with 8-bit quantization on both CPU and GPU.

ctranslate2 gpu transcription whisper

Last synced: 07 Jan 2025

Whisper Awesome Lists

Awesome-Korean-Speech-Recognition 30
awesome-openai-whisper 58

Whisper Categories

Tutorials 109
Applications 21
Model Variants 11
In addition to Korean datasets 11
Korean speech recognition APIs 9
API Ready / Playground / Demo 6
Why calculate with CER (Character Error Rate)? 5
Articles 4
General Resources 4
Videos 4
Introducing Awesome Korean Speech Recognition 3