Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Whisper

Whisper is an automatic speech recognition (ASR) model developed by OpenAI. It is an encoder-decoder transformer trained on roughly 680,000 hours of multilingual, weakly supervised audio collected from the web. Whisper can transcribe speech in many languages, translate speech into English, and identify the spoken language. OpenAI released the model code and weights as open source under the MIT license in September 2022, and the model is also available through the OpenAI API. The projects below build on Whisper for tasks such as transcription, subtitle generation, speaker diarization, and voice assistants.

https://github.com/extrange/transcription-benchmarks

Speech to text model benchmarks

transcription whisper

Last synced: 08 Dec 2024

https://github.com/markshawn2020/2025-02-03_lex-fridman-deepseek

Transcription and translation scripts for Lex Fridman podcast about DeepSeek, at 2025-02-03

assemblyai deepl deepseek lexfridman whisper xunfei

Last synced: 04 Feb 2025

https://github.com/pawelzeja098/whisper-video-transcription

Testing OpenAI's Whisper to transcribe videos

audio mp3 mp4 transcription video whisper whisper-ai

Last synced: 27 Jan 2025

https://github.com/jgw96/speech-to-text-web-toolkit

Making Speech-To-Text on the web easy, both local and in the cloud

ai lit transformersjs webcomponents whisper

Last synced: 01 Feb 2025

https://github.com/mickekring/top-of-mind-clara

Clara is a prototype that lets employees make their voices heard anonymously. An employee can speak or type what they want to say, and AI anonymizes it. The employee also has access to a chatbot to consult. All employees' thoughts are then analyzed and compiled in a dashboard.

ai chatbot feedback openai python streamlit transcription whisper

Last synced: 22 Dec 2024

https://github.com/topdev0215/AudioMultifunctionChatbot

This app lets users either record or upload audio files. It then uses the OpenAI API (Whisper, GPT-4) to generate transcriptions, summaries, fact checks, sentiment analysis, and text metrics. Users can also chat about their transcriptions with a GPT-4 chatbot. Data is stored relationally in SQLite and also vectorized in Pinecone.

gpt4 langcha nltk openai python3 sqlite3 streamlit strean whisper

Last synced: 24 Oct 2024

https://github.com/pkarpovich/kira-client

An AI-powered voice automation tool for IoT, integrating voice-triggered commands, OpenAI-driven intent recognition, and HTTP server management for seamless control of smart devices

ai-assistant intent-classification porcupine trigger-word-detection whisper

Last synced: 13 Jan 2025

https://github.com/volkansah/text-to-speech-pygui-for-whisper

This is a simple Python-based GUI application that allows users to generate speech from text using the OpenAI API. The application provides a user-friendly interface for inputting text and selecting from different voices to create personalized audio output.

openai openai-api python-gui-tkinter python3 whisper whisper-ai

Last synced: 27 Jan 2025

https://github.com/becomingbabyman/eunoia-desktop

local desktop transcription and search for apple voice memos and videos

search second-brain transcription videos voice-memos whisper

Last synced: 25 Dec 2024

https://github.com/jojasadventure/whisper-client

Very simple Python based client for Whisper compatible endpoint

desktop-app dictation faster-whisper macos productivity python speech-to-text stt whisper

Last synced: 08 Feb 2025

https://github.com/chaoticbyte/audio-summarize

An audio summarizer (faster-whisper and BART glued together)

ai ai-summarizer audio bart ctranslate2 faster-whisper nlp speech-to-text summarization whisper

Last synced: 08 Feb 2025

https://github.com/team-mansumugang/mansumugang-backend

The Spring Boot application for the Mansumugang service.

aws github-actions jpa jpa-hibernate spring-boot whisper

Last synced: 08 Feb 2025

https://github.com/yc-w-cn/s-wave

S-WAVE is a browser-based podcast reading app with AI transcription. User data is stored locally. MIT License.

podcast pouchdb typescript wasm whisper whisper-cpp

Last synced: 28 Dec 2024

https://github.com/maawad/luna

Personal assistant

bot openai personal-assistant whisper

Last synced: 17 Dec 2024

https://github.com/ivanrj7j/transcription

This project transcribes audio using whisper and provides an api

ai api flask transcription whisper

Last synced: 08 Feb 2025

https://github.com/aeronjl/transcribe

Python package for accurate audio transcription with speaker diarisation

audio-transcription gpt speaker-diarization whisper

Last synced: 08 Feb 2025

https://github.com/huuquyet/phowhisper-next

Demo using PhoWhisper models of VinAI built with Transformers.js + Next.js

nextjs onnx-models phowhisper speech-recognition transformersjs vietnamese vinai whisper

Last synced: 19 Dec 2024

https://github.com/brentwong-kiel1997/brents_ai_language_school

Use AI such as ChatGPT and Whisper to learn foreign languages from YouTube videos

ai chatgpt foreign-language openai openai-api whisper whisper-ai youtube

Last synced: 31 Dec 2024

https://github.com/xeloxa/wtosrt

Effortlessly convert your whisper timestamped subtitles in an unknown/rarely used format to the more familiar SRT format.

conversion python srt-subtitles subtitle subtitle-edit subtitle-format timestamp timestamp-convert whisper

Last synced: 04 Feb 2025

https://github.com/aaishikdutta/notebook-lm-podcast-audiogram

A simple project to convert NotebookLM audio (or any audio, for that matter) into a podcast audiogram with subtitles powered by OpenAI Whisper

audiogram openai podcast remotion whisper

Last synced: 08 Dec 2024

https://github.com/voqal/browser

Natural speech browsing for the software developers of tomorrow

cef jcef openai realtime-api voice voice-assistant voice-browser voice-commands voice-control whisper

Last synced: 20 Oct 2024

https://github.com/niqifan007/openai-tts-stt-streamlit

A GUI for the OpenAI API's TTS (text-to-speech) and STT (speech-to-text) endpoints, built with Streamlit, with a history feature

openai openai-api streamlit stt-gui tts tts-gui whisper whisper-api

Last synced: 08 Feb 2025

https://github.com/shtirmann/v2t

Telegram bot which automatically transcribes all voice and video messages to text.

ai aiogram faster-whisper python telegram-bot telegram-bot-python voice-to-text whisper

Last synced: 08 Feb 2025

https://github.com/seanvelasco/ai

Cloudflare AI challenge submission: Slater - your virtual foreign language friend

ai artificial-intelligence language-learning llama2 llm m2m100 machine-learning whisper

Last synced: 03 Feb 2025

https://github.com/rhysdg/whisper-onnx-python

A low-footprint GPU accelerated Speech to Text Python package for the Jetpack 5 era bolstered by an optimized graph

ai chatbot cuda machine-learning onnxruntime speech-to-text whisper

Last synced: 08 Feb 2025

https://github.com/jpzinn654/speaker-diarization-portuguese

This project implements speaker diarization for Portuguese audio using WhisperX for transcription and pyannote.audio's Speaker-Diarization 3.1 for speaker separation. It includes a Flask UI for easy file upload, transcription, and speaker identification.

flask gender-detection portuguese-language speaker-diarization speaker-recognition speech-recognition transcription whisper

Last synced: 28 Jan 2025

https://github.com/carlosulisesochoa/whisper-ai-transcription-audio-to-text-file

A Python tool that uses OpenAI's Whisper model to batch transcribe audio files with GPU acceleration. Features include multi-language support, timestamp-based output, automatic file status checking, and CUDA support for faster processing. Perfect for transcribing lectures, interviews, or any audio content with high accuracy.

ai audio-to-text transcription whisper

Last synced: 28 Jan 2025

https://github.com/bbc-esq/batch-openai-whisper-ctranslate2

Batch process multiple files using the fastest ctranslate2 implementation of OpenAI's Whisper

batch-processing batch-script openai openai-whisper pyside6 transcription translation whisper whisperx

Last synced: 11 Jan 2025

https://github.com/xawos/owt

🦙🗣️ Ollama and Whisper Telegram bot, with advanced configuration

ai-bots local-ai ollama telegram-aichatbot telegram-bots whisper

Last synced: 28 Jan 2025

https://github.com/saamerm/whisperkit-ios15

iOS 15 - On-device Inference of Whisper Speech Recognition Models for Apple Silicon

ios ios15 swiftui whisper whisper-ai

Last synced: 19 Jan 2025

https://github.com/tposcic/audio-to-srt-transcriber

Audio to srt transcriber in Python using whisper for transcription and Tcl/Tk for GUI

audio python3 srt transcription whisper

Last synced: 05 Jan 2025

https://github.com/kunesj/holo-subs-search

Tool for searching transcriptions of vtuber videos.

holodex pyannote transcription vtuber whisper youtube

Last synced: 19 Jan 2025

https://github.com/rokbenko/arctic-meet

ArcticMeet is an AI meeting assistant using Streamlit for the GUI and the Snowflake Arctic LLM via the Snowflake Cortex for the AI features

ffmpeg pandas plotly python pytorch snowflake snowflake-arctic snowflake-cortex snowpark streamlit transformers whisper

Last synced: 11 Jan 2025

https://github.com/crone-ai/force-align-wordstamps

Takes audio (mp3) and text input (string) and force aligns the text to the audio. Uses stable-ts and whisperx.

captions faster-whisper force-alignment stable-ts whisper

Last synced: 17 Jan 2025

https://github.com/abdnh/anki-asr

Anki add-on for speech recognition

anki anki-addon deepgram speech-recognition whisper

Last synced: 24 Nov 2024

https://github.com/h3yn3s/tl-dl

A self-hostable web app which helps you read those uselessly long (by nature) voice messages with the power of AI.

sveltekit tailwind whisper

Last synced: 24 Oct 2024

https://github.com/lidedongsn/cut.ai

cut.ai is an AI audio and video editing tool; its speech transcription is based on Whisper

whisper whisper-ui

Last synced: 17 Jan 2025

https://github.com/shani-sinojiya/sandalquest

AI/ML project for recognizing colloquial Kannada speech and building a speech-based Q&A system focused on sandalwood cultivation.

ai audio-processing data-augmentation deep-learning machine-learning mongodb nlp python pytorch question-answering speech-based-question-answering-system speech-recognition whisper

Last synced: 10 Jan 2025

https://github.com/jesse-c/local-audio-toolkit

Some handy tools to do with audio locally.

large-language-models lm-studio macos side-project whisper

Last synced: 29 Jan 2025

https://github.com/mikeesto/subber

A small CLI tool for converting video & audio to a text transcription

audio cli ffmpeg golang transcribe video whisper

Last synced: 19 Dec 2024

https://github.com/etienneab3d/srt-sync

Synchronize SRT timestamps over an existing accurate transcription

aligner asr nlp subtitles text-to-speech whisper

Last synced: 19 Dec 2024

https://github.com/tylim88/voicefu

Translate Speech Into Japanese

chatgpt speech-synthesis voicevox whisper

Last synced: 18 Dec 2024

https://github.com/baristikir/voice-typing

Simple Desktop Application with Voice Typing features. Runs locally, transcribes locally and works fully offline with support for real-time transcribing. Powered by OpenAI Whisper ASR-models and whisper.cpp inference engine

electron whisper whisper-cpp

Last synced: 24 Dec 2024

https://github.com/sugarcane-mk/speaker_classification

This repository provides a Python script for extracting speech embeddings using OpenAI's Whisper model. The embeddings are high-dimensional feature vectors that capture the acoustic properties of the input audio. These embeddings can be used for downstream tasks such as speech classification, clustering, and speaker recognition.

asr classification feature-extraction openai speech-processing speech-recognition speech-to-text svm-classifier whisper

Last synced: 09 Jan 2025

https://github.com/natanielf/lecsum

Automatically transcribe and summarize lecture recordings completely on-device using AI.

ollama ollama-python whisper whisper-ai

Last synced: 18 Dec 2024

https://github.com/roman01la/sub-deep

Transcribe and translate audio with AI

deepl transcribe translate whisper

Last synced: 30 Dec 2024

https://github.com/kristofferv98/whisper_turboapi

An optimized FastAPI server for OpenAI's Whisper whisper-large-v3-turbo model using MLX turbo optimization

ai api asynchronous audio audio-processing fastapi huggingface machine-learning macos mlx model-serving nlp openai optimization python speech-to-text synchronous transcription whisper whisper-turbo

Last synced: 14 Dec 2024

https://github.com/aquibali01/voice-to-text-and-voice-chatbot

Voice-to-Voice Chatbot using Whisper, LLaMA, and Groq API

chatbot gtts llama8b llm opeai python voice whisper

Last synced: 19 Dec 2024

https://github.com/njorogemaurice/speech-recognition-openai-whisper

This project is a web-based application that utilizes OpenAI's Whisper for speech-to-text conversion. The application allows users to upload audio files or record audio directly from their browser, and then converts the speech in these audio files to text using the Whisper model.

openai speech-recognition speech-to-text whisper

Last synced: 14 Jan 2025

https://github.com/luluw8071/whisper-tune

Finetuning Whisper on your own voice

whisper

Last synced: 07 Feb 2025

https://github.com/deshwalmahesh/whisper-fastapi-realtime

A frontend and backend app that runs openai/whisper-large-v3-turbo on a consumer-grade system to provide real-time live audio transcription

audio-transcription fastapi huggingface live pyaudio realtime transcription transformers whisper whisper-large

Last synced: 25 Oct 2024

https://github.com/orhancavus/transcribe_video

Extract Subtitles from YouTube Videos with OpenAI Whisper and Insanely Fast Whisper

insanely-fast speach-to-text whisper

Last synced: 09 Jan 2025

https://github.com/homelab-00/longformstt

A python script that utilizes faster-whisper and pytorch for long form transcription. Uses silence detection with RMS/peak value. Has global hotkeys for easy use.

faster-whisper python speech-to-text whisper

Last synced: 09 Jan 2025

https://github.com/flo-bit/youtube-speaker-separation

simple python script that outputs separate audio files for each speaker in a youtube video, using whisper on replicate

speaker-diarization speech-to-text text-to-speech voice-cloning whisper youtube

Last synced: 19 Dec 2024

https://github.com/xi-rick/captains-log

Captain's Log is your personal AI-powered voice transcription logbook. This innovative web application allows you to transcribe spoken words into text, organize your thoughts, and manage important notes. Built with cutting-edge technology and creative design, Captain's Log sets sail to revolutionize how you capture and manage ideas.

audio-recorder audio-visualizer javascript mongodb mongodb-atlas nextjs once-ui openai react reactjs shadcn-ui tailwindcss typescript voice whisper

Last synced: 21 Jan 2025

https://github.com/mickekring/top-of-mind-beromfabriken

Praising a colleague can feel a little awkward, but research has shown that it can make us feel better at work and even make us more productive. Hearing that colleagues value and notice you simply improves your well-being.

api gpt openai python transcription whisper

Last synced: 16 Jan 2025

https://github.com/mottla/speech-to-text

Local and fast speech to text (STT) with speaker recognition. Transcribe your meetings confidentially.

huggingface speech-recognition stt teams transcription translation whisper zoom

Last synced: 21 Jan 2025

https://github.com/khushijtrivedi/speech

The Assistive Speech Technology System is designed to enhance communication by analyzing and processing various speech and audio inputs.

ajax bigru-crf bootstrap flask flask-server html-css-javascript librosa python restapi-framework voice-recognition whisper

Last synced: 08 Feb 2025

https://github.com/levysantiago/upload-ai

This is a system that uses OpenAI's Whisper and ChatGPT to generate titles and descriptions by analyzing submitted videos.

ai artificial-intelligence axios chatgpt fastify ffmpeg nlw-13 node openai prisma react rocketseat tailwindcss typescript vite whisper zod

Last synced: 12 Jan 2025

https://github.com/tristan-mcinnis/simultaneous-interpretation

Simultaneous-Interpretation is an advanced tool for real-time simultaneous interpretation. It transcribes and translates spoken language from a microphone input instantaneously, continually refining translations for accuracy. Ideal for business meetings, educational settings, and live events, it enhances multilingual communication effortlessly.

agents asr faster-whisper openai pyaudio simultaneous-intepreting simultaneous-translation speech-recognition speech-to-text transcription translation whisper

Last synced: 17 Jan 2025

https://github.com/barrylee111/voicechat-llm

A chatbot with both prompt and voicechat capabilities leveraging LangChain, Elasticsearch, and FastAPI. When using voicechat, the user can immerse themselves in the experience by selecting a narrator, like a pirate for instance.

elasticsearch fastapi langchain largelanguagemodel python react speech-to-text tailwind text-to-speech typescript websocket whisper

Last synced: 19 Dec 2024

https://github.com/brunogaliati/speech2text-investments

This project automates the download, transcription, and summarization of audio from YouTube videos. Using OpenAI's Whisper model, it converts video content into concise text summaries with an investment analyst's perspective, ideal for professionals needing quick insights.

chatgpt investment openai politics python speech-recognition speech-to-text whisper

Last synced: 19 Dec 2024

https://github.com/danibcorr/university-helper

🧑‍🎓 University Helper streamlines academic and administrative tasks for students, educators, and researchers. It provides tools for managing document metadata, converting PDFs to Markdown, transcribing audio, analyzing grade statistics, and more.

deep-learning documentation-tool metadata ocr open-source pdf python statistics university whisper

Last synced: 19 Dec 2024

https://github.com/flaviodelgrosso/whisper-transcriber

Use OpenAI's Whisper to transcribe audio files and diarize the speakers in the transcribed text

ai audio-to-text diarization openai torch whisper

Last synced: 19 Dec 2024

https://github.com/cp3249/athena_project

Athena is an AI assistant project that utilizes voice recognition, text-to-speech, and tool-calling capabilities to provide a conversational and interactive experience. It uses LLMs available through Ollama and provides a basic framework for extending functionalities through a modular tool system.

coqui-tts llm ollama whisper

Last synced: 15 Jan 2025

https://github.com/egorsmkv/optimized-whisper-intel

Run quantized Whisper models only on CPU with Intel hardware

intel onnx onnxruntime quantized-neural-networks whisper

Last synced: 19 Dec 2024

https://github.com/meain/raus

Record audio until silence (RAUS)

audio hammerspoon transcription whisper whisper-cpp

Last synced: 17 Jan 2025

https://github.com/patryk-ku/sasayaki

A small CLI tool that simplifies and automates the process of installing and using AI models to transcribe and translate videos.

automation cli faster-whisper gemini-api transcription translation whisper whisper-cpp

Last synced: 05 Jan 2025

https://github.com/escarrie/transcriptaudio

A script that transcribes an audio file into a text file using Whisper AI

ai transcription whisper

Last synced: 17 Jan 2025

https://github.com/devgeekm/chat-it-up

Chat It Up! elevates conversations by transforming YouTube URLs, documents, and audio into text, enabling interactive Q&A and summaries. With one click, turn media into time-saving, knowledge-rich dialogues.

ai azure azure-functions azureservices blob-storage fastapi python rag whisper youtube-dl

Last synced: 20 Dec 2024

https://github.com/teemow/mnote

Generates meeting notes and summaries from video recordings

ai chatgpt google-meet kubeai kubernetes meeting-minutes transcription video-transcription whisper

Last synced: 02 Feb 2025

https://github.com/ts-azure-services/batch-transcription-examples

A repo to archive some code related to batch transcription for animation movies.

batch-transcription speech-to-text whisper

Last synced: 28 Jan 2025

https://github.com/televisionninja/chat

Chat with an AI Vtuber

ai chatbot llama llm tts vtube-studio vtuber whisper

Last synced: 20 Nov 2024

https://github.com/sixiaolong1117/whisperpythonscript

A simple Whisper Python script that transcribes the audio of media files into text with whisper and saves the result as subtitles with pysrt.

pysrt python python3 whisper whisper-ai

Last synced: 16 Jan 2025

https://github.com/cnseniorious000/dl-a2t

download, audio-to-text PyPI: https://pypi.org/p/dl-a2t

audio transcription whisper youtube

Last synced: 02 Jan 2025

https://github.com/datvm/openaiwhisperclient

An HTML page for using the OpenAI Whisper API for transcription, including making subtitles. JSON is also supported.

client-side openai subtitle timestamp transcript transcription whisper whisper-ai

Last synced: 08 Feb 2025

https://github.com/notyusheng/transcribe-translate_kubernetes

Local web app for transcription and translation services for audio and video using Whisper models

docker full-stack k8s kubernetes nodejs react reactjs self-hosted speech-to-text transcribe translate whisper

Last synced: 23 Jan 2025

https://github.com/zahidhasann88/video-summarizer

Summarizes videos by extracting audio and generating summaries based on the audio content.

nodejs openai typescript whisper

Last synced: 07 Jan 2025

https://github.com/mai-reborn/mai-offline-transcriber

Offline audio/video transcriber using Whisper, saving to .txt or .srt. Ensures privacy, no external servers used.

asr audio-transcription offline-transcriber pyqt6 python speech-recognition video-transcription whisper

Last synced: 05 Jan 2025

https://github.com/deshwalmahesh/interview-help-cheat-live

As the name suggests, it helps you cheat in your live interviews or video calls. It transcribes your audio and provides answers to your query in real time. Supports equation rendering, custom prompts, text selection and editing. It's basically chatGPT for cheating in interviews

audio-transcription chatgpt fastapi huggingface interview interviews live openai pyaudio realtime transcription transformers whisper whisper-large

Last synced: 31 Dec 2024

https://github.com/josemarcosrf/Lexicap-QA

QA retrieval for Lex Fridman's podcast transcriptions

lexicap qa search whisper

Last synced: 24 Oct 2024

https://github.com/stefanangelovski/voice_to_tweet

Tweet with your Voice using Whisper STT from OpenAI and Twitter4J flow to connect and talk with any account.

ai frontend openai twitter website whisper x

Last synced: 15 Dec 2024

https://github.com/mdbecker/whisper_cpp_macos_utils

Automated transcription workflow for macOS: Shell scripts to streamline audio recording, conversion, and transcription using whisper.cpp with macOS utilities like QuickTime Player and BlackHole-2ch.

audio-processing openai shell-scripts speech-to-text transcription whisper whisper-cpp

Last synced: 29 Jan 2025

https://github.com/valkryst/whisper_automations

Various scripts for automating tasks using OpenAI's Whisper.

automation openai subtitle subtitle-generator transcription translation whisper

Last synced: 26 Dec 2024