Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Whisper

Whisper is an automatic speech recognition (ASR) model developed by OpenAI. It uses an encoder-decoder transformer architecture and was trained on roughly 680,000 hours of multilingual, multitask audio data collected from the web. Whisper can be used for tasks such as multilingual speech transcription, speech translation into English, and language identification. OpenAI released the model weights and inference code as open source in 2022, and the projects listed below build on it for transcription, subtitling, voice assistants, and related speech-based applications.

https://github.com/breadrock1/audio-to-text

A simple backend project that uses whisper-rs.

actix-web audio-to-text rust swagger-ui whisper

Last synced: 10 Jan 2025

https://github.com/marquesafonso/multilang-asr-captioner

A multilingual automatic speech recognition and video captioning tool using faster-whisper. Supports real-time translation to English. Runs on a consumer-grade CPU.

automatic-speech-recognition captioning-videos faster-whisper whisper

Last synced: 24 Oct 2024

https://github.com/tranbavinhson/eth-decentralized-chat

Decentralized chat app built with the Ethereum Whisper protocol and Vue.js

ethereum vue vuejs whisper whisper-protocol

Last synced: 26 Dec 2024

https://github.com/ayeshaaaaaaaaa/ai-powered-video-analysis-with-object-detection-and-detailed-scene-narratives

AI-driven video analysis system that extracts and transcribes audio with Whisper, detects objects using YOLO, and generates comprehensive scene descriptions with GPT-2. The project combines transcriptions and object detections to produce detailed, context-aware video narratives.

bart gpt2 video-analysis whisper yolov8

Last synced: 02 Jan 2025

https://github.com/Op27/meeting_minutes_generator

This Python application automates the process of generating meeting minutes from an audio recording. It uses the Whisper library for transcription and the OpenAI GPT models for summarizing content, then outputs the result in a Word document.

ai audio-processing document-automation meeting-minutes openai python speech-recognition text-summarization transcription whisper

Last synced: 24 Oct 2024

https://github.com/nerdimite/meetsy-app

Frontend for the Workshop on Building an End-to-End AI Meeting Assistant

gpt-3 nextjs sentence-transformers tailwindcss whisper

Last synced: 24 Oct 2024

https://github.com/fukuro-kun/wortweber

Wortweber is an open-source project under development that explores real-time speech transcription with AI technology. It serves as a learning and experimentation platform for speech recognition in German and English.

speech-to-text whisper

Last synced: 17 Jan 2025

https://github.com/Shtirmann/V2T

Telegram bot which automatically transcribes all voice and video messages to text.

ai aiogram faster-whisper python telegram-bot telegram-bot-python voice-to-text whisper

Last synced: 24 Oct 2024

https://github.com/valiantlynx/custom-whisper-api

This project provides a custom API wrapper for the open-source Whisper model using FastAPI. It allows you to integrate Whisper into your applications for automatic speech recognition (ASR) tasks.

ai docker-compose fastapi python whisper

Last synced: 10 Jan 2025

https://github.com/bbc-esq/whisper-solo-with-gui

OpenAI's Whisper program with a simple lightweight GUI.

pyqt pyqt6 pyqt6-gui transcribe transcribe-audio-files translate whisper

Last synced: 11 Jan 2025

https://github.com/lazauk/aoai-entraidauth-sdkv1

Authenticating with Entra ID (formerly Azure AD) to access Azure OpenAI models with the Python SDK v1.x

ai authentication azure azure-active-directory dall-e embeddings entra-id gpt openai whisper

Last synced: 12 Jan 2025

https://github.com/marty1885/useful-whisper-server

Whisper server based on useful-transformers for the RK3588

npu rk3588 rockchip useful-transformers whisper

Last synced: 05 Dec 2024

https://github.com/tposcic/audio-to-srt-transcriber

Audio to srt transcriber in Python using whisper for transcription and Tcl/Tk for GUI

audio python3 srt transcription whisper

Last synced: 05 Jan 2025

https://github.com/tracywong117/ai-learning-material-from-video

Supports subtitling, translation, and RAG to generate language-learning material from video.

ai auto-subtitle gpt-translate groq groq-api rag subtitles-generator translate whisper

Last synced: 19 Jan 2025

https://github.com/antoniosbarotsis/telegram-transcriber

A Telegram bot for transcribing voice messages

telegram transcribe voice whisper

Last synced: 26 Dec 2024

https://github.com/yc-w-cn/s-wave

S-WAVE is a browser-based podcast reading app with AI transcription. User data is stored locally. MIT License.

podcast pouchdb typescript wasm whisper whisper-cpp

Last synced: 28 Dec 2024

https://github.com/kitschpatrol/ambient-novel

An interface for nonlinear interactive exploration of a novel.

ambient book fiction interactive novel svelte whisper

Last synced: 19 Nov 2024

https://github.com/pratikpakhale/terravis

Voice guided GIS system

genai gis lam llm voice whisper

Last synced: 09 Oct 2024

https://github.com/njorogemaurice/speech-recognition-openai-whisper

This project is a web-based application that utilizes OpenAI's Whisper for speech-to-text conversion. The application allows users to upload audio files or record audio directly from their browser, and then converts the speech in these audio files to text using the Whisper model.

openai speech-recognition speech-to-text whisper

Last synced: 14 Jan 2025

https://github.com/eva-kaushik/multilingual-transcription-with-openai_whisper

Whisper Automatic Speech Recognition (ASR) Model

openai openai-api transcription webapp whisper

Last synced: 22 Dec 2024

https://github.com/microsoft/azure-ai-foundry-whatsapp-bot

WhatsApp Bot built with Azure Functions and Azure AI Foundry, using Python.

azure-ai-foundry azure-functions azure-openai python whatsapp-api whatsapp-bot whisper

Last synced: 27 Nov 2024

https://github.com/educa-ch/educa24-speech-to-summary

Demonstrator for an open-source speech-to-summary workflow

langchain ollama open-source open-weight speech-to-text summarization whisper

Last synced: 11 Oct 2024

https://github.com/willdphan/little-jarvis-whisper

Jarvis, a GPT Voice Assistant made with speech recognition, OpenAI's Whisper, and Gradio

gradio openai voice-assistant voice-recognition whisper

Last synced: 24 Oct 2024

https://github.com/nelzomal/videolens_ai

VideoLens AI is a powerful Chrome extension that enhances your YouTube viewing experience

ai chrome-ai gemini-nano transformers whisper wxt

Last synced: 02 Dec 2024

https://github.com/userpjm/whisper-youtube

Generate a SubRip subtitle file (srt) using Whisper for the audio of a YouTube video.

faster-whisper openai speech-to-text whisper

Last synced: 24 Oct 2024

https://github.com/LarissaGuder/whisper-datastream

Transcription and NER in streaming environment

bert-ner python spark-streaming whisper

Last synced: 24 Oct 2024

https://github.com/msrsaditya/speech2speech

A Personal Digital Assistant designed to help you with quick responses.

ollama openai phi3 sox tts whisper

Last synced: 28 Nov 2024

https://github.com/pawelzeja098/whisper-video-transcription

Testing OpenAI's Whisper for transcribing videos

mp4 transcription whisper whisper-ai

Last synced: 28 Nov 2024

https://github.com/luluw8071/whisper-tune

Finetuning Whisper on your own voice

whisper

Last synced: 14 Dec 2024

https://github.com/concaption/containerized-transcription-api

Containerized Transcription API using Whisper Model and FastAPI

docker fastapi openai transcription whisper

Last synced: 16 Dec 2024

https://gitlab.com/ifrz/asr-multi-lite

Testing of the main ASR frameworks with reduced models for low-resource languages speech recognition

distilhubert wav2vec2 whisper

Last synced: 24 Oct 2024

https://github.com/seanvelasco/ai

Cloudflare AI challenge submission: Slater - your virtual foreign language friend

ai artificial-intelligence language-learning llama2 llm m2m100 machine-learning whisper

Last synced: 09 Dec 2024

https://github.com/werserk/techstormhack-1st-place

Solution for the TechStorm competition held by the Tatneft corporation, on analyzing team members' activity in video conference calls

pyannote speaker-diarization speech-recognition streamlit whisper

Last synced: 11 Jan 2025

https://github.com/velocitatem/dontlectureme

A program that pays attention to your lectures for you.

ai lectures university whisper

Last synced: 03 Dec 2024

https://github.com/tylim88/Voicefu-back-end

Translate Speech Into Japanese

chatgpt speech-synthesis voicevox whisper

Last synced: 24 Oct 2024

https://github.com/philogicae/docker-faster-whisper-fr-api

Docker - Faster Whisper FR - RunPod Serverless API

ctranslate2 docker faster-whisper french runpod serverless whisper

Last synced: 08 Jan 2025

https://github.com/yuxiang32/Audio-Transcription

Audio transcriber using OpenAI Whisper

openai whisper

Last synced: 24 Oct 2024

https://github.com/josemarcosrf/Lexicap-QA

QA retrieval for Lex Fridman's podcast transcriptions

lexicap qa search whisper

Last synced: 24 Oct 2024

https://github.com/zuplyx/subtitle-creator

Add English subtitles to videos using openai/whisper-large-v3

open-ai poetry-python python3 subtitles-generator whisper

Last synced: 09 Dec 2024

https://github.com/dheison0/subcreator

A subtitle creator, translator, and embedder tool made using AI

ai machine-learning ml python subtitles video-processing whisper

Last synced: 09 Oct 2024

https://github.com/simongino/whisper-fastapi

A FastAPI-based application integrating Whisper for efficient speech recognition and processing.

ai docker fastapi python whisper

Last synced: 09 Oct 2024

https://github.com/ivanrj7j/transcription

This project transcribes audio using Whisper and provides an API

ai api flask transcription whisper

Last synced: 09 Oct 2024

https://github.com/bilelouahmed/vocal-assistant

Python voice assistant (based on SpeechRecognition, Whisper and XTTS models) designed to transcribe speech to text, translate across languages, engage in chat mode, and ultimately respond vocally.

chatbot llm mistral-7b neo4j python rag speech-recognition text-to-speech transcription whisper xtts

Last synced: 21 Dec 2024

https://github.com/arslanex/whisperdemo

A scalable Python module for robust audio transcription using OpenAI's Whisper model. Supports multiple languages, batch processing, and output formats like JSON and SRT.

audio-processing openai openai-whisper python whisper

Last synced: 23 Nov 2024

https://github.com/devgeekm/chat-it-up

Chat It Up! elevates conversations by transforming YouTube URLs, documents, and audio into text, enabling interactive Q&A and summaries. With one click, turn media into time-saving, knowledge-rich dialogues.

ai azure azure-functions azureservices blob-storage fastapi python rag whisper youtube-dl

Last synced: 20 Dec 2024

https://github.com/paszkoo/real_time_whisper_iot

Real time voice transcription from default audio input using faster-whisper

ai iot-application iot-device smart-home voice-assistant voice-recognition whisper

Last synced: 17 Jan 2025

https://github.com/mario-huang/whisper-desktop

A desktop app for easy subtitling using the Whisper model.

ai desktop gradio open-source python pytorch tauri web-ui whisper

Last synced: 17 Jan 2025

https://github.com/egorsmkv/optimized-whisper-intel

Run quantized Whisper models only on CPU with Intel hardware

intel onnx onnxruntime quantized-neural-networks whisper

Last synced: 19 Dec 2024

https://github.com/jt-427/whisper-ui

A minimalist and elegant UI for OpenAI's Whisper speech-to-text model, built with React + Vite and Flask

flask openai react speech-to-text transcription vite whisper

Last synced: 19 Jan 2025

https://github.com/ifeech/subtitler

Creating subtitles from video

subtitles whisper

Last synced: 09 Oct 2024

https://github.com/charlot-dedjinou/hackathon-ia-multimodal-multilingue

During this hackathon, we developed Smart VT, an AI-based web application designed to subtitle and dub any video from one language into another (of your choice). The project relies on a React frontend, Python APIs for video processing, and Node.js for managing video subtitles.

api dubble fastapi ffmpeg googletranslator mongodb moviepy nodejs openia reactjs subtitles whisper

Last synced: 12 Jan 2025

https://github.com/cp3249/athena_project

Athena is an AI assistant project that utilizes voice recognition, text-to-speech, and tool-calling capabilities to provide a conversational and interactive experience. It uses LLMs available through Ollama and provides a basic framework for extending functionalities through a modular tool system.

coqui-tts llm ollama whisper

Last synced: 15 Jan 2025

https://github.com/thealphamerc/audio-to-text

Transcribe multilingual audio clips using the Whisper model

openai whisper

Last synced: 16 Dec 2024

https://github.com/flaviodelgrosso/whisper-transcriber

Use OpenAI's Whisper to transcribe audio files and diarize the speakers in the transcribed text

ai audio-to-text diarization openai torch whisper

Last synced: 19 Dec 2024

https://github.com/danibcorr/university-helper

🧑‍🎓 University Helper streamlines academic and administrative tasks for students, educators, and researchers. It provides tools for managing document metadata, converting PDFs to Markdown, transcribing audio, analyzing grade statistics, and more.

deep-learning documentation-tool metadata ocr open-source pdf python statistics university whisper

Last synced: 19 Dec 2024

https://github.com/jalvarezz13/summarai

SummarAI utilizes PyMovie and Whisper to transcribe videos, enabling you to ask questions about the content using Llama2 and Llama-index for insightful interaction.

llama-index llama2 pymovie whisper

Last synced: 22 Dec 2024

https://github.com/brunogaliati/speech2text-investments

This project automates the download, transcription, and summarization of audio from YouTube videos. Using OpenAI's Whisper model, it converts video content into concise text summaries with an investment analyst's perspective, ideal for professionals needing quick insights.

chatgpt investment openai politics python speech-recognition speech-to-text whisper

Last synced: 19 Dec 2024

https://github.com/barrylee111/voicechat-llm

A chatbot with both prompt and voicechat capabilities leveraging LangChain, Elasticsearch, and FastAPI. When using voicechat, the user can immerse themselves in the experience by selecting a narrator, like a pirate for instance.

elasticsearch fastapi langchain largelanguagemodel python react speech-to-text tailwind text-to-speech typescript websocket whisper

Last synced: 19 Dec 2024

https://github.com/arkaniightt/web_app_transcriptor_openai

Automatic audio-to-text transcription tool using Streamlit and OpenAI, with support for microphone input, video, and audio file upload.

ai app openai python streamlit tool tools transcript transcription webapp whisper

Last synced: 12 Dec 2024

https://github.com/cnseniorious000/dl-a2t

Download and audio-to-text. PyPI: https://pypi.org/p/dl-a2t

audio transcription whisper youtube

Last synced: 02 Jan 2025

https://github.com/khushijtrivedi/speech

The Assistive Speech Technology System is designed to enhance communication by analyzing and processing various speech and audio inputs.

ajax bigru-crf bootstrap flask flask-server html-css-javascript librosa python restapi-framework voice-recognition whisper

Last synced: 09 Oct 2024

https://github.com/evilfreelancer/whisper-tests

Collection of experiments on OpenAI Whisper models

api-server docker-compose testing transcription whisper

Last synced: 17 Dec 2024

https://github.com/zahidhasann88/video-summarizer

Summarizes videos by extracting audio and generating summaries based on the audio content.

nodejs openai typescript whisper

Last synced: 07 Jan 2025

https://github.com/lifeosm/whisper

🐳 Docker image with OpenAI Whisper.

docker octolab speech-to-text whisper

Last synced: 24 Oct 2024

https://github.com/s-emanuilov/whispercpp_kit

A wrapper on whisper.cpp with additional helper features like model management capabilities.

asr whisper

Last synced: 13 Dec 2024

https://github.com/flo-bit/youtube-speaker-separation

Simple Python script that outputs separate audio files for each speaker in a YouTube video, using Whisper on Replicate

speaker-diarization speech-to-text text-to-speech voice-cloning whisper youtube

Last synced: 19 Dec 2024

https://github.com/valkryst/whisper_automations

Various scripts for automating tasks using OpenAI's Whisper.

automation openai subtitle subtitle-generator transcription translation whisper

Last synced: 26 Dec 2024

https://github.com/chloelavrat/speech-to-text-app

Speech-to-text web app based on Streamlit and Whisper that extracts the transcript from audio or a YouTube video.

audio-processing machine-learning machinelearning speech-to-text streamlit streamlit-webapp stt whisper whisper-ai

Last synced: 02 Jan 2025

https://github.com/xawos/owt

🦙🗣️ Ollama and Whisper Telegram bot, with advanced configuration

ai-bots local-ai ollama telegram-aichatbot telegram-bots whisper

Last synced: 08 Jan 2025

https://github.com/ashot72/answering-questions-about-images

You can upload images, ask questions about images using voice prompts, then listen to the responses in voice

answering-questions blip-2-ai-model gtts large-language-models llm replicate speech-to-text text-to-speech whisper

Last synced: 30 Dec 2024

https://github.com/tomdewildt/whisper-experiment

Experiments using the Whisper model from OpenAI

colab jupyter python transcribe transformers translate whisper

Last synced: 27 Dec 2024

https://github.com/zdwolfe/transcription-tools

Docker video transcriber, wrapper around OpenAI

openai transcription whisper whisper-ai

Last synced: 02 Jan 2025

https://github.com/Franky1/AIAudioTranscriber

A minimalistic web app to generate transcriptions for audio, built using Python

openai python streamlit transcription whisper

Last synced: 24 Oct 2024

https://github.com/javi-cc/python-openai-generator-srt

An offline application written in Python that transcribes and translates audio or video files into text to generate a subtitle file (.srt), using deep learning libraries such as openai-whisper and argos-translate.

argos-translate docker docker-compose dockerfile offline openai openai-whisper python whisper

Last synced: 18 Dec 2024

https://github.com/hanpham32/react-native-whisper

A simple text transcription web/mobile app

flask ngrok react-native transcribe whisper

Last synced: 24 Dec 2024

https://github.com/waikato-llm/whisper

Docker images for the whisper audio transcription library and variants.

audio transcription whisper

Last synced: 12 Jan 2025

https://github.com/aixerum/faster-whisper

faster-whisper is a reimplementation of OpenAI's Whisper model using CTranslate2, which is a fast inference engine for Transformer models. This implementation is up to 4 times faster than openai/whisper for the same accuracy while using less memory. The efficiency can be further improved with 8-bit quantization on both CPU and GPU.

ctranslate2 gpu transcription whisper

Last synced: 07 Jan 2025

https://github.com/huuquyet/phowhisper-tiny

Converted clone of PhoWhisper: Automatic Speech Recognition for Vietnamese (2024)

onnx-models phowhisper speech-recognition transformersjs vietnamese vinai whisper

Last synced: 06 Dec 2024

https://github.com/ajxv/rtstt

Real-time speech-to-text transcription using OpenAI Whisper

live-transcription openai openai-whisper python3 transcription whisper

Last synced: 22 Dec 2024

https://github.com/bloodworks-io/phlox

Self-hosted Ollama + Whisper powered AI medical scribe.

medical ollama rag scribe whisper

Last synced: 18 Dec 2024

https://github.com/malexandersalazar/casey

Casey is a Voice-Activated AI Companion for Mental Wellbeing & Content Creation #BuildWithAI

agentic-ai content-creation groq large-language-models python wellbeing whisper

Last synced: 18 Dec 2024

https://github.com/suchith-2002/whisperwave

Transcribe any Audio to Text.

openai whisper

Last synced: 08 Dec 2024