Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Whisper

Whisper is an autoregressive language model developed by OpenAI. It is trained on a large corpus of text using a transformer architecture and is capable of generating high-quality natural language text. Whisper can be used for tasks such as language modeling, text completion, and text generation. It has shown impressive performance on various benchmarks and has been released by OpenAI to encourage research in the field of language modeling. Whisper is not yet available for public use, but it has the potential to transform the field of natural language processing and generate new opportunities for language-based applications.

https://github.com/andreabak/whispersubs

Generate subtitles for your video or audio files using the power of AI

ai cuda deep-learning gpu-acceleration machine-learning srt subtitles transcribe transcription translate whisper

Last synced: 16 Nov 2024

https://github.com/romiconez/konspecto-llm

LLM agent that provides tools for convenient work with personal documents using voice or text.

agent backend docker docx frontend google langchain llamaindex llm managment nlp rag whisper

Last synced: 05 Jan 2025

https://github.com/firefly55lm/bisbigliatorev2

Automatic audio transcriber notebook based on Whisper

colab-notebook speech-to-text whisper

Last synced: 25 Jan 2025

https://github.com/otonomee/mic2transcript

CLI tool that continuously transcribes audio from the device's built-in microphone to a text file. Runs in the background, providing an ongoing log of ambient audio as text.

audio cli cli-tool openai speech speech-transcription transcription whisper

Last synced: 09 Oct 2024

https://github.com/flyingfathead/youwhisper-cli

A streamlined CLI tool combining `yt-dlp` and `whisperx` (or `openai-whisper`) for quick and efficient audio transcription from various video platforms.

cli cli-app python transcribe transcriber transcription whisper whisper-ai whisperx youtube-downloader yt-dlp yt-dlp-wrapper

Last synced: 11 Jan 2025

https://github.com/fer14/videoseek

Intelligent video search tool powered by AI

bert timestamp video whisper youtube-api

Last synced: 14 Jan 2025

https://github.com/datarabbit-ai/transcription_service

System/service with REST API for extracting text transcriptions from movies and audio recordings in most popular video formats.

containers datarabbit rest-api speech-to-text stt transcription transcription-services whisper

Last synced: 09 Oct 2024

https://github.com/tensoraws/yuisub

Auto translation of new anime episodes based on Yui-MHCP001

anime chatgpt llm openai pysubs2 subtitle translation whisper

Last synced: 09 Oct 2024

https://github.com/alancunningham/chatgpt-assistant

A ChatGPT assistant with voice activation and image generation, connected to a Raspberry Pi display.

chatgpt chatgpt-api dall-e dall-e-api porcupine python raspberry-pi whisper

Last synced: 06 Jan 2025

https://github.com/t-h-chung/note-taker

Note-taking app for online/local video/audio using Whisper transcription, ChatGPT, and Notion

chatgpt notes notion transcription whisper youtube

Last synced: 09 Oct 2024

https://github.com/saadkh1/docqa-textsummarization-app

A Streamlit app for document question answering and text summarization.

langchain llama-2 llamacpp pytesseract question-answering streamlit summarization whisper

Last synced: 07 Jan 2025

https://github.com/daisyyedda/whisper-large-v2-atcosim_corpus

A fine-tuned Whisper model (whisper-large-v2) for aviation audio transcription. WER < 5%.

asr-model nlp whisper whisper-ai

Last synced: 09 Oct 2024

https://github.com/i4ds/whisper-prep

Data preparation utility for the finetuning of OpenAI's Whisper model.

fine-tuning nlp speech-to-text whisper

Last synced: 09 Nov 2024

https://github.com/upes-open/osoc-24-the-content-forge

The Content Hub Is a online platform which acts as a all in one solution helping content creators develop and generate short form video image content utilising genai models and cloud to maximize their efficiency and benefit from the ever-growing developments in ai models

aws docker fastapi genai microservices nodejs react whisper

Last synced: 09 Oct 2024

https://github.com/sonhm3029/realtime-vietnamese-asr-react-native-and-whisper

This project implement end to end realtime vietnamese speech recognition with PhoWhisper in Backend and frontend in React Native

asr phowhiper react-native realtime realtime-speech-recognition speech-recognition speech-to-text vietnamese whisper

Last synced: 16 Nov 2024

https://github.com/bharathajjarapu/voicecipher

Local Speech transcription

transformerjs whisper

Last synced: 09 Oct 2024

https://github.com/ksylvest/omniai-openai

An implementation of the OmniAI interface for OpenAI.

chatgpt omniai openai ruby whisper

Last synced: 10 Jan 2025

https://github.com/kazkozdev/video-analyser

⚡ The YouTube Video Analyzer Pro brings AI-powered analysis capabilities to your fingertips, offering deep insights for content creators and marketers.

ai content-analytics fastapi llama3 llm ollama-api python3 video-analysis video-analysis-client whisper youtube youtube-analytics youtube-api youtube-subscribers

Last synced: 13 Jan 2025

https://github.com/ndjenkins85/afkode

Personal voice command interface for iPhone on pythonista powered by Whisper and ChatGPT.

chatgpt openai python-packaging quick-start whisper

Last synced: 12 Oct 2024

https://github.com/vimwei/whispertranscriber

Whisper Transcribe and srt Resegment

speech-to-text subtitle whisper

Last synced: 17 Oct 2024

https://github.com/knot-inc/john

John is a web app that records video, analyzes audio with AI, and identifies the speaker's native language from their English accent, simplifying language assessment.

audio-analysis machine-learning whisper

Last synced: 17 Nov 2024

https://github.com/abhishtagatya/polly

☎️ Language Learning Chatbot

chatbot chatgpt python telegram whisper

Last synced: 17 Nov 2024

https://github.com/jemtaly/whispering

A real-time transcription and translation tool implemented in Python based on the fast-whisper library.

live-caption python real-time-transcription real-time-translation tkinter transcription translation whisper

Last synced: 09 Jan 2025

https://github.com/my-north-ai/semantic_audio_filtering

Synthetic data augmentation technique via LLM for Automatic Speech Recognition fine tuning.

automatic-speech-recognition fine-tuning synthetic-dataset-generation text-to-speech whisper

Last synced: 24 Oct 2024

https://github.com/TheGuysBrushes/Whisper

Secured chat application

android chat socket whisper

Last synced: 24 Oct 2024

https://github.com/williamwa/mssmith

A Telegram bot that utilizes the ChatGPT API and can communicate through voice.

chatpgt-api telegram-bot tts whisper

Last synced: 31 Dec 2024

https://github.com/szilvia-csernus/openai-audio-api-calls

Speech-to-text and text-to-speech API call examples, using OpenAI's whisper-1 and tts-1 models.

jupyter-notebook openai openai-api tts-1 whisper

Last synced: 09 Oct 2024

https://github.com/team-mansumugang/mansumugang-backend

만수무강 서비스의 스프링 부트 어플리케이션입니다.

aws github-actions jpa jpa-hibernate spring-boot whisper

Last synced: 09 Oct 2024

https://github.com/i4ds/whisper-finetune

This repository contains code for fine-tuning the Whisper speech-to-text model.

fine-tuning nlp speech-to-text whisper

Last synced: 09 Oct 2024

https://github.com/shtirmann/v2t

Telegram bot which automatically transcribes all voice and video messages to text.

ai aiogram faster-whisper python telegram-bot telegram-bot-python voice-to-text whisper

Last synced: 09 Oct 2024

https://github.com/aspadax/subtitlegenerator

Automatically generate a subtitle for your video.

gpt machine-learning openai rust streamlit subtitles-generator whisper

Last synced: 09 Oct 2024

https://github.com/roman01la/sub-deep

Transcribe and translate audio with AI

deepl transcribe translate whisper

Last synced: 30 Dec 2024

https://github.com/sumitesh9/localizedwhisper

An initiative to make OpenAI Whisper more localized by adding support for more languages.

albanian albanian-language huggingface openai speech speech-to-text whisper

Last synced: 02 Jan 2025

https://github.com/abdnh/anki-asr

Anki add-on for speech recognition

anki anki-addon deepgram speech-recognition whisper

Last synced: 24 Nov 2024

https://github.com/mickekring/top-of-mind-clara

Clara är en prototyp som möjliggör att anonymt kunna göra sin röst hörd. Medarbetaren kan prata eller skriva in det du vill säga och AI anonymiserar det. Medarbetaren har dessutom tillgång till en chatbot att rådfråga. Därefter analyseras och sammanställs alla medarbetares tankar i en dashboard.

ai chatbot feedback openai python streamlit transcription whisper

Last synced: 22 Dec 2024

https://github.com/adamelkholyy/whisper-yt

Toolkit for using Whisper to transcribe YouTube videos. Includes Whisper transcription of YouTube videos, conversion of YouTube video into HuggingFace dataset (using audio and subtitles) and evaluation of Whisper transcription against YouTube subtitles

asr diarization huggingface-datasets pyannote transcription whisper word-error-rate youtube

Last synced: 10 Dec 2024

https://github.com/nerdimite/meetsy-backend

AI Backend for the Workshop on Building an End-to-End AI Meeting Assistant

gpt-3 nextjs sentence-transformers tailwindcss whisper

Last synced: 24 Oct 2024

https://github.com/huuquyet/phowhisper-next

Demo using PhoWhisper models of VinAI built with Transformers.js + Next.js

nextjs onnx-models phowhisper speech-recognition transformersjs vietnamese vinai whisper

Last synced: 19 Dec 2024

https://github.com/pdcalado/waste

Whisper Audio Service for Transcription and Ergonomics

productivity rofi transcription tts whisper

Last synced: 21 Jan 2025

https://github.com/maawad/luna

Personal assistant

bot openai personal-assistant whisper

Last synced: 17 Dec 2024

https://github.com/winstxnhdw/capgen

A fast CPU-first video/audio transcriber for generating caption files with Whisper and CTranslate2, hosted on Hugging Face Spaces.

asr automatic-speech-recognition caddy ctranslate2 docker fastapi huggingface huggingface-spaces uvicorn-gunicorn whisper

Last synced: 23 Oct 2024

https://github.com/jojasadventure/whisper-client

Very simple Python based client for Whisper compatible endpoint

desktop-app dictation faster-whisper macos productivity python speech-to-text stt whisper

Last synced: 09 Oct 2024

https://github.com/aws-samples/amazon-ivs-webgpu-captions-demo

This repository contains an experimental demo application that shows how you can add client-side auto-generated captions to Amazon IVS Real-time and Low-latency streams using transformers.js and WebGPU.

ai amazon-ivs aws captions experimental ivs-lowlatency ivs-realtime lambda lowlatency lvl-300 realtime serverless transformersjs web webgpu webrtc whisper

Last synced: 09 Oct 2024

https://github.com/chaoticbyte/audio-summarize

An audio summarizer (faster-whisper and BART glued together)

ai ai-summarizer audio bart ctranslate2 faster-whisper nlp speech-to-text summarization whisper

Last synced: 09 Oct 2024

https://github.com/adisol07/sharpspeech

SharpSpeech is free, local and open source way to speech and wake word recognition.

audio speech speech-recognition speech-to-text wake-word-detection wakeword whisper whisper-ai

Last synced: 19 Dec 2024

https://github.com/crone-ai/force-align-wordstamps

Takes audio (mp3) and text input (string) and force aligns the text to the audio. Uses stable-ts and whisperx.

captions faster-whisper force-alignment stable-ts whisper

Last synced: 17 Jan 2025

https://github.com/rhysdg/whisper-onnx-python

A low-footprint GPU accelerated Speech to Text Python package for the Jetpack 5 era bolstered by an optimized graph

ai chatbot cuda machine-learning onnxruntime speech-to-text whisper

Last synced: 09 Oct 2024

https://github.com/becomingbabyman/eunoia-desktop

local desktop transcription and search for apple voice memos and videos

search second-brain transcription videos voice-memos whisper

Last synced: 25 Dec 2024

https://github.com/bigyaa/transcription-system

This versatile tool is designed for anyone in need of a robust solution for transcribing and diarizing large volumes of audio files. Whether you are dealing with terabytes or even larger quantities, our tool ensures efficient and accurate processing. Ideal for researchers, content creators, and businesses.

accessibility diarization speech-to-text storytelling-with-data transcription whisper

Last synced: 19 Dec 2024

https://github.com/xaionaro-go/speech

A Speech-To-Text (with translation) library for Go; currently uses Whisper (runs locally if needed; no need in any API keys)

ai converter go golang library module package speech speech-recognition speech-to-text text whisper

Last synced: 13 Jan 2025

https://github.com/gamut73/quizinator

Generating quizzes, on Android, from YouTube videos.

kotlin-android llm python whisper

Last synced: 19 Dec 2024

https://github.com/wtlow003/auto-subtitles

CLI tool to transcribe (+ translate) videos and embed subtitles automatically.

faster-whisper nllb subtitles subtitles-generator translation whisper whisper-cpp

Last synced: 15 Nov 2024

https://github.com/antoniosbarotsis/telegram-transcriber

A Telegram bot for transcribing voice messages

telegram transcribe voice whisper

Last synced: 26 Dec 2024

https://github.com/stnderror/robotron

🤖 A personal robot assistant for Telegram

assistant bot dall-e gpt-35-turbo openai telegram-bot whisper

Last synced: 25 Jan 2025

https://github.com/slinusc/speaker_identification_evaluation

Evaluating the Effectiveness of Transformer Layers in Wav2Vec 2.0, XLS-R, and Whisper for Speaker Identification Tasks

wav2vec2 whisper xls-r

Last synced: 09 Oct 2024

https://github.com/tposcic/audio-to-srt-transcriber

Audio to srt transcriber in Python using whisper for transcription and Tcl/Tk for GUI

audio python3 srt transcription whisper

Last synced: 05 Jan 2025

https://github.com/h3yn3s/tl-dl

A selfhostable webapp which helps you read those uselessly long (by nature) voice messages with the power of AI.

sveltekit tailwind whisper

Last synced: 24 Oct 2024

https://github.com/aaishikdutta/notebook-lm-podcast-audiogram

a simple project to convert notebook-lm (or any audio in that case) into a podcast audiogram with subtitles powered by openai whisper

audiogram openai podcast remotion whisper

Last synced: 08 Dec 2024

https://github.com/voqal/browser

Natural speech browsing for the software developers of tomorrow

cef jcef openai realtime-api voice voice-assistant voice-browser voice-commands voice-control whisper

Last synced: 20 Oct 2024

https://github.com/thewh1teagle/whisper.zig

Transcribe audio with whisper in zig

asr openai whisper zig

Last synced: 24 Jan 2025

https://github.com/tracywong117/ai-learning-material-from-video

Support subtitling, translating, RAG to generate language learning material from video.

ai auto-subtitle gpt-translate groq groq-api rag subtitles-generator translate whisper

Last synced: 19 Jan 2025

https://github.com/lazauk/aoai-entraidauth-sdkv1

Authenticating with Entra ID (former Azure AD) to access Azure OpenAI models in Python SDK v1.x

ai authentication azure azure-active-directory dall-e embeddings entra-id gpt openai whisper

Last synced: 12 Jan 2025

https://github.com/marty1885/useful-whisper-server

Whisper server based on useful-transformers for the RK3588

npu rk3588 rockchip useful-transformers whisper

Last synced: 05 Dec 2024

https://github.com/marquesafonso/multilang-asr-captioner

A multilingual automatic speech recognition and video captioning tool using faster whisper. Supports real-time translation to english. Runs on consumer grade cpu.

automatic-speech-recognition captioning-videos faster-whisper whisper

Last synced: 24 Oct 2024

https://github.com/bbc-esq/whisper-solo-with-gui

OpenAI's Whisper program with a simple lightweight GUI.

pyqt pyqt6 pyqt6-gui transcribe transcribe-audio-files translate whisper

Last synced: 11 Jan 2025

https://github.com/valiantlynx/custom-whisper-api

This project provides a custom API wrapper for the open-source Whisper model using FastAPI. It allows you to integrate Whisper into your applications for automatic speech recognition (ASR) tasks.

ai docker-compose fastapi python whisper

Last synced: 10 Jan 2025

https://github.com/fukuro-kun/wortweber

Wortweber ist ein sich in der Entwicklung befindendes Open-Source-Projekt, das Echtzeit-Sprachtranskription mit KI-Technologie erforscht. Es dient als Lern- und Experimentierplattform für Spracherkennung in Deutsch und Englisch.

speech-to-text whisper

Last synced: 17 Jan 2025

https://github.com/jowadev/interview

Interview is an interactive application crafted to empower both students and professionals in honing their skills for job interviews.

interview-preparation job-interviews nextjs professional students whisper

Last synced: 14 Dec 2024

https://github.com/ayeshaaaaaaaaa/ai-powered-video-analysis-with-object-detection-and-detailed-scene-narratives

AI-driven video analysis system that extracts and transcribes audio with Whisper, detects objects using YOLO, and generates comprehensive scene descriptions with GPT-2. The project combines transcriptions and object detections to produce detailed, context-aware video narratives.

bart gpt2 video-analysis whisper yolov8

Last synced: 02 Jan 2025

https://github.com/tranbavinhson/eth-decentralized-chat

Decentralized chat app by Ethereum Whisper protocol + Vuejs

ethereum vue vuejs whisper whisper-protocol

Last synced: 26 Dec 2024

https://github.com/breadrock1/audio-to-text

There is simple backend project to use whisper-rs.

actix-web audio-to-text rust swagger-ui whisper

Last synced: 10 Jan 2025

https://github.com/volkansah/text-to-speech-pygui-for-whisper

This is a simple Python-based GUI application that allows users to generate speech from text using the OpenAI API. The application provides a user-friendly interface for inputting text and selecting from different voices to create personalized audio output.

openai openai-api python-gui-tkinter python3 whisper whisper-ai

Last synced: 27 Jan 2025

https://github.com/Op27/meeting_minutes_generator

This Python application automates the process of generating meeting minutes from an audio recording. It uses the Whisper library for transcription and the OpenAI GPT models for summarizing content, then outputs the result in a Word document.

ai audio-processing document-automation meeting-minutes openai python speech-recognition text-summarization transcription whisper

Last synced: 24 Oct 2024

https://github.com/pkarpovich/kira-client

An AI-powered voice automation tool for IoT, integrating voice-triggered commands, OpenAI-driven intent recognition, and HTTP server management for seamless control of smart devices

ai-assistant intent-classification porcupine trigger-word-detection whisper

Last synced: 13 Jan 2025

https://github.com/canaxs/whisper-core

An application where users can make rumor-based news and earn money in return.

mysql panel spring spring-boot whisper

Last synced: 19 Dec 2024

https://github.com/pawelzeja098/whisper-video-transcription

Testing whisper Open-AI to transcribe videos

audio mp3 mp4 transcription video whisper whisper-ai

Last synced: 27 Jan 2025

https://github.com/saamerm/whisperkit-ios15

iOS 15 - On-device Inference of Whisper Speech Recognition Models for Apple Silicon

ios ios15 swiftui whisper whisper-ai

Last synced: 19 Jan 2025

https://github.com/brentwong-kiel1997/brents_ai_language_school

Use AI such as ChatGPT and Whisper to learn foreign languages from YouTube videos

ai chatgpt foreign-language openai openai-api whisper whisper-ai youtube

Last synced: 31 Dec 2024

https://github.com/oov/aviutl_subtitler

AviUtl+拡張編集の環境で Whisper による文字起こしをするためのプラグイン

aviutl aviutl-plugin whisper

Last synced: 19 Dec 2024

https://github.com/nri12/filter_voice

Dự án lọc và tắt tiếng video những từ khóa mong muốn

python tools whisper

Last synced: 19 Dec 2024