An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with transcribe

A curated list of projects in awesome lists tagged with transcribe .

https://github.com/m1guelpf/yt-whisper

Using OpenAI's Whisper to automatically generate YouTube subtitles

ffmpeg openai openai-whisper subtitles subtitles-generated transcribe whisper youtube youtube-dl

Last synced: 16 May 2025

https://github.com/innovatorved/whisper.api

This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR model.

asr hacktoberfest innovatorved transcribe whisper

Last synced: 04 Apr 2025

https://github.com/azkadev/whisper

Whisper Dart is a cross platform library for dart and flutter that allows converting audio to text / speech to text / inference from Open AI models

ai android dart flutter ggml indonesia ios linux macos openai speech speech-recognition speech-synthesis speech-to-text transcribe transformer whisper whisper-dart whisper-flutter windows

Last synced: 15 May 2025

https://github.com/zh-plus/openlrc

Transcribe and translate voice into LRC file using Whisper and LLMs (GPT, Claude, et,al). 使用whisper和LLM(GPT,Claude等)来转录、翻译你的音频为字幕文件。

auto-subtitle faster-whisper lyrics lyrics-generator openai-api openlrc python speech-to-text subtitle-translation transcribe voice-to-text whisper

Last synced: 06 Oct 2025

https://github.com/RayFernando1337/MLX-Auto-Subtitled-Video-Generator

Generate accurate transcripts using Apple's MLX framework

apple mlx transcribe translate whisper

Last synced: 30 Aug 2025

https://github.com/rayfernando1337/mlx-auto-subtitled-video-generator

Generate accurate transcripts using Apple's MLX framework

apple mlx transcribe translate whisper

Last synced: 16 May 2025

https://github.com/nikdanilov/whisper-obsidian-plugin

Speech-to-text in Obsidian using OpenAI Whisper

obsidian openai-whisper speech-to-text stt transcribe voice whisper

Last synced: 30 Jul 2025

https://github.com/wendy7756/AI-Video-Transcriber

Transcribe and summarize video content using AI. Open-source, multi-platform, and supports multiple languages.

aitool tiktok transcribe videototext youtube

Last synced: 15 Sep 2025

https://github.com/simalexan/s3-lambda-transcribe-audio-to-text-s3

Transcribe your audio to text with this serverless component

audio lambda s3 serverless speech-to-text transcribe transcribe-audio-files

Last synced: 16 Mar 2025

https://github.com/mharrvic/fast-audio-video-transcribe-with-whisper-and-modal

Fast Audio/Video transcribe using Openai's Whisper and Modal, an hour audio/video file can be transcribed in ~1 minute

fastapi modal openai python transcribe whisper

Last synced: 13 Apr 2025

https://github.com/stangirard/quivr-whisper

Talk to your second brain personal assistant using speech 🧠

assistant gpts openai personal quivr speech transcribe tts whisper

Last synced: 21 Jul 2025

https://github.com/bbc-esq/ctranslate2-faster-whisper-transcriber

Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.

audio-recorder audio-transcribing audio-transcription ctranslate2 faster-whisper transcribe transcriber

Last synced: 01 May 2025

https://github.com/BBC-Esq/ctranslate2-faster-whisper-transcriber

Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.

audio-recorder audio-transcribing audio-transcription ctranslate2 faster-whisper transcribe transcriber

Last synced: 06 Mar 2025

https://github.com/lukasbach/pensieve

Desktop app for recording meetings from locally running apps and transcribing and summarizing them with a local LLM

audio llm meeting mp3 note notes notetaker record recording summarize transcribe transcript

Last synced: 05 Apr 2025

https://github.com/gusper/SongTxt-vscode

Visual Studio Code extension that adds support for editing text files for songs including lyrics, chords, guitar tablature, etc.

chords guitar guitartabs lyrics music songs tabs transcribe ultimate-guitar visualstudio vscode

Last synced: 24 Jul 2025

https://github.com/deepsingh132/aionair

A cutting-edge AI SaaS platform that enables users to create, discover, and enjoy podcasts with advanced features like text-to-audio conversion with multi-voice AI, podcast thumbnail image generation, and seamless playback. The platform is built using Next.js, TypeScript, Convex, OpenAI, Stripe, Clerk, ShadCN, and Tailwind CSS.

ai clerk convex music nextjs openai player podcast react saas saas-application shadcn-ui stripe subscriptions tailwind tailwindcss transcribe transcription typescript zod

Last synced: 17 Apr 2025

https://github.com/dabit3/amplify-ml-ai-predictions-example

This is a general overview of the Predictions category of Amplify. It shows examples of Machine Learning and AI service integration in a React app with AWS Amplify Predictions category

ai aws aws-amplify machine-learning polly rekognition serverless transcribe

Last synced: 17 Nov 2025

https://github.com/general-developer/whisper_library

Whisper Is Library for transcribe sound wav AKA Speech To Text Or Extract Text From Audio

ai artificial-intelligence dart flutter ggml machine-learning ml openai speech-to-text transcribe translate whisper

Last synced: 05 Apr 2025

https://github.com/tikene/video-caption-and-translate

Video URL transcriber and translator using AI. Download from Youtube and translate automatically by adding subtitles to the video

ai captions chatgpt downloader openai subtitle subtitles transcribe transcriber translation translator url videos whisper youtube

Last synced: 11 Apr 2025

https://github.com/luquedaniel/whisper2subs

A CLI tool that transcribes audio using openai-whisper and translates it using DeepL.

audio cli deepl subtitle transcribe translate video weekend-project whisper

Last synced: 26 Oct 2025

https://github.com/loglux/flexaudioprint

FlexAudioPrint is a Python-based app for transcribing audio to text using OpenAI's Whisper model. It offers a Gradio web interface and a script for programmatic use. With FFmpeg for audio conversion, it supports multiple formats like MP3 and WAV. Ideal for transcribing meetings, lectures, and podcasts, with options to save results as text file

ai artificial-intelligence audio-processing gradio openai-whisper transcribe transcribe-audio-files

Last synced: 02 Sep 2025

https://github.com/jeanjerome/echoinstone

EchoInStone is an audio processing tool that transcribes, diarizes, and aligns speaker segments from audio files, prioritizing accuracy and reliability.

alignment diarization localhost pyannote python transcribe whisper

Last synced: 10 Apr 2025

https://github.com/jxxe/murmur

A proof-of-concept transcription app

journalism mac macos transcribe transcription whisper

Last synced: 28 Aug 2025

https://github.com/7ds7/videojs-vjstranscribe

Creates searchable transcripts from text tracks

transcribe transcript videojs videojs-player videojs-plugin

Last synced: 19 Aug 2025

https://github.com/ruankie/vid-qa

An app that summarises and answers questions about arbitrary YouTube videos using LangChain and LLMs

app autogpt chatgpt embedding gpt langchain large-language-models openai qa streamlit summary tldr transcribe youtube youtube-summary

Last synced: 11 Mar 2025

https://github.com/ave-sergeev/dictator

Speech-to-Text translation service (Rust, Tonic) (2025)

audio rust silero tonic transcribe voice-activity-detection vosk

Last synced: 03 Apr 2025

https://github.com/assemblyai/assemblyai-semantic-kernel

Transcribe audio using AssemblyAI with Semantic Kernel plugins.

ai assemblyai llm semantic-kernel transcribe

Last synced: 16 Mar 2025

https://github.com/AssemblyAI/assemblyai-semantic-kernel

Transcribe audio using AssemblyAI with Semantic Kernel plugins.

ai assemblyai llm semantic-kernel transcribe

Last synced: 03 Apr 2025

https://github.com/tuzibr/Real_time_caption_translate

A real-time caption translation tool based on VOSK speech recognition and machine translation, which supports transcribing audio into target language subtitles in real time and displaying the translated content.

captions microphone real-time speaker speech-recognition subtitles transcribe translate

Last synced: 25 Mar 2025

https://github.com/ayushsoni1010/textify

🎙️Seamlessly transcribing the world, one spoken word at a time, in any language you desire.

ai audio nextjs openai openai-api radix-ui shadcn-ui speech-to-text tailwind-css tailwindcss transcribe translation typescript video whisper-api

Last synced: 04 Oct 2025

https://github.com/abdtriedcoding/notesgpt

NotesGPT 📋✨ seamlessly converts your voice notes into organized summaries and clear action items using AI 🤖.

clerkauth convex gemini next14 nextjs reactjs shadcn-ui tailwindcss transcribe typescript voice-recognition voice-to-text-transcription voice-transcription

Last synced: 08 Jul 2025

https://github.com/donny-hikari/realtime-transcribe

Transcribe your speech or the audio playing on your computer with Whisper in realtime, and show the captions on your screen.

ai machine-learning speech-to-text transcribe transcription

Last synced: 02 Dec 2025

https://github.com/mrbelka12000/speak_freely

Сервис для практики разговора на иностранных языках

chatgpt golang minio postgresql redis transactions transcribe

Last synced: 19 Aug 2025

https://github.com/build-on-aws/aiml-like-api-in-your-app

Sample code for adding AI/ML services to your app

aws polly rekognition textract transcribe

Last synced: 09 Jul 2025

https://github.com/benderscript/vidbot

This program will create a Video RAG Chatbot from a video you upload or a youtube URL.

chatbot mp3 openai transcribe transcription video

Last synced: 05 Oct 2025

https://github.com/gangula-karthik/memo-mate

🚀 Discord meetings redefined with Memo Mate: Transcribe, summarize, and automate minutes seamlessly! ✨

discord-bot huggingface mistral py-cord speech-to-text transcribe whisper

Last synced: 09 Apr 2025

https://github.com/lukeocodes/supported-formats

A package to test if your media file is supported for transcription.

ai audio deepgram javascript node srt subtitles transcribe transcript typescript video webvtt

Last synced: 31 Dec 2025

https://github.com/andreabak/whispersubs

Generate subtitles for your video or audio files using the power of AI

ai cuda deep-learning gpu-acceleration machine-learning srt subtitles transcribe transcription translate whisper

Last synced: 14 Apr 2025

https://github.com/hosuaby/transcriptionist

Tool to transcribe videos using AI.

ai captions deepgram transcribe video-processing

Last synced: 22 Jun 2025

https://github.com/flyingfathead/youwhisper-cli

A streamlined CLI tool combining `yt-dlp` and `whisperx` (or `openai-whisper`) for quick and efficient audio transcription from various video platforms.

cli cli-app python transcribe transcriber transcription whisper whisper-ai whisperx youtube-downloader yt-dlp yt-dlp-wrapper

Last synced: 04 Oct 2025

https://github.com/notyusheng/transcribe-translate

Local web app for transcription and translation services for audio and video using Whisper models

docker full-stack nodejs react reactjs self-hosted speech-to-text transcribe translate whisper

Last synced: 26 Oct 2025

https://github.com/roman01la/sub-deep

Transcribe and translate audio with AI

deepl transcribe translate whisper

Last synced: 20 Feb 2025

https://github.com/mikeesto/subber

A small CLI tool for converting video & audio to a text transcription

cli ffmpeg golang transcribe whisper whispercpp

Last synced: 06 Apr 2025

https://github.com/mihirkudale/end-to-end-youtube-video-transcribe-summarizer-llm-app-with-google-gemini-pro

This repository contains an end-to-end application that leverages Google Gemini Pro's language models and Streamlit to transcribe and summarize YouTube videos.

google-gemini-pro llm python streamlit-webapp summarizer transcribe youtube-video

Last synced: 20 Jun 2025

https://github.com/antoniosbarotsis/telegram-transcriber

A Telegram bot for transcribing voice messages

telegram transcribe voice whisper

Last synced: 16 May 2025

https://github.com/andrewwango/aws-custom-calls

An AWS serverless AI services pipeline for analysing call center calls, with custom analytics built around AWS Transcribe and AWS Comprehend and insights published on AWS Quicksight.

aws call-center-analytics comprehend lambda machine-learning natural-language-processing sentiment-analysis serverless transcribe

Last synced: 28 Mar 2025

https://github.com/muhammadadilnaeem/youtube-video-sentiment-and-summarization

The YouTube Video Sentiment and Summarization project is a comprehensive tool designed to analyze YouTube videos by transcribing their content, summarizing it, and performing sentiment analysis on the comments. This project leverages advanced machine learning models and APIs to provide insightful data.

gemini-api github google markdown project-repository python sentiment-analysis streamlit summarization transcribe youtube youtube-api-v3

Last synced: 07 Jan 2026

https://github.com/bbc-esq/whisper-solo-with-gui

OpenAI's Whisper program with a simple lightweight GUI.

pyqt pyqt6 pyqt6-gui transcribe transcribe-audio-files translate whisper

Last synced: 28 Feb 2025

https://github.com/symblai/connect-symbl-to-zoom-without-ui

Open a terminal. Connect Symbl to your Zoom call through the Command Line Interface without a UI. Add Symbl.ai to Five9 / Zoom calls

bash cli command-line-interface command-line-tool five9 javascript shebang symbl transcribe transcription zoom

Last synced: 18 Mar 2025

https://github.com/michael-ortiz/terraform-aws-s3-audio-pii-guardian

🕵️‍♂️ Personally Identifiable Information (PII) Detection and Redaction for Voice Audio Files Stored in S3 and AWS Transcribe

audio-to-text aws aws-transcribe ffmpeg lambda personal-identifiable-information pii pii-detection pii-detector redaction s3 terraform transcribe typescript

Last synced: 15 Jul 2025

https://github.com/soumyagautam/sentiment-speech-classification

This api provides various functions which helps users to transcribe the audio and check the sentiment of each line of the transcription. This api uses Assembly AI at the backend.

api sentiment sentiment-analysis sentiment-classification speech speech-recognition transcribe

Last synced: 20 Feb 2025

https://github.com/symblai/real-time-sentiment-analysis-with-websockets

Add Real-Time Sentiment Analysis to your WebSockets. Transcribe `.onmessage` events through live, automated speech recognition with JavaScripts, Symbl.ai's APIs and your streams.

apis automated-speech-recognition javascript onmessage real-time realtime rest sentiment sentiment-analysis symbl transcribe transcription websocket websockets

Last synced: 18 Mar 2025

https://github.com/amingheibi/AWS_transcribe_client

To send and receive ASR requests to AWS transcribe

asr aws farsi transcribe

Last synced: 09 Jul 2025

https://github.com/catanduyago/subtitler-live

Aplicación para transcribir a texto el audio recibido al iniciar un sistema de compartir pantalla.

amazon-transcribe audio-to-text audio-transcribing live live-subtitles subtitles transcribe

Last synced: 06 Oct 2025

https://github.com/woheller69/whispergui

GUI for OpenAI Whisper for local transcription

notes-tool transcribe

Last synced: 08 Oct 2025

https://github.com/nbhirud/pypodcasts

A simple podcasts player/viewer being developed using python. The goal is to have a modern yet simple podcasts functionality. Should retrive RSS based on xml, save them, play audio, display description, audio transcript, transcript summary, display image from show/episode, and display a wordcloud based on transcript to give a quick idea.

audio-player ollama podcast python rss rss-feed rss-reader sqlite3 summarize transcribe whisper-ai wordcloud xml xml-parser

Last synced: 02 Jul 2025

https://github.com/aznironman/transcriptgen

TranscriptGen is an application for transcribing audio and video files. Transcription output is .txt or .srt. Most audio and video formats supported (with ffmpeg).

audio-to-text audio-transcription srt srt-files srt-subtitle srt-subtitle-format srt-subtitles subtitle subtitles transcribe transcription video-audio-to-text video-transcription whisper

Last synced: 30 Jun 2025

https://github.com/brendancsmith/subtitler

A set of tools for transcribing video files and adding subtitles using ffmpeg and the whisper library.

audio-processing openai subtitles transcribe transcript transcription whisper

Last synced: 10 Jun 2025

https://github.com/syedahmedullah14/ai-on-air

A cutting-edge AI SaaS platform that enables users to create, discover, and enjoy podcasts with advanced features like text-to-audio conversion with multi-voice AI, podcast thumbnail image generation, and seamless playback. The platform is built using Next.js, TypeScript, Convex, OpenAI, Stripe, Clerk, ShadCN, and Tailwind CSS.

ai clerk convex javascript jwt nexys4ddr openai openai-api reactjs saas saas-application shadcn-ui subscriptions tailwindcss transcribe transcription typescript zod

Last synced: 11 Mar 2025

https://github.com/paladini/echo-transcribe

An open-source desktop application for audio transcription using local AI. Private, secure and efficient.

ai free open-source speach-to-text srt srt-subtitles transcribe transcriber whisper whisper-ai

Last synced: 04 Sep 2025

https://github.com/ophickedo/telegram-voice-qna-llm

Voice message to text (Whisper) + local LLM answers (Mistral-7B). Works offline, 100+ languages. Perfect for interviews & automated Q&A.

ai audio-processing bot chatbot llm mistral offline openai python qna speach-recognition speach-to-text transcribe voice-recognition whisper whisper-ai

Last synced: 21 Jul 2025

https://github.com/tanmay-chandgude/transcripto

Blog Dashboard for seamless text and video content input, transcription, translation, and publishing in multiple languages. Features include AI-powered transcription, translation into multiple languages, dynamic SEO-optimized blog publishing, and server-side rendering for fast performance.

blogs gemini-ai kinde-auth nextjs reactjs shadcn-ui supabase-db transcribe translation typescript

Last synced: 02 Apr 2025

https://github.com/fkiller/whispertranscript

Transcribe voice from mic input using OpenAI Whisper API.

llm openai transcribe transcript transcription webaudio whisper

Last synced: 24 Feb 2025

https://github.com/iann0036/custom-vocab-builder

Construct Amazon Transcribe Custom Vocabulary lists, including IPA support

aws custom-vocabulary polly transcribe

Last synced: 29 Oct 2025

https://github.com/tomdewildt/whisper-experiment

Experiments using the Whisper model from Open AI

colab jupyter python transcribe transformers translate whisper

Last synced: 06 Nov 2025

https://github.com/miozilla/ct3p

ct3p :leaves::sheep: : AI Global Consulting Service # Amazon Comprehend # Textract # Translate # Transcribe # Polly # SageMaker AI # S3

ai amazon audio boto3 comprehend polly sagemaker speech text textract transcribe translate

Last synced: 29 Aug 2025

https://github.com/aathifzahir/whisprsplit

A powerful, local speech-to-text transcription system that combines OpenAI's Whisper for accurate transcription with pyannote.audio for speaker diarization (identifying who spoke when). Perfect for meetings, interviews, podcasts, and any audio/video content that needs accurate transcription with speaker identification.

diarization speaker-recognition speech speech-diarization speech-recognition speech-to-text transcribe transcript transcription

Last synced: 19 Aug 2025

https://github.com/mateusjssilva/dna-transcription

Java implementation of the program that transcribes a DNA strand.

bioinformatics dna java transcribe

Last synced: 04 Sep 2025

https://github.com/keatonkirk55cfc/audio-to-text

🎧 audio-to-text transcribes audio files to text using the Web Speech API in a headless browser via Puppeteer, supporting ffmpeg formats and PulseAudio on Linux.

ai audio-generation audio-processing generative-ai jupyter-notebook language large-language-models latent-diffusion linux macos open-source openai openai-whisper speech text-to-speech transcribe translation whisper

Last synced: 19 Aug 2025

https://github.com/avneeshchaudhary/speech2textweb

Simple transcription Web Application

speech-to-text transcribe

Last synced: 31 Jul 2025

https://github.com/mikedidomizio/transcriber-summarizer

Takes audio, determines speakers and outputs summary of discussion using AWS Transcribe & OpenAI

chatgpt openai-api remix-run transcribe

Last synced: 12 Oct 2025

https://github.com/alchemist-aloha/explicitutil

A utility library for managing media files, especially focused on conversion, organization, and archival.

batch-processing media-management namer nfo-file rename-files transcribe whisper-cpp

Last synced: 07 Apr 2025

https://github.com/alchemist-aloha/explicit_util

A utility library for managing media files, especially focused on conversion, organization, and archival.

batch-processing media-management namer nfo-file rename-files transcribe whisper-cpp

Last synced: 30 Mar 2025

https://github.com/dharun-416/vibe

Vibe is an AI-powered desktop browser that enhances your web experience with intelligent features. Explore the future of browsing and join our community on GitHub! 🐙✨

async concurrency cross-platform cvpr20 cvpr2020 design-system desktop human-pose-estimation mcp mcp-server monday openai pytorch transcribe ui-components vibe vibe-coding video-pose-estimation

Last synced: 05 Sep 2025

https://github.com/hanpham32/react-native-whisper

A simple OpenAI Whisper transcription React Native app

flask ngrok react-native transcribe whisper

Last synced: 11 Apr 2025

https://github.com/veralvx/trainscribe

A command-line tool for transcribing audio files in a folder to a metadata.csv file, using OpenAI's Whisper.

audio-processing audio-transcribing audio-transcription ljspeech ljspeech-format openai-whisper training transcribe transcribe-audio-files transcriber transcription whisper

Last synced: 18 Nov 2025

https://github.com/davidpacascdb/transcribe-meetings

InsightLens transforms business meetings into actionable, searchable knowledge. It ingests meeting recordings, slides, and chat logs, then uses advanced AI to transcribe.

cognitive-services deep-learning gpt-api hobby-project llm meeting mp3 note notetaker ollama record slack teams-bot teams-meeting-app teams-side-panel transcribe transcript transcription

Last synced: 24 Aug 2025

https://github.com/social-ali/scribe2translate

Scribe2Translate is a modern web application built with React.js and Vite, designed to facilitate the process of recording audio, transcribing it into text, and translating it into multiple languages.

reactjs records scribe transcribe translate

Last synced: 16 Jun 2025

https://github.com/bra1ndump/llm-batch-image-transcription

Batch-transcribe images using AI vision models like OpenAI and Google Gemini

batch deno gemani llm openai transcribe utility vision

Last synced: 09 Mar 2025

https://github.com/xhafievps/meeting-summarizer

Record, transcribe, and summarize meetings effortlessly with the AI Meeting Summarizer. Upload audio and get structured summaries using OpenAI or Google Gemini. 🐙💻

abstractive-summarization audio bert flask llm machine-learning meeting-minutes meeting-summarization mp3 notes notetaker python quiz-generator record summarization tensorflow texar transcribe

Last synced: 22 Aug 2025

https://github.com/alchemist-aloha/whisper.cpp_batch_subtitle

Powershell script to batch transcribe videos to subtitles with ffmpeg and whisper.cpp.

ffmpeg subtitles transcribe whisper-cpp

Last synced: 07 Apr 2025

https://github.com/notyusheng/transcribe-translate_kubernetes

Local web app for transcription and translation services for audio and video using Whisper models

docker full-stack k8s kubernetes nodejs react reactjs self-hosted speech-to-text transcribe translate whisper

Last synced: 31 Dec 2025