Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Whisper

Whisper is an autoregressive language model developed by OpenAI. It is trained on a large corpus of text using a transformer architecture and is capable of generating high-quality natural language text. Whisper can be used for tasks such as language modeling, text completion, and text generation. It has shown impressive performance on various benchmarks and has been released by OpenAI to encourage research in the field of language modeling. Whisper is not yet available for public use, but it has the potential to transform the field of natural language processing and generate new opportunities for language-based applications.

https://github.com/jgw96/speech-to-text-web-toolkit

Making Speech-To-Text on the web easy, both local and in the cloud

ai lit transformersjs webcomponents whisper

Last synced: 06 Dec 2024

https://github.com/evil0ctal/whisper-speech-to-text-api

An open source Speech-to-Text API. The project is based on OpenAI's Whisper model and uses the asynchronous features of FastAPI to efficiently wrap it and support more custom functions.

ai api fastapi openai-whisper speech-to-text speech-to-text-api whisper whisper-ai whisper-api

Last synced: 25 Oct 2024

https://github.com/zdwolfe/transcription-tools

Docker video transcriber, wrapper around OpenAI

openai transcription whisper whisper-ai

Last synced: 02 Jan 2025

https://github.com/RingoMar/whisper-devcontainer

Openai whisper inside of vscode docker devcontainer using example files

ai devcontainer docker openapi python whisper

Last synced: 24 Oct 2024

https://github.com/huuquyet/phowhisper-tiny

Converted clone of PhoWhisper: Automatic Speech Recognition for Vietnamese (2024)

onnx-models phowhisper speech-recognition transformersjs vietnamese vinai whisper

Last synced: 06 Dec 2024

https://github.com/rishabhmathur06/fine-tuning-whisper-small-for-asr-

This repository contains notebook that shows how to fine-tune OpenAI's Whisper model on custom Hindi dataset.

artificial-intelligence asr automatic-speech-recognition fine-tuning openai python whisper whisper-model

Last synced: 19 Dec 2024

https://github.com/akhkim/babel

Real-time Internal Audio Translate and Transcriber that uses Whisper model

ai internal-audio real-time transcription translation whisper

Last synced: 19 Dec 2024

https://github.com/brunogaliati/speech2text-investments

This project automates the download, transcription, and summarization of audio from YouTube videos. Using OpenAI's Whisper model, it converts video content into concise text summaries with an investment analyst's perspective, ideal for professionals needing quick insights.

chatgpt investment openai politics python speech-recognition speech-to-text whisper

Last synced: 19 Dec 2024

https://github.com/danibcorr/university-helper

🧑‍🎓 University Helper streamlines academic and administrative tasks for students, educators, and researchers. It provides tools for managing document metadata, converting PDFs to Markdown, transcribing audio, analyzing grade statistics, and more.

deep-learning documentation-tool metadata ocr open-source pdf python statistics university whisper

Last synced: 19 Dec 2024

https://github.com/deepbiolab/customer-complaint-classification

An GenAI-powered pipeline leveraging Whisper, DALL-E, and GPT to transform customer complaints into actionable insights with automated transcription, visualization, and classification.

azure dalle gpt whisper

Last synced: 23 Nov 2024

https://github.com/heyfoz/python-openai-whisper

This Python script provides a simple interface to transcribe audio files using the OpenAI API's speech-to-text functionality, powered by the Whisper model. The result is returned to the console as text or VTT (WebVTT) format.

ai api audio-transcription openai python speech-to-text whisper

Last synced: 19 Dec 2024

https://github.com/geo-y20/enhanced-learning-experience

IntelliLearn is a FastAPI-based application designed to process and transcribe audio and video files into text using the Whisper model. The application also supports processing PDF files to extract and summarize their content.

chat-application chatgpt educational-project fastapi groq-api huggingface lama llm pdf-files platform python speech-to-text text-summarization transformer whisper word2vec wordembedding

Last synced: 19 Dec 2024

https://github.com/flaviodelgrosso/whisper-transcriber

Use OpenAI's Whisper to transcribe audio files and diariaze speakers of the transcribed text

ai audio-to-text diarization openai torch whisper

Last synced: 19 Dec 2024

https://github.com/jplhughes/whisper_logit_lens

This Alignment Jam Hackathon project explores whether the concept of "logit lens" applies to the encoder and decoder layers in Whisper, an end-to-end speech recognition model.

alignment-jam asr interpretability interpretability-jam logitlens whisper

Last synced: 24 Oct 2024

https://github.com/doctorpok42/pheere

Pheere is a simple virtual assistant

ai chatgpt elevenlabs ts virtual-assistant whisper

Last synced: 10 Jan 2025

https://github.com/MattCode64/Scriba

SCRIBA is a web application that transcribes audio files. It supports .mp3 files and provides the transcription results in a user-friendly interface.

fastapi python speech-to-text whisper

Last synced: 24 Oct 2024

https://github.com/aixerum/faster-whisper

faster-whisper is a reimplementation of OpenAI's Whisper model using CTranslate2, which is a fast inference engine for Transformer models. This implementation is up to 4 times faster than openai/whisper for the same accuracy while using less memory. The efficiency can be further improved with 8-bit quantization on both CPU and GPU.

ctranslate2 gpu transcription whisper

Last synced: 07 Jan 2025

https://github.com/baomeomeo/speech

A Speech-To-Text (with translation) library for Go; currently uses Whisper (runs locally if needed; no need in any API keys)

ai converter go golang library module package speech speech-recognition speech-to-text text whisper

Last synced: 13 Jan 2025

https://github.com/deshwalmahesh/whisper-fastapi-realtime

It is Front + Backend app that uses openai/whisper-large-v3-turbo in your consumer grade system to provide real live audio transcription

audio-transcription fastapi huggingface live pyaudio realtime transcription transformers whisper whisper-large

Last synced: 25 Oct 2024

https://github.com/werserk/techstormhack-1st-place

Решение соревнования ТехШторм от корпорации ТатНефть по анализу активности членов команды на ВКС

pyannote speaker-diarization speech-recognition streamlit whisper

Last synced: 11 Jan 2025

https://github.com/flo-bit/youtube-speaker-separation

simple python script that outputs separate audio files for each speaker in a youtube video, using whisper on replicate

speaker-diarization speech-to-text text-to-speech voice-cloning whisper youtube

Last synced: 19 Dec 2024

https://github.com/ty-martz/audiologic

Python Module to process and predict on music attributes

machine-learning music python whisper

Last synced: 24 Oct 2024

https://github.com/fatma-moanes/voice-assistant

Voice Assistant for FM-Clinic: A multilingual AI-powered voice assistant for booking doctor appointments, leveraging advanced speech-to-text, text-to-speech, and large language models for seamless, natural user interactions.

ai-assistant arabic arabic-nlp aws-polly chatbot gpt groq langchain langsmith llm mongodb multilingual openai speech-recognition speech-to-text streamlit text-to-speech transcription voice-assistant whisper

Last synced: 26 Dec 2024

https://github.com/patryk-ku/sasayaki

A small CLI tool that simplifies and automates the process of installing and using AI models to transcribe and translate videos.

automation cli faster-whisper gemini-api transcription translation whisper whisper-cpp

Last synced: 05 Jan 2025

https://github.com/mai-reborn/mai-offline-transcriber

Offline audio/video transcriber using Whisper, saving to .txt or .srt. Ensures privacy, no external servers used.

asr audio-transcription offline-transcriber pyqt6 python speech-recognition video-transcription whisper

Last synced: 05 Jan 2025

https://github.com/nazago/meeting-minutes-generator

Script which takes a .wav audio file, performs speech-to-text using OpenAI/Whisper, and then, using Llama3, summarization and action point from the transcript generated

langchain-python llm-inference local-inference meeting-minutes ollama speech-to-text summarization whisper

Last synced: 02 Jan 2025

https://github.com/asai95/speech-recognition-api

Simple but extensible API for Speech Recognition.

speech-recognition whisper

Last synced: 02 Jan 2025

https://github.com/MattCode64/Scriba_Front

SCRIBA is a web application that transcribes audio files. It supports .mp3 files and provides the transcription results in a user-friendly interface.

speech-to-text vite vue vuejs whisper

Last synced: 24 Oct 2024

https://github.com/televisionninja/chat

Chat with an AI Vtuber

ai chatbot llama llm tts vtube-studio vtuber whisper

Last synced: 20 Nov 2024

https://github.com/sixiaolong1117/whisperpythonscript

一个简单的 Whisper Python 脚本,可以将媒体文件的音频通过 whisper 识别成文字,并通过 pysrt 保存为字幕。

pysrt python python3 whisper whisper-ai

Last synced: 15 Nov 2024

https://github.com/huuquyet/phowhisper-small

Converted clone of PhoWhisper: Automatic Speech Recognition for Vietnamese (2024)

onnx-models phowhisper speech-recognition transformersjs vietnamese vinai whisper

Last synced: 06 Dec 2024

https://github.com/ivanrj7j/transcription

This project transcribes audio using whisper and provides an api

ai api flask transcription whisper

Last synced: 09 Oct 2024

https://github.com/nicknaskida/cog-whisper-diarization

Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote

diarization openai-whisper pyannote replicate speaker-diarization whisper whisper-faster whisperx

Last synced: 27 Sep 2024

https://github.com/ifeech/subtitler

Creating subtitles from video

subtitles whisper

Last synced: 09 Oct 2024

https://github.com/escarrie/transcriptaudio

This is a script that can be used to transcript audio file into text file using Whisper AI

ai transcription whisper

Last synced: 17 Nov 2024

https://github.com/meain/raus

Record audio until silence (RAUS)

audio hammerspoon transcription whisper whisper-cpp

Last synced: 17 Nov 2024

https://github.com/kristofferv98/whisper_turboapi

An optimized FastAPI server for OpenAI's Whisper whisper-large-v3-turbo model using MLX turbo optimization

ai api asynchronous audio audio-processing fastapi huggingface machine-learning macos mlx model-serving nlp openai optimization python speech-to-text synchronous transcription whisper whisper-turbo

Last synced: 14 Dec 2024

https://github.com/dtbuchholz/yt-timestamps-subtitles

Generate YouTube timestamps and subtitles from a video file with OpenAI Whisper and GPT-4

gpt-4 subtitles timestamp whisper youtube

Last synced: 15 Dec 2024

https://github.com/ts-azure-services/batch-transcription-examples

A repo to archive some code related to batch transcription for animation movies.

batch-transcription speech-to-text whisper

Last synced: 30 Nov 2024

https://github.com/deshwalmahesh/interview-help-cheat-live

As the name suggests, it helps you cheat in your live interviews or video calls. It transcribes your audio and provides answers to your query in real time. Supports equation rendering, custom prompts, text selection and editing. It's basically chatGPT for cheating in interviews

audio-transcription chatgpt fastapi huggingface interview interviews live openai pyaudio realtime transcription transformers whisper whisper-large

Last synced: 31 Dec 2024

https://github.com/sivakumar-mahalingam/subtitle-generator

🎞️ Automatically generating subtitles for video files using Whisper ASR model in Python

ai audio-model audio-processing automatic-speech-recognition openai-whisper python speech-recognition speech-to-text subtitle-generator whisper

Last synced: 09 Oct 2024

https://github.com/chinese-soup/cbot-telegram-whisper

Simple bot that transcribes Telegram voice messages. Powered by go-telegram-bot-api & whisper.cpp Go bindings.

bot cpu-inference golang openai speech-recognition speech-to-text whisper whisper-cpp whispercpp

Last synced: 16 Nov 2024

https://github.com/simongino/whisper-fastapi

A FastAPI-based application integrating Whisper for efficient speech recognition and processing.

ai docker fastapi python whisper

Last synced: 09 Oct 2024

https://github.com/tristan-mcinnis/simultaneous-interpretation

Simultaneous-Interpretation is an advanced tool for real-time simultaneous interpretation. It transcribes and translates spoken language from a microphone input instantaneously, continually refining translations for accuracy. Ideal for business meetings, educational settings, and live events, it enhances multilingual communication effortlessly.

agents asr faster-whisper openai pyaudio simultaneous-intepreting simultaneous-translation speech-recognition speech-to-text transcription translation whisper

Last synced: 16 Nov 2024

https://github.com/LarissaGuder/whisper-datastream

Transcription and NER in streaming environment

bert-ner python spark-streaming whisper

Last synced: 24 Oct 2024

https://github.com/datvm/openaiwhisperclient

A HTML page for using OpenAI Whisper API for transcripting, including making subtitles. JSON is also supported.

client-side openai subtitle timestamp transcript transcription whisper whisper-ai

Last synced: 15 Dec 2024

https://github.com/stefanangelovski/voice_to_tweet

Tweet with your Voice using Whisper STT from OpenAI and Twitter4J flow to connect and talk with any account.

ai frontend openai twitter website whisper x

Last synced: 15 Dec 2024

https://github.com/levysantiago/upload-ai

Este é um sistema que utiliza Whisper e ChatGPT da OpenAI para gerar títulos e descrições a partir da análise de vídeos submetidos.

ai artificial-intelligence axios chatgpt fastify ffmpeg nlw-13 node openai prisma react rocketseat tailwindcss typescript vite whisper zod

Last synced: 12 Jan 2025

https://github.com/luluw8071/whisper-tune

Finetuning Whisper on your own voice

whisper

Last synced: 14 Dec 2024

https://github.com/devgeekm/chat-it-up

Chat It Up! elevates conversations by transforming YouTube URLs, documents, and audio into text, enabling interactive Q&A and summaries. With one click, turn media into time-saving, knowledge-rich dialogues.

ai azure azure-functions azureservices blob-storage fastapi python rag whisper youtube-dl

Last synced: 20 Dec 2024

https://github.com/bluebirdback/groq-subtitles

Batch video subtitle generation using Groq Whisper API

groq speech-to-text subtitles video whisper

Last synced: 21 Dec 2024

https://github.com/jalvarezz13/summarai

SummarAI utilizes PyMovie and Whisper to transcribe videos, enabling you to ask questions about the content using Llama2 and Llama-index for insightful interaction.

llama-index llama2 pymovie whisper

Last synced: 22 Dec 2024

https://github.com/dheison0/subcreator

A subtitle creator, translator and embeder tool made using AI

ai machine-learning ml python subtitles video-processing whisper

Last synced: 09 Oct 2024

https://github.com/homelab-00/longformstt

A python script that utilizes faster-whisper and pytorch for long form transcription. Uses silence detection with RMS/peak value. Has global hotkeys for easy use.

faster-whisper python speech-to-text whisper

Last synced: 09 Jan 2025

https://github.com/orhancavus/transcribe_video

Extract Subtitles from YouTube Videos with OpenAI Whisper and Insanely Fast Whisper

insanely-fast speach-to-text whisper

Last synced: 09 Jan 2025

https://github.com/waikato-llm/whisper

Docker images for the whisper audio transcription library and variants.

audio transcription whisper

Last synced: 12 Jan 2025

https://github.com/darienmt/radio-listener

Speech Recognition applied to transcribe amateur radio traffic experiments

python3 radio-amateurs speach-to-text speech-recognition whisper

Last synced: 21 Nov 2024

https://github.com/mottla/speech-to-text

Local and fast speech to text (STT) with speaker recognition. Transcibe your meetings confidentially.

huggingface speech-recognition stt teams transcription translation whisper zoom

Last synced: 21 Nov 2024

https://github.com/xi-rick/captains-log

Captain's Log is your personal AI-powered voice transcription logbook. This innovative web application allows you to transcribe spoken words into text, organize your thoughts, and manage important notes. Built with cutting-edge technology and creative design, Captain's Log sets sail to revolutionize how you capture and manage ideas.

audio-recorder audio-visualizer javascript mongodb mongodb-atlas nextjs once-ui openai react reactjs shadcn-ui tailwindcss typescript voice whisper

Last synced: 21 Nov 2024

https://github.com/mdbecker/whisper_cpp_macos_utils

Automated transcription workflow for macOS: Shell scripts to streamline audio recording, conversion, and transcription using whisper.cpp with macOS utilities like QuickTime Player and BlackHole-2ch.

audio-processing openai shell-scripts speech-to-text transcription whisper whisper-cpp

Last synced: 01 Dec 2024

https://github.com/yuxiang32/Audio-Transcription

Audio transcriber using OpenAI Whisper

openai whisper

Last synced: 24 Oct 2024

https://github.com/kolger/forty-two-transcribe

A Telegram bot that transcribes videos and audio messages to text via OpenAI Whisper API

openai self-hosted telegram whisper

Last synced: 25 Nov 2024

https://github.com/tobybenjaminclark/intermew

👨‍💻 Realistic, generative simulated interviews for Durhack 2024. Built using Webscraping, OpenCV, Deepface, Whisper, OpenAI and Gamemaker.

computer-vision openai-api whisper

Last synced: 25 Nov 2024

https://github.com/teemow/mnote

Generates meeting notes and summaries from video recordings

ai chatgpt google-meet kubeai kubernetes meeting-minutes transcription video-transcription whisper

Last synced: 07 Dec 2024

https://github.com/armaggheddon/whisper2me

whisper2me is a telegram bot written with pyTelegramBotAPI that uses OpenAI's whisper to perform speech2text so you no longer have listen to voice messages 🤫🔇

docker openia pytelegrambotapi python whisper

Last synced: 25 Nov 2024

https://github.com/heng30/vtbox

It is an offline voice to text tool. Using whisper model to transcribe.

rust slint-ui voice2text whisper

Last synced: 21 Nov 2024

https://github.com/kitschpatrol/ambient-novel

An interface for nonlinear interactive exploration of a novel.

ambient book fiction interactive novel svelte whisper

Last synced: 19 Nov 2024

https://github.com/iamarunbrahma/smart-voice-assistant

A simple voice assistant to get your queries in speech format and generate answers using ChatGPT API in both text and audio format.

chatgpt tts whisper

Last synced: 07 Dec 2024

https://github.com/njorogemaurice/speech-recognition-openai-whisper

This project is a web-based application that utilizes OpenAI's Whisper for speech-to-text conversion. The application allows users to upload audio files or record audio directly from their browser, and then converts the speech in these audio files to text using the Whisper model.

openai speech-recognition speech-to-text whisper

Last synced: 14 Jan 2025

https://github.com/pratikpakhale/terravis

Voice guided GIS system

genai gis lam llm voice whisper

Last synced: 09 Oct 2024

https://github.com/notyusheng/transcribe-translate_kubernetes

Local web app for transcription and translation services for audio and video using Whisper models

docker full-stack k8s kubernetes nodejs react reactjs self-hosted speech-to-text transcribe translate whisper

Last synced: 22 Nov 2024

https://github.com/senkita/gabriel

视频总结工具。

summarizer whisper

Last synced: 09 Oct 2024