Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Whisper

Whisper is an autoregressive language model developed by OpenAI. It is trained on a large corpus of text using a transformer architecture and is capable of generating high-quality natural language text. Whisper can be used for tasks such as language modeling, text completion, and text generation. It has shown impressive performance on various benchmarks and has been released by OpenAI to encourage research in the field of language modeling. Whisper is not yet available for public use, but it has the potential to transform the field of natural language processing and generate new opportunities for language-based applications.

https://github.com/rhysdg/whisper-onnx-python

A low-footprint GPU accelerated Speech to Text Python package for the Jetpack 5 era bolstered by an optimized graph

ai chatbot cuda machine-learning onnxruntime speech-to-text whisper

Last synced: 09 Oct 2024

https://github.com/natanielf/lecsum

Automatically transcribe and summarize lecture recordings completely on-device using AI.

ollama ollama-python whisper whisper-ai

Last synced: 18 Dec 2024

https://github.com/platput/pysubs

api to get audio transcription for video files from youtube, aws s3 and such. using OpenAI Whisper

openai whisper

Last synced: 24 Oct 2024

https://github.com/schnoddelbotz/whisper-ui

Transcribe audio/video to text, locally on macOS, Linux and Windows. A simple whisper.cpp wrapper/UI built with Go/Fyne.

ffmpeg ffmpeg-wrapper fyne gui local privacy speech-to-text transcription whisper whisper-cpp

Last synced: 22 Dec 2024

https://github.com/toomore/whisper

🔐📦📜🔑🍞 Write some notes by using the GPG encrypts.

gpg notes pgp quickstart whisper

Last synced: 23 Jan 2025

https://github.com/adisol07/sharpspeech

SharpSpeech is free, local and open source way to speech and wake word recognition.

audio speech speech-recognition speech-to-text wake-word-detection wakeword whisper whisper-ai

Last synced: 19 Dec 2024

https://github.com/tposcic/audio-to-srt-transcriber

Audio to srt transcriber in Python using whisper for transcription and Tcl/Tk for GUI

audio python3 srt transcription whisper

Last synced: 05 Jan 2025

https://github.com/brentwong-kiel1997/brents_ai_language_school

Use AI such as ChatGPT and Whisper to learn foreign languages from YouTube videos

ai chatgpt foreign-language openai openai-api whisper whisper-ai youtube

Last synced: 31 Dec 2024

https://github.com/sugarcane-mk/speaker_classification

This repository provides a Python script for extracting speech embeddings using OpenAI's Whisper model. The embeddings are high-dimensional feature vectors that capture the acoustic properties of the input audio. These embeddings can be used for downstream tasks such as speech classification, clustering, and speaker recognition.

asr classification feature-extraction openai speech-processing speech-recognition speech-to-text svm-classifier whisper

Last synced: 09 Jan 2025

https://github.com/chinese-soup/cbot-telegram-whisper

Simple bot that transcribes Telegram voice messages. Powered by go-telegram-bot-api & whisper.cpp Go bindings.

bot cpu-inference golang openai speech-recognition speech-to-text whisper whisper-cpp whispercpp

Last synced: 17 Jan 2025

https://github.com/topdev0215/AudioMultifunctionChatbot

This app enabling users to either record or upload audio files. Then utilizing OpenAI API (Whisper, GPT4) generates transcriptions, summaries, fact checks, sentiment analysis, and text metrics. Users can also intelligently chat about their transcriptions with a GPT4 chatbot. Data is stored relationally in SQLite and also vectorized in Pinecone.

gpt4 langcha nltk openai python3 sqlite3 streamlit strean whisper

Last synced: 24 Oct 2024

https://github.com/wtlow003/auto-subtitles

CLI tool to transcribe (+ translate) videos and embed subtitles automatically.

faster-whisper nllb subtitles subtitles-generator translation whisper whisper-cpp

Last synced: 15 Nov 2024

https://github.com/egorsmkv/star-adapt-uk

Fork of https://github.com/YUCHEN005/STAR-Adapt with some modifications for Ukrainian.

asr speech-recognition ukrainian whisper

Last synced: 19 Dec 2024

https://github.com/maawad/luna

Personal assistant

bot openai personal-assistant whisper

Last synced: 17 Dec 2024

https://github.com/etienneab3d/srt-sync

Synchronize SRT timestamps over an existing accurate transcription

aligner asr nlp subtitles text-to-speech whisper

Last synced: 19 Dec 2024

https://github.com/nri12/filter_voice

Dự án lọc và tắt tiếng video những từ khóa mong muốn

python tools whisper

Last synced: 19 Dec 2024

https://github.com/rokbenko/arctic-meet

ArcticMeet is an AI meeting assistant using Streamlit for the GUI and the Snowflake Arctic LLM via the Snowflake Cortex for the AI features

ffmpeg pandas plotly python pytorch snowflake snowflake-arctic snowflake-cortex snowpark streamlit transformers whisper

Last synced: 11 Jan 2025

https://github.com/oov/aviutl_subtitler

AviUtl+拡張編集の環境で Whisper による文字起こしをするためのプラグイン

aviutl aviutl-plugin whisper

Last synced: 19 Dec 2024

https://github.com/xaionaro-go/speech

A Speech-To-Text (with translation) library for Go; currently uses Whisper (runs locally if needed; no need in any API keys)

ai converter go golang library module package speech speech-recognition speech-to-text text whisper

Last synced: 13 Jan 2025

https://github.com/h3yn3s/tl-dl

A selfhostable webapp which helps you read those uselessly long (by nature) voice messages with the power of AI.

sveltekit tailwind whisper

Last synced: 24 Oct 2024

https://github.com/kunesj/holo-subs-search

Tool for searching transcriptions of vtuber videos.

holodex pyannote transcription vtuber whisper youtube

Last synced: 19 Jan 2025

https://github.com/canaxs/whisper-core

An application where users can make rumor-based news and earn money in return.

mysql panel spring spring-boot whisper

Last synced: 19 Dec 2024

https://github.com/bhattbhavesh91/openai-whisper-benchmarking

Comparing the performance of OpenAI's Whisper model on a GPU vs OpenAI's API

gpu openai speech-to-text whisper

Last synced: 16 Nov 2024

https://github.com/mikeesto/subber

A small CLI tool for converting video & audio to a text transcription

audio cli ffmpeg golang transcribe video whisper

Last synced: 19 Dec 2024

https://github.com/elmiraghorbani/gpt-speaker-diarization

Conversational Speaker Diarization using OpenAI AI Language Models(gpt-4) and OpenAI Whisper.

asr diarization gpt-4 openai speaker-diarization speech-recognition speech-to-text voice-activity-detection whisper youtube-dl

Last synced: 29 Nov 2024

https://github.com/aeronjl/transcribe

Python package for accurate audio transcription with speaker diarisation

audio-transcription gpt speaker-diarization whisper

Last synced: 09 Oct 2024

https://github.com/saamerm/whisperkit-ios15

iOS 15 - On-device Inference of Whisper Speech Recognition Models for Apple Silicon

ios ios15 swiftui whisper whisper-ai

Last synced: 19 Jan 2025

https://github.com/bbc-esq/batch-openai-whisper-ctranslate2

Batch process multiple files using the fasted ctranslate2 implementation of Open AI's Whisper

batch-processing batch-script openai openai-whisper pyside6 transcription translation whisper whisperx

Last synced: 11 Jan 2025

https://github.com/javi-cc/python-openai-generator-srt

Application that works offline written in python that transcribes and translates either audio or video files into text to generate a subtitle file (.srt) using deep learning libraries such as openai-whisper and argos-translate.

argos-translate docker docker-compose dockerfile offline openai openai-whisper python whisper

Last synced: 18 Dec 2024

https://github.com/hanpham32/react-native-whisper

A simple text transcription web/mobile app

flask ngrok react-native transcribe whisper

Last synced: 24 Dec 2024

https://github.com/vifill/audio-recorder-and-summarizer

This project is a Python script that records system audio on macOS using BlackHole, transcribes the audio using OpenAI's Whisper API, and summarizes the transcription using OpenAI's GPT models

ai audio blackhole gpt openai records summarize system whisper

Last synced: 20 Dec 2024

https://github.com/tylim88/voicefu-back-end

Translate Speech Into Japanese

chatgpt speech-synthesis voicevox whisper

Last synced: 18 Dec 2024

https://github.com/bloodworks-io/phlox

Self-hosted Ollama + Whisper powered AI medical scribe.

medical ollama rag scribe whisper

Last synced: 18 Dec 2024

https://github.com/malexandersalazar/casey

Casey is a Voice-Activated AI Companion for Mental Wellbeing & Content Creation #BuildWithAI

agentic-ai content-creation groq large-language-models python wellbeing whisper

Last synced: 18 Dec 2024

https://github.com/same-ou/whisper-speech-recognition

This repository contains a deployment of the Whisper speech recognition model using Flask and Python. Whisper is a cutting-edge speech recognition model designed to accurately transcribe speech input into text.

deep-learning flask machine-learning openai python pytorch whisper

Last synced: 01 Jan 2025

https://github.com/jalvarezz13/summarai

SummarAI utilizes PyMovie and Whisper to transcribe videos, enabling you to ask questions about the content using Llama2 and Llama-index for insightful interaction.

llama-index llama2 pymovie whisper

Last synced: 22 Dec 2024

https://github.com/mai-reborn/mai-offline-transcriber

Offline audio/video transcriber using Whisper, saving to .txt or .srt. Ensures privacy, no external servers used.

asr audio-transcription offline-transcriber pyqt6 python speech-recognition video-transcription whisper

Last synced: 05 Jan 2025

https://github.com/kitschpatrol/ambient-novel

An interface for nonlinear interactive exploration of a novel.

ambient book fiction interactive novel svelte whisper

Last synced: 20 Jan 2025

https://github.com/kristofferv98/whisper_turboapi

An optimized FastAPI server for OpenAI's Whisper whisper-large-v3-turbo model using MLX turbo optimization

ai api asynchronous audio audio-processing fastapi huggingface machine-learning macos mlx model-serving nlp openai optimization python speech-to-text synchronous transcription whisper whisper-turbo

Last synced: 14 Dec 2024

https://github.com/rishabhmathur06/fine-tuning-whisper-small-for-asr-

This repository contains notebook that shows how to fine-tune OpenAI's Whisper model on custom Hindi dataset.

artificial-intelligence asr automatic-speech-recognition fine-tuning openai python whisper whisper-model

Last synced: 19 Dec 2024

https://github.com/akhkim/babel

Real-time Internal Audio Translate and Transcriber that uses Whisper model

ai internal-audio real-time transcription translation whisper

Last synced: 19 Dec 2024

https://github.com/heyfoz/python-openai-whisper

This Python script provides a simple interface to transcribe audio files using the OpenAI API's speech-to-text functionality, powered by the Whisper model. The result is returned to the console as text or VTT (WebVTT) format.

ai api audio-transcription openai python speech-to-text whisper

Last synced: 19 Dec 2024

https://github.com/geo-y20/enhanced-learning-experience

IntelliLearn is a FastAPI-based application designed to process and transcribe audio and video files into text using the Whisper model. The application also supports processing PDF files to extract and summarize their content.

chat-application chatgpt educational-project fastapi groq-api huggingface lama llm pdf-files platform python speech-to-text text-summarization transformer whisper word2vec wordembedding

Last synced: 19 Dec 2024

https://github.com/doctorpok42/pheere

Pheere is a simple virtual assistant

ai chatgpt elevenlabs ts virtual-assistant whisper

Last synced: 10 Jan 2025

https://github.com/ajxv/rtstt

Real time speech to text transcription using OpenAi whisper

live-transcription openai openai-whisper python3 transcription whisper

Last synced: 22 Dec 2024

https://github.com/obay-ismaeel/post-generator

An API that generates social media posts by implementing RAG with Llama-3

ai api fastapi llama llm python retrieval-augmented-generation social-media whisper

Last synced: 12 Oct 2024

https://github.com/luluw8071/whisper-tune

Finetuning Whisper on your own voice

whisper

Last synced: 14 Dec 2024

https://github.com/deepbiolab/customer-complaint-classification

An GenAI-powered pipeline leveraging Whisper, DALL-E, and GPT to transform customer complaints into actionable insights with automated transcription, visualization, and classification.

azure dalle gpt whisper

Last synced: 23 Jan 2025

https://github.com/fatma-moanes/voice-assistant

Voice Assistant for FM-Clinic: A multilingual AI-powered voice assistant for booking doctor appointments, leveraging advanced speech-to-text, text-to-speech, and large language models for seamless, natural user interactions.

ai-assistant arabic arabic-nlp aws-polly chatbot gpt groq langchain langsmith llm mongodb multilingual openai speech-recognition speech-to-text streamlit text-to-speech transcription voice-assistant whisper

Last synced: 26 Dec 2024

https://github.com/nazago/meeting-minutes-generator

Script which takes a .wav audio file, performs speech-to-text using OpenAI/Whisper, and then, using Llama3, summarization and action point from the transcript generated

langchain-python llm-inference local-inference meeting-minutes ollama speech-to-text summarization whisper

Last synced: 02 Jan 2025

https://github.com/asai95/speech-recognition-api

Simple but extensible API for Speech Recognition.

speech-recognition whisper

Last synced: 02 Jan 2025

https://github.com/crucials/twaddle

speech analysis app that collects statistics like words frequencies and transcribed text

ai audio python python-eel speech-to-text vue whisper

Last synced: 24 Oct 2024

https://github.com/soenneker/soenneker.libraries.whisper.ctranslate

Simply adds the Whisper_CTrantlate2 Windows executable, updated daily (if available)

ai csharp ctranslate ctranslate2 dotnet faster libraries library whisper whisperctranslate

Last synced: 29 Dec 2024

https://github.com/sugarcane-mk/whisper

This repository provides a Python script for extracting speech embeddings using OpenAI's Whisper model. The embeddings are high-dimensional feature vectors that capture the acoustic properties of the input audio. These embeddings can be used for downstream tasks such as speech classification, clustering, and speaker recognition.

asr classification feature-extraction openai speech-processing speech-recognition speech-to-text svm-classifier whisper

Last synced: 02 Jan 2025

https://github.com/yjg30737/pyqt-simple-whisper-gui

Whisper text-to-speech, speech-to-text example in PyQt5 GUI

openai pyqt pyqt-ai pyqt5 pyqt5-desktop-application pyqt5-examples pyqt5-gui whisper

Last synced: 03 Jan 2025

https://github.com/mrbuslov/reminder_4u_bot

AI Telegram Bot Reminder. You send a free-form text OR voice reminder, the AI bot records it and reminds you at the right time!

ai ai-bot aiogram chatgpt django gpt-3 gpt-4 gpt-models python reminder telegram-bot voice-recognition whisper

Last synced: 10 Jan 2025

https://github.com/bilelouahmed/vocal-assistant

Python voice assistant (based on SpeechRecognition, Whisper and XTTS models) designed to transcribe speech to text, translate across languages, engage in chat mode, and ultimately respond vocally.

chatbot llm mistral-7b neo4j python rag speech-recognition text-to-speech transcription whisper xtts

Last synced: 21 Dec 2024

https://github.com/pjarbas/azure-ai

Examples using Azure AI services (DALLE3, Text to Speech, Whisper)

azure-openai dalle-3 image-generation-ai speech-synthesis text-to-speech whisper

Last synced: 21 Jan 2025

https://github.com/educa-ch/educa24-speech-to-summary

Demonstrator for an open-source speech-to-summary workflow

langchain ollama open-source open-weight speech-to-text summarization whisper

Last synced: 11 Oct 2024

https://github.com/werserk/techstormhack-1st-place

Решение соревнования ТехШторм от корпорации ТатНефть по анализу активности членов команды на ВКС

pyannote speaker-diarization speech-recognition streamlit whisper

Last synced: 11 Jan 2025

https://github.com/huuquyet/phowhisper-tiny

Converted clone of PhoWhisper: Automatic Speech Recognition for Vietnamese (2024)

onnx-models phowhisper speech-recognition transformersjs vietnamese vinai whisper

Last synced: 06 Dec 2024

https://github.com/charlot-dedjinou/hackathon-ia-multimodal-multilingue

Lors de ce hackathon, nous avons développé la solution Smart VT, une application web basée sur l'IA conçue pour sous-titrer et doubler n'importe quelle vidéo d'une langue à une autre (selon votre choix). Le projet s'appuie sur un frontend en React, des API Python pour le traitement des vidéos, et Node.js pour la gestion des sous-titres vidéo.

api dubble fastapi ffmpeg googletranslator mongodb moviepy nodejs openia reactjs subtitles whisper

Last synced: 12 Jan 2025

https://github.com/suchith-2002/whisperwave

Transcribe any Audio to Text.

openai whisper

Last synced: 08 Dec 2024

https://github.com/waikato-llm/whisper

Docker images for the whisper audio transcription library and variants.

audio transcription whisper

Last synced: 12 Jan 2025

https://github.com/jgw96/speech-to-text-web-toolkit

Making Speech-To-Text on the web easy, both local and in the cloud

ai lit transformersjs webcomponents whisper

Last synced: 06 Dec 2024

https://github.com/RingoMar/whisper-devcontainer

Openai whisper inside of vscode docker devcontainer using example files

ai devcontainer docker openapi python whisper

Last synced: 24 Oct 2024

https://github.com/lohiyah/vidcraft

VidCraft is an AI-driven backend application that generates videos from user-defined topics and backgrounds. It combines text, audio, and visuals using advanced AI services, making video creation accessible and efficient for developers and content creators alike.

ai elevenlabs fastapi ffmpgeg full-stack-web-development gemini-ai huggingface image-generation machine-learning reactjs subtitles text-to-speech typescript video-generation whisper

Last synced: 18 Jan 2025

https://github.com/MattCode64/Scriba

SCRIBA is a web application that transcribes audio files. It supports .mp3 files and provides the transcription results in a user-friendly interface.

fastapi python speech-to-text whisper

Last synced: 24 Oct 2024

https://github.com/vlazic/json-verbose-to-vtt-converter

Transform `json_verbose` transcriptions from OpenAI, Groq, or command-line tools into VTT files with this Deno converter.

converter groq json json-verbose openai vtt webvtt whisper

Last synced: 26 Jan 2025

https://github.com/ty-martz/audiologic

Python Module to process and predict on music attributes

machine-learning music python whisper

Last synced: 24 Oct 2024

https://github.com/luizcalaca/transcricao-medica

Full Stack + Whisper Transcription + Node.js REST API + VITE + React.js + Railway deploy

full-stack nodejs openai openai-api railway reactjs sequelize sequelize-orm vite whisper whisper-ai

Last synced: 25 Jan 2025

https://github.com/evil0ctal/whisper-speech-to-text-api

An open source Speech-to-Text API. The project is based on OpenAI's Whisper model and uses the asynchronous features of FastAPI to efficiently wrap it and support more custom functions.

ai api fastapi openai-whisper speech-to-text speech-to-text-api whisper whisper-ai whisper-api

Last synced: 25 Oct 2024

https://github.com/deshwalmahesh/whisper-fastapi-realtime

It is Front + Backend app that uses openai/whisper-large-v3-turbo in your consumer grade system to provide real live audio transcription

audio-transcription fastapi huggingface live pyaudio realtime transcription transformers whisper whisper-large

Last synced: 25 Oct 2024

https://github.com/LarissaGuder/whisper-datastream

Transcription and NER in streaming environment

bert-ner python spark-streaming whisper

Last synced: 24 Oct 2024

https://github.com/yuxiang32/Audio-Transcription

Audio transcriber using OpenAI Whisper

openai whisper

Last synced: 24 Oct 2024

https://github.com/isladot/speech-to-text-whisper

A speech-to-text converter powered by OpenAI's Whisper model. Easy-to-use tool for transcribing audio into text with high accuracy.

ai python s2t speech-to-text whisper

Last synced: 19 Jan 2025

https://github.com/nanext21/vidcraft

VidCraft is an AI-driven backend application that generates videos from user-defined topics and backgrounds. It combines text, audio, and visuals using advanced AI services, making video creation accessible and efficient for developers and content creators alike.

elevenlabs fastapi ffmpgeg full-stack-web-development gemini-ai github-config image-generation machine-learning mern-project subtitles typescript video-generation whisper whisper-ai

Last synced: 19 Jan 2025

https://github.com/jt-427/whisper-ui

A minimalist and elegant UI for OpenAI's Whisper speech-to-text model, built with React + Vite and Flask

flask openai react speech-to-text transcription vite whisper

Last synced: 19 Jan 2025

https://github.com/devgeekm/chat-it-up

Chat It Up! elevates conversations by transforming YouTube URLs, documents, and audio into text, enabling interactive Q&A and summaries. With one click, turn media into time-saving, knowledge-rich dialogues.

ai azure azure-functions azureservices blob-storage fastapi python rag whisper youtube-dl

Last synced: 20 Dec 2024

https://github.com/njorogemaurice/speech-recognition-openai-whisper

This project is a web-based application that utilizes OpenAI's Whisper for speech-to-text conversion. The application allows users to upload audio files or record audio directly from their browser, and then converts the speech in these audio files to text using the Whisper model.

openai speech-recognition speech-to-text whisper

Last synced: 14 Jan 2025

https://github.com/egorsmkv/optimized-whisper-intel

Run quantized Whisper models only on CPU with Intel hardware

intel onnx onnxruntime quantized-neural-networks whisper

Last synced: 19 Dec 2024

https://github.com/tomdewildt/whisper-experiment

Experiments using the Whisper model from Open AI

colab jupyter python transcribe transformers translate whisper

Last synced: 27 Dec 2024

https://github.com/flaviodelgrosso/whisper-transcriber

Use OpenAI's Whisper to transcribe audio files and diariaze speakers of the transcribed text

ai audio-to-text diarization openai torch whisper

Last synced: 19 Dec 2024

https://github.com/danibcorr/university-helper

🧑‍🎓 University Helper streamlines academic and administrative tasks for students, educators, and researchers. It provides tools for managing document metadata, converting PDFs to Markdown, transcribing audio, analyzing grade statistics, and more.

deep-learning documentation-tool metadata ocr open-source pdf python statistics university whisper

Last synced: 19 Dec 2024