Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Whisper

Whisper is an automatic speech recognition (ASR) model developed by OpenAI. It is an encoder-decoder transformer trained on a large corpus of multilingual audio paired with transcripts, and it generates text autoregressively from speech input. Whisper can be used for tasks such as multilingual transcription, speech translation into English, and language identification. OpenAI has released the model weights and inference code as open source, so it is publicly available and can be run locally. A few repositories tagged with this topic are unrelated to the model, such as those using the Ethereum Whisper messaging protocol.

https://github.com/i4ds/whisper-finetune

This repository contains code for fine-tuning the Whisper speech-to-text model.

fine-tuning nlp speech-to-text whisper

Last synced: 09 Oct 2024

https://github.com/shani-sinojiya/sandalquest

AI/ML project for recognizing colloquial Kannada speech and building a speech-based Q&A system focused on sandalwood cultivation.

ai audio-processing data-augmentation deep-learning machine-learning mongodb nlp python pytorch question-answering speech-based-question-answering-system speech-recognition whisper

Last synced: 10 Jan 2025

https://github.com/tylim88/voicefu

Translate Speech Into Japanese

chatgpt speech-synthesis voicevox whisper

Last synced: 18 Dec 2024

https://github.com/baristikir/voice-typing

Simple desktop application with voice typing features. Runs and transcribes locally, works fully offline, and supports real-time transcription. Powered by OpenAI Whisper ASR models and the whisper.cpp inference engine.

electron whisper whisper-cpp

Last synced: 24 Dec 2024

https://github.com/natanielf/lecsum

Automatically transcribe and summarize lecture recordings completely on-device using AI.

ollama ollama-python whisper whisper-ai

Last synced: 18 Dec 2024

https://github.com/chinese-soup/cbot-telegram-whisper

Simple bot that transcribes Telegram voice messages. Powered by go-telegram-bot-api & whisper.cpp Go bindings.

bot cpu-inference golang openai speech-recognition speech-to-text whisper whisper-cpp whispercpp

Last synced: 17 Jan 2025

https://github.com/egorsmkv/star-adapt-uk

Fork of https://github.com/YUCHEN005/STAR-Adapt with some modifications for Ukrainian.

asr speech-recognition ukrainian whisper

Last synced: 19 Dec 2024

https://github.com/nri12/filter_voice

A project for filtering and muting chosen keywords in videos.

python tools whisper

Last synced: 19 Dec 2024

https://github.com/oov/aviutl_subtitler

A plugin for Whisper-based transcription in an AviUtl + Extended Editing (拡張編集) environment.

aviutl aviutl-plugin whisper

Last synced: 19 Dec 2024

https://github.com/canaxs/whisper-core

An application where users can make rumor-based news and earn money in return.

mysql panel spring spring-boot whisper

Last synced: 19 Dec 2024

https://github.com/breadrock1/audio-to-text

A simple backend project that uses whisper-rs.

actix-web audio-to-text rust swagger-ui whisper

Last synced: 10 Jan 2025

https://github.com/tranbavinhson/eth-decentralized-chat

Decentralized chat app built with the Ethereum Whisper protocol and Vue.js

ethereum vue vuejs whisper whisper-protocol

Last synced: 26 Dec 2024

https://github.com/ayeshaaaaaaaaa/ai-powered-video-analysis-with-object-detection-and-detailed-scene-narratives

AI-driven video analysis system that extracts and transcribes audio with Whisper, detects objects using YOLO, and generates comprehensive scene descriptions with GPT-2. The project combines transcriptions and object detections to produce detailed, context-aware video narratives.

bart gpt2 video-analysis whisper yolov8

Last synced: 02 Jan 2025

https://github.com/fukuro-kun/wortweber

Wortweber is an open-source project under development that explores real-time speech transcription with AI technology. It serves as a learning and experimentation platform for speech recognition in German and English.

speech-to-text whisper

Last synced: 17 Jan 2025

https://github.com/winstxnhdw/capgen

A fast CPU-first video/audio transcriber for generating caption files with Whisper and CTranslate2, hosted on Hugging Face Spaces.

asr automatic-speech-recognition caddy ctranslate2 docker fastapi huggingface huggingface-spaces uvicorn-gunicorn whisper

Last synced: 23 Oct 2024

https://github.com/valiantlynx/custom-whisper-api

This project provides a custom API wrapper for the open-source Whisper model using FastAPI. It allows you to integrate Whisper into your applications for automatic speech recognition (ASR) tasks.

ai docker-compose fastapi python whisper

Last synced: 10 Jan 2025

https://github.com/marquesafonso/multilang-asr-captioner

A multilingual automatic speech recognition and video captioning tool using faster-whisper. Supports real-time translation to English. Runs on a consumer-grade CPU.

automatic-speech-recognition captioning-videos faster-whisper whisper

Last synced: 24 Oct 2024

https://github.com/Op27/meeting_minutes_generator

This Python application automates the process of generating meeting minutes from an audio recording. It uses the Whisper library for transcription and the OpenAI GPT models for summarizing content, then outputs the result in a Word document.

ai audio-processing document-automation meeting-minutes openai python speech-recognition text-summarization transcription whisper

Last synced: 24 Oct 2024

https://github.com/bbc-esq/whisper-solo-with-gui

OpenAI's Whisper program with a simple lightweight GUI.

pyqt pyqt6 pyqt6-gui transcribe transcribe-audio-files translate whisper

Last synced: 11 Jan 2025

https://github.com/TranBaVinhSon/eth-decentralized-chat

Decentralized chat app built with the Ethereum Whisper protocol and Vue.js

ethereum vue vuejs whisper whisper-protocol

Last synced: 24 Oct 2024

https://github.com/lazauk/aoai-entraidauth-sdkv1

Authenticating with Entra ID (formerly Azure AD) to access Azure OpenAI models in Python SDK v1.x

ai authentication azure azure-active-directory dall-e embeddings entra-id gpt openai whisper

Last synced: 12 Jan 2025

https://github.com/nerdimite/meetsy-app

Frontend for the Workshop on Building an End-to-End AI Meeting Assistant

gpt-3 nextjs sentence-transformers tailwindcss whisper

Last synced: 24 Oct 2024

https://github.com/tracywong117/ai-learning-material-from-video

Supports subtitling, translation, and RAG to generate language-learning material from video.

ai auto-subtitle gpt-translate groq groq-api rag subtitles-generator translate whisper

Last synced: 19 Jan 2025

https://github.com/Shtirmann/V2T

Telegram bot which automatically transcribes all voice and video messages to text.

ai aiogram faster-whisper python telegram-bot telegram-bot-python voice-to-text whisper

Last synced: 24 Oct 2024

https://github.com/thewh1teagle/whisper.zig

Transcribe audio with whisper in zig

asr openai whisper zig

Last synced: 24 Jan 2025

https://github.com/marty1885/useful-whisper-server

Whisper server based on useful-transformers for the RK3588

npu rk3588 rockchip useful-transformers whisper

Last synced: 05 Dec 2024

https://github.com/stnderror/robotron

🤖 A personal robot assistant for Telegram

assistant bot dall-e gpt-35-turbo openai telegram-bot whisper

Last synced: 25 Jan 2025

https://github.com/tposcic/audio-to-srt-transcriber

Audio to srt transcriber in Python using whisper for transcription and Tcl/Tk for GUI

audio python3 srt transcription whisper

Last synced: 05 Jan 2025

https://github.com/mickekring/top-of-mind-clara

Clara is a prototype that lets employees make their voices heard anonymously. The employee can speak or type what they want to say, and AI anonymizes it. The employee also has access to a chatbot to consult. All employees' thoughts are then analyzed and compiled in a dashboard.

ai chatbot feedback openai python streamlit transcription whisper

Last synced: 22 Dec 2024

https://github.com/luluw8071/whisper-tune

Finetuning Whisper on your own voice

whisper

Last synced: 14 Dec 2024

https://github.com/eva-kaushik/multilingual-transcription-with-openai_whisper

Whisper Automatic Speech Recognition (ASR) Model

openai openai-api transcription webapp whisper

Last synced: 22 Dec 2024

https://github.com/waikato-llm/whisper

Docker images for the whisper audio transcription library and variants.

audio transcription whisper

Last synced: 12 Jan 2025

https://github.com/mickekring/top-of-mind-beromfabriken

Giving praise to a colleague can feel a little awkward, but research has shown that it can make us feel better at work and even make us more productive. Hearing that colleagues value and notice you simply increases your well-being.

api gpt openai python transcription whisper

Last synced: 16 Jan 2025

https://github.com/wa-lead/audio2md

Summarizes audio using OpenAI's Whisper-1 model and GPT-3.5 Turbo

audio-processing gpt-3 openai python whisper

Last synced: 26 Jan 2025

https://github.com/suchith-2002/whisperwave

Transcribe any Audio to Text.

openai whisper

Last synced: 08 Dec 2024

https://github.com/baomeomeo/speech

A speech-to-text (with translation) library for Go; currently uses Whisper (can run locally; no API keys required)

ai converter go golang library module package speech speech-recognition speech-to-text text whisper

Last synced: 13 Jan 2025

https://github.com/msrsaditya/speech2speech

A Personal Digital Assistant designed to help you with quick responses.

ollama openai phi3 sox tts whisper

Last synced: 27 Jan 2025

https://github.com/concaption/containerized-transcription-api

Containerized Transcription API using Whisper Model and FastAPI

docker fastapi openai transcription whisper

Last synced: 16 Dec 2024

https://github.com/seanvelasco/ai

Cloudflare AI challenge submission: Slater - your virtual foreign language friend

ai artificial-intelligence language-learning llama2 llm m2m100 machine-learning whisper

Last synced: 09 Dec 2024

https://github.com/cnseniorious000/dl-a2t

download, audio-to-text PyPI: https://pypi.org/p/dl-a2t

audio transcription whisper youtube

Last synced: 02 Jan 2025

https://github.com/velocitatem/dontlectureme

A program that pays attention to your lectures for you.

ai lectures university whisper

Last synced: 03 Dec 2024

https://github.com/homelab-00/longformstt

A Python script that uses faster-whisper and PyTorch for long-form transcription. Uses silence detection based on RMS/peak values. Has global hotkeys for easy use.

faster-whisper python speech-to-text whisper

Last synced: 09 Jan 2025

https://github.com/patryk-ku/sasayaki

A small CLI tool that simplifies and automates the process of installing and using AI models to transcribe and translate videos.

automation cli faster-whisper gemini-api transcription translation whisper whisper-cpp

Last synced: 05 Jan 2025

https://github.com/orhancavus/transcribe_video

Extract Subtitles from YouTube Videos with OpenAI Whisper and Insanely Fast Whisper

insanely-fast speach-to-text whisper

Last synced: 09 Jan 2025

https://github.com/zuplyx/subtitle-creator

Add English subtitles to videos using openai/whisper-large-v3

open-ai poetry-python python3 subtitles-generator whisper

Last synced: 09 Dec 2024

https://github.com/status-im/infra-role-status-go

Ansible role for status-go

ansible-role infra waku whisper

Last synced: 05 Jan 2025

https://github.com/lohiyah/vidcraft

VidCraft is an AI-driven backend application that generates videos from user-defined topics and backgrounds. It combines text, audio, and visuals using advanced AI services, making video creation accessible and efficient for developers and content creators alike.

ai elevenlabs fastapi ffmpgeg full-stack-web-development gemini-ai huggingface image-generation machine-learning reactjs subtitles text-to-speech typescript video-generation whisper

Last synced: 18 Jan 2025

https://github.com/studiowebux/tommygotchi

whisper, piper, llama-gpt, python, fun .. so much fun !

llama-gpt piper python3 whisper whisper-ai

Last synced: 05 Jan 2025

https://github.com/huuquyet/phowhisper-tiny

Converted clone of PhoWhisper: Automatic Speech Recognition for Vietnamese (2024)

onnx-models phowhisper speech-recognition transformersjs vietnamese vinai whisper

Last synced: 06 Dec 2024

https://github.com/paszkoo/real_time_whisper_iot

Real time voice transcription from default audio input using faster-whisper

ai iot-application iot-device smart-home voice-assistant voice-recognition whisper

Last synced: 17 Jan 2025

https://github.com/mario-huang/whisper-desktop

A desktop app for easy subtitling using the Whisper model.

ai desktop gradio open-source python pytorch tauri web-ui whisper

Last synced: 17 Jan 2025

https://github.com/educa-ch/educa24-speech-to-summary

Demonstrator for an open-source speech-to-summary workflow

langchain ollama open-source open-weight speech-to-text summarization whisper

Last synced: 11 Oct 2024

https://github.com/ashot72/answering-questions-about-images

You can upload images, ask questions about them using voice prompts, and then listen to the spoken responses

answering-questions blip-2-ai-model gtts large-language-models llm replicate speech-to-text text-to-speech whisper

Last synced: 30 Dec 2024

https://github.com/bilelouahmed/vocal-assistant

Python voice assistant (based on SpeechRecognition, Whisper and XTTS models) designed to transcribe speech to text, translate across languages, engage in chat mode, and ultimately respond vocally.

chatbot llm mistral-7b neo4j python rag speech-recognition text-to-speech transcription whisper xtts

Last synced: 21 Dec 2024

https://github.com/njorogemaurice/speech-recognition-openai-whisper

This project is a web-based application that utilizes OpenAI's Whisper for speech-to-text conversion. The application allows users to upload audio files or record audio directly from their browser, and then converts the speech in these audio files to text using the Whisper model.

openai speech-recognition speech-to-text whisper

Last synced: 14 Jan 2025

https://github.com/thealphamerc/audio-to-text

Transcribe multi-lingual audio clips using whisper model

openai whisper

Last synced: 16 Dec 2024

https://github.com/jgurakuqi/auto-subtitles-generator

A utility based on neural networks for generating srt subtitles from common video formats

faster-whisper ffmpeg neural-network python whisper

Last synced: 26 Jan 2025

https://github.com/tomdewildt/whisper-experiment

Experiments using the Whisper model from OpenAI

colab jupyter python transcribe transformers translate whisper

Last synced: 27 Dec 2024

https://github.com/mariatepei/vt_thesis_mtepei

This repository accompanies my MSc Thesis for the degree Voice Technology, storing all referenced data and other relevant resources.

data-augmentation fastspeech2 speech-recognition whisper

Last synced: 09 Oct 2024

https://github.com/arkaniightt/web_app_transcriptor_openai

Automatic audio-to-text transcription tool using Streamlit and OpenAI, with support for microphone, video, and audio file uploads.

ai app openai python streamlit tool tools transcript transcription webapp whisper

Last synced: 12 Dec 2024

https://github.com/fkiller/whispertranscript

Transcribe voice from mic input using OpenAI Whisper API.

llm openai transcribe transcript transcription webaudio whisper

Last synced: 06 Jan 2025

https://github.com/RingoMar/whisper-devcontainer

OpenAI Whisper inside a VS Code Docker dev container, with example files

ai devcontainer docker openapi python whisper

Last synced: 24 Oct 2024

https://github.com/evilfreelancer/whisper-tests

Collection of experiments on OpenAI Whisper models

api-server docker-compose testing transcription whisper

Last synced: 17 Dec 2024

https://github.com/MattCode64/Scriba

SCRIBA is a web application that transcribes audio files. It supports .mp3 files and provides the transcription results in a user-friendly interface.

fastapi python speech-to-text whisper

Last synced: 24 Oct 2024

https://github.com/s-emanuilov/whispercpp_kit

A wrapper on whisper.cpp with additional helper features like model management capabilities.

asr whisper

Last synced: 13 Dec 2024

https://github.com/zahidhasann88/video-summarizer

Summarizes videos by extracting audio and generating summaries based on the audio content.

nodejs openai typescript whisper

Last synced: 07 Jan 2025

https://github.com/mai-reborn/mai-offline-transcriber

Offline audio/video transcriber using Whisper, saving to .txt or .srt. Ensures privacy, no external servers used.

asr audio-transcription offline-transcriber pyqt6 python speech-recognition video-transcription whisper

Last synced: 05 Jan 2025

https://github.com/jgw96/speech-to-text-web-toolkit

Making Speech-To-Text on the web easy, both local and in the cloud

ai lit transformersjs webcomponents whisper

Last synced: 06 Dec 2024

https://github.com/hsiehbocheng/yt-gen-caption

This is a project for generating captions for YouTube videos using Faster Whisper & yt_dlp.

asr python whisper

Last synced: 19 Dec 2024

https://github.com/philogicae/docker-faster-whisper-fr-api

Docker - Faster Whisper FR - RunPod Serverless API

ctranslate2 docker faster-whisper french runpod serverless whisper

Last synced: 08 Jan 2025

https://github.com/soenneker/soenneker.libraries.whisper.ctranslate

Simply adds the Whisper_CTranslate2 Windows executable, updated daily (if available)

ai csharp ctranslate ctranslate2 dotnet faster libraries library whisper whisperctranslate

Last synced: 29 Dec 2024

https://github.com/theaussiepom/wyoming-openai

OpenAI STT and TTS support for the Wyoming protocol

home-assistant home-assistant-assist openai sst tts whisper wyoming

Last synced: 21 Dec 2024

https://github.com/javi-cc/python-openai-generator-srt

An offline application written in Python that transcribes and translates audio or video files into text to generate a subtitle file (.srt), using deep learning libraries such as openai-whisper and argos-translate.

argos-translate docker docker-compose dockerfile offline openai openai-whisper python whisper

Last synced: 18 Dec 2024

https://github.com/hanpham32/react-native-whisper

A simple text transcription web/mobile app

flask ngrok react-native transcribe whisper

Last synced: 24 Dec 2024

https://github.com/pjarbas/azure-ai

Examples using Azure AI services (DALLE3, Text to Speech, Whisper)

azure-openai dalle-3 image-generation-ai speech-synthesis text-to-speech whisper

Last synced: 21 Jan 2025

https://github.com/tylim88/voicefu-back-end

Translate Speech Into Japanese

chatgpt speech-synthesis voicevox whisper

Last synced: 18 Dec 2024

https://github.com/ty-martz/audiologic

Python Module to process and predict on music attributes

machine-learning music python whisper

Last synced: 24 Oct 2024

https://github.com/huuquyet/phowhisper-small

Converted clone of PhoWhisper: Automatic Speech Recognition for Vietnamese (2024)

onnx-models phowhisper speech-recognition transformersjs vietnamese vinai whisper

Last synced: 06 Dec 2024