Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Whisper

Whisper is an autoregressive language model developed by OpenAI. It is trained on a large corpus of text using a transformer architecture and is capable of generating high-quality natural language text. Whisper can be used for tasks such as language modeling, text completion, and text generation. It has shown impressive performance on various benchmarks and has been released by OpenAI to encourage research in the field of language modeling. Whisper is not yet available for public use, but it has the potential to transform the field of natural language processing and generate new opportunities for language-based applications.

https://github.com/nerdimite/meetsy-app

Frontend for the Workshop on Building an End-to-End AI Meeting Assistant

gpt-3 nextjs sentence-transformers tailwindcss whisper

Last synced: 24 Oct 2024

https://github.com/ayeshaaaaaaaaa/ai-powered-video-analysis-with-object-detection-and-detailed-scene-narratives

AI-driven video analysis system that extracts and transcribes audio with Whisper, detects objects using YOLO, and generates comprehensive scene descriptions with GPT-2. The project combines transcriptions and object detections to produce detailed, context-aware video narratives.

bart gpt2 video-analysis whisper yolov8

Last synced: 08 Nov 2024

https://github.com/sumitesh9/localizedwhisper

A initiative to make OpenAI Whisper more localized by adding more languages.

albanian albanian-language huggingface openai speech speech-to-text whisper

Last synced: 08 Nov 2024

https://github.com/jojasadventure/whisper-client

Very simple Python based client for Whisper compatible endpoint

desktop-app dictation faster-whisper macos productivity python speech-to-text stt whisper

Last synced: 09 Oct 2024

https://github.com/yc-w-cn/s-wave

S-WAVE is a browser-based podcast reading app with AI transcription. User data is stored locally. MIT License.

podcast pouchdb typescript wasm whisper whisper-cpp

Last synced: 07 Nov 2024

https://github.com/aws-samples/amazon-ivs-webgpu-captions-demo

This repository contains an experimental demo application that shows how you can add client-side auto-generated captions to Amazon IVS Real-time and Low-latency streams using transformers.js and WebGPU.

ai amazon-ivs aws captions experimental ivs-lowlatency ivs-realtime lambda lowlatency lvl-300 realtime serverless transformersjs web webgpu webrtc whisper

Last synced: 09 Oct 2024

https://github.com/tranbavinhson/eth-decentralized-chat

Decentralized chat app by Ethereum Whisper protocol + Vuejs

ethereum vue vuejs whisper whisper-protocol

Last synced: 06 Nov 2024

https://github.com/alancunningham/chatgpt-assistant

A ChatGPT assistant with voice activation and image generation, connected to a Raspberry Pi display.

chatgpt chatgpt-api dall-e dall-e-api porcupine python raspberry-pi whisper

Last synced: 10 Nov 2024

https://github.com/chaoticbyte/audio-summarize

An audio summarizer (faster-whisper and BART glued together)

ai ai-summarizer audio bart ctranslate2 faster-whisper nlp speech-to-text summarization whisper

Last synced: 09 Oct 2024

https://github.com/adisol07/sharpspeech

SharpSpeech is free, local and open source way to speech and wake word recognition.

audio speech speech-recognition speech-to-text wake-word-detection wakeword whisper whisper-ai

Last synced: 19 Dec 2024

https://github.com/rhysdg/whisper-onnx-python

A low-footprint GPU accelerated Speech to Text Python package for the Jetpack 5 era bolstered by an optimized graph

ai chatbot cuda machine-learning onnxruntime speech-to-text whisper

Last synced: 09 Oct 2024

https://github.com/bigyaa/transcription-system

This versatile tool is designed for anyone in need of a robust solution for transcribing and diarizing large volumes of audio files. Whether you are dealing with terabytes or even larger quantities, our tool ensures efficient and accurate processing. Ideal for researchers, content creators, and businesses.

accessibility diarization speech-to-text storytelling-with-data transcription whisper

Last synced: 19 Dec 2024

https://github.com/roman01la/sub-deep

Transcribe and translate audio with AI

deepl transcribe translate whisper

Last synced: 08 Nov 2024

https://github.com/gamut73/quizinator

Generating quizzes, on Android, from YouTube videos.

kotlin-android llm python whisper

Last synced: 19 Dec 2024

https://github.com/slinusc/speaker_identification_evaluation

Evaluating the Effectiveness of Transformer Layers in Wav2Vec 2.0, XLS-R, and Whisper for Speaker Identification Tasks

wav2vec2 whisper xls-r

Last synced: 09 Oct 2024

https://github.com/bbc-esq/whisper-solo-with-gui

OpenAI's Whisper program with a simple lightweight GUI.

pyqt pyqt6 pyqt6-gui transcribe transcribe-audio-files translate whisper

Last synced: 12 Nov 2024

https://github.com/bbc-esq/batch-openai-whisper-ctranslate2

Batch process multiple files using the fasted ctranslate2 implementation of Open AI's Whisper

batch-processing batch-script openai openai-whisper pyside6 transcription translation whisper whisperx

Last synced: 12 Nov 2024

https://github.com/rokbenko/arctic-meet

ArcticMeet is an AI meeting assistant using Streamlit as a GUI and the Snowflake Arctic LLM via the Snowflake Cortex

ffmpeg pandas plotly python pytorch snowflake snowflake-arctic snowflake-cortex snowpark streamlit transformers whisper

Last synced: 12 Nov 2024

https://github.com/niqifan007/openai-tts-stt-streamlit

A gui interface for tts (text-to-speech) and stt (speech-to-text) interfaces using the openai api developed by Streamlit, with a history function一个使用Streamlit开发的openai的api接口的tts(文字转语音)和stt(语音转文字)接口的gui界面,带有历史记录功能

openai openai-api streamlit stt-gui tts tts-gui whisper whisper-api

Last synced: 09 Oct 2024

https://github.com/schnoddelbotz/whisper-ui

Transcribe audio/video to text, locally on macOS, Linux and Windows. A simple whisper.cpp wrapper/UI built with Go/Fyne.

ffmpeg ffmpeg-wrapper fyne gui local privacy speech-to-text transcription whisper whisper-cpp

Last synced: 22 Dec 2024

https://github.com/dustland/talk

IELTS Talk Master

ielts nextjs15 openai tts whisper

Last synced: 14 Nov 2024

https://github.com/aeronjl/transcribe

Python package for accurate audio transcription with speaker diarisation

audio-transcription gpt speaker-diarization whisper

Last synced: 09 Oct 2024

https://github.com/silentsoft/whiscribe

🎬 A tool with a UI that transcribes audio files into subtitles using OpenAI's Whisper and runs completely on your local machine.

audio-transcription openai-whisper srt subtitle whisper

Last synced: 11 Nov 2024

https://github.com/mikeesto/subber

A small CLI tool for converting video & audio to a text transcription

audio cli ffmpeg golang transcribe video whisper

Last synced: 19 Dec 2024

https://github.com/etienneab3d/srt-sync

Synchronize SRT timestamps over an existing accurate transcription

aligner asr nlp subtitles text-to-speech whisper

Last synced: 19 Dec 2024

https://github.com/troyanovsky/llm_summarizer

Use LLM and Whisper to summarize long text and audio/video

gpt summarization whisper

Last synced: 13 Nov 2024

https://github.com/bhattbhavesh91/openai-whisper-benchmarking

Comparing the performance of OpenAI's Whisper model on a GPU vs OpenAI's API

gpu openai speech-to-text whisper

Last synced: 16 Nov 2024

https://github.com/fukuro-kun/wortweber

Wortweber ist ein sich in der Entwicklung befindendes Open-Source-Projekt, das Echtzeit-Sprachtranskription mit KI-Technologie erforscht. Es dient als Lern- und Experimentierplattform für Spracherkennung in Deutsch und Englisch.

speech-to-text whisper

Last synced: 17 Nov 2024

https://github.com/valiantlynx/custom-whisper-api

This project provides a custom API wrapper for the open-source Whisper model using FastAPI. It allows you to integrate Whisper into your applications for automatic speech recognition (ASR) tasks.

ai docker-compose fastapi python whisper

Last synced: 22 Dec 2024

https://github.com/wtlow003/auto-subtitles

CLI tool to transcribe (+ translate) videos and embed subtitles automatically.

faster-whisper nllb subtitles subtitles-generator translation whisper whisper-cpp

Last synced: 15 Nov 2024

https://github.com/topdev0215/AudioMultifunctionChatbot

This app enabling users to either record or upload audio files. Then utilizing OpenAI API (Whisper, GPT4) generates transcriptions, summaries, fact checks, sentiment analysis, and text metrics. Users can also intelligently chat about their transcriptions with a GPT4 chatbot. Data is stored relationally in SQLite and also vectorized in Pinecone.

gpt4 langcha nltk openai python3 sqlite3 streamlit strean whisper

Last synced: 24 Oct 2024

https://github.com/ioriens/whisper-video

Generate subtitles for all the videos in a folder with OpenAI's Whisper privately in your computer.

subtitle-generator video-to-audio video-to-text whisper

Last synced: 17 Nov 2024

https://github.com/h3yn3s/tl-dl

A selfhostable webapp which helps you read those uselessly long (by nature) voice messages with the power of AI.

sveltekit tailwind whisper

Last synced: 24 Oct 2024

https://github.com/lelserslasers/transcriberplus

Transcribe your files with ease!

flask python socket-io svelte trancribe whisper

Last synced: 25 Nov 2024

https://github.com/nerdimite/meetsy-backend

AI Backend for the Workshop on Building an End-to-End AI Meeting Assistant

gpt-3 nextjs sentence-transformers tailwindcss whisper

Last synced: 24 Oct 2024

https://github.com/gangula-karthik/memo-mate

🚀 Discord meetings redefined with Memo Mate: Transcribe, summarize, and automate minutes seamlessly! ✨

discord-bot huggingface mistral py-cord speech-to-text transcribe whisper

Last synced: 22 Dec 2024

https://github.com/toLSC/tolsc-speech-to-text

Speech to text service for toLSC app implemented with OpenAI Whisper model

fastapi python speech-recognition speech-to-text tts whisper

Last synced: 24 Oct 2024

https://github.com/platput/pysubs

api to get audio transcription for video files from youtube, aws s3 and such. using OpenAI Whisper

openai whisper

Last synced: 24 Oct 2024

https://github.com/extrange/transcription-benchmarks

Speech to text model benchmarks

transcription whisper

Last synced: 08 Dec 2024

https://github.com/shani-sinojiya/sandalquest

AI/ML project for recognizing colloquial Kannada speech and building a speech-based Q&A system focused on sandalwood cultivation.

ai audio-processing data-augmentation deep-learning machine-learning mongodb nlp python pytorch question-answering speech-based-question-answering-system speech-recognition whisper

Last synced: 02 Dec 2024

https://github.com/amanpriyanshu/medtranslate-360

MedTranslate 360 redefines medical documentation by providing an AI-powered assistant designed specifically for healthcare professionals.

ai gemini gemini-api hackathon llm llms medical ml privacy streamlit whisper

Last synced: 15 Dec 2024

https://github.com/marty1885/useful-whisper-server

Whisper server based on useful-transformers for the RK3588

npu rk3588 rockchip useful-transformers whisper

Last synced: 05 Dec 2024

https://github.com/volkansah/text-to-speech-pygui-for-whisper

This is a simple Python-based GUI application that allows users to generate speech from text using the OpenAI API. The application provides a user-friendly interface for inputting text and selecting from different voices to create personalized audio output.

openai openai-api python-gui-tkinter python3 whisper whisper-ai

Last synced: 28 Nov 2024

https://github.com/maawad/luna

Personal assistant

bot openai personal-assistant whisper

Last synced: 17 Dec 2024

https://github.com/huuquyet/phowhisper-next

Demo using PhoWhisper models of VinAI built with Transformers.js + Next.js

nextjs onnx-models phowhisper speech-recognition transformersjs vietnamese vinai whisper

Last synced: 19 Dec 2024

https://github.com/adamelkholyy/whisper-yt

Toolkit for using Whisper to transcribe YouTube videos. Includes Whisper transcription of YouTube videos, conversion of YouTube video into HuggingFace dataset (using audio and subtitles) and evaluation of Whisper transcription against YouTube subtitles

asr diarization huggingface-datasets pyannote transcription whisper word-error-rate youtube

Last synced: 10 Dec 2024

https://github.com/aspadax/subtitlegenerator

Automatically generate a subtitle for your video.

gpt machine-learning openai rust streamlit subtitles-generator whisper

Last synced: 09 Oct 2024

https://github.com/shtirmann/v2t

Telegram bot which automatically transcribes all voice and video messages to text.

ai aiogram faster-whisper python telegram-bot telegram-bot-python voice-to-text whisper

Last synced: 09 Oct 2024

https://github.com/i4ds/whisper-finetune

This repository contains code for fine-tuning the Whisper speech-to-text model.

fine-tuning nlp speech-to-text whisper

Last synced: 09 Oct 2024

https://github.com/mikeesto/whispercpp-android

An Android app using whisper.cpp to do voice-to-text transcriptions

android kotlin speech-to-text whisper whisper-cpp

Last synced: 17 Dec 2024

https://github.com/brentwong-kiel1997/brents_ai_language_school

Use AI such as ChatGPT and Whisper to learn foreign languages from YouTube videos

ai chatgpt foreign-language openai openai-api whisper whisper-ai youtube

Last synced: 08 Nov 2024

https://github.com/winstxnhdw/capgen

A fast CPU-first video/audio transcriber for generating caption files with Whisper and CTranslate2, hosted on Hugging Face Spaces.

asr automatic-speech-recognition caddy ctranslate2 docker fastapi huggingface huggingface-spaces uvicorn-gunicorn whisper

Last synced: 23 Oct 2024

https://github.com/tylim88/voicefu

Translate Speech Into Japanese

chatgpt speech-synthesis voicevox whisper

Last synced: 18 Dec 2024

https://github.com/baristikir/voice-typing

Simple Desktop Application with Voice Typing features. Runs locally, transcribes locally and works fully offline with support for real-time transcribing. Powered by OpenAI Whisper ASR-models and whisper.cpp inference engine

electron whisper whisper-cpp

Last synced: 24 Dec 2024

https://github.com/marquesafonso/multilang-asr-captioner

A multilingual automatic speech recognition and video captioning tool using faster whisper. Supports real-time translation to english. Runs on consumer grade cpu.

automatic-speech-recognition captioning-videos faster-whisper whisper

Last synced: 24 Oct 2024

https://github.com/natanielf/lecsum

Automatically transcribe and summarize lecture recordings completely on-device using AI.

ollama ollama-python whisper whisper-ai

Last synced: 18 Dec 2024

https://github.com/Op27/meeting_minutes_generator

This Python application automates the process of generating meeting minutes from an audio recording. It uses the Whisper library for transcription and the OpenAI GPT models for summarizing content, then outputs the result in a Word document.

ai audio-processing document-automation meeting-minutes openai python speech-recognition text-summarization transcription whisper

Last synced: 24 Oct 2024

https://github.com/antoniosbarotsis/telegram-transcriber

A Telegram bot for transcribing voice messages

telegram transcribe voice whisper

Last synced: 31 Oct 2024

https://github.com/TranBaVinhSon/eth-decentralized-chat

Decentralized chat app by Ethereum Whisper protocol + Vuejs

ethereum vue vuejs whisper whisper-protocol

Last synced: 24 Oct 2024

https://github.com/egorsmkv/star-adapt-uk

Fork of https://github.com/YUCHEN005/STAR-Adapt with some modifications for Ukrainian.

asr speech-recognition ukrainian whisper

Last synced: 19 Dec 2024

https://github.com/nri12/filter_voice

Dự án lọc và tắt tiếng video những từ khóa mong muốn

python tools whisper

Last synced: 19 Dec 2024

https://github.com/oov/aviutl_subtitler

AviUtl+拡張編集の環境で Whisper による文字起こしをするためのプラグイン

aviutl aviutl-plugin whisper

Last synced: 19 Dec 2024

https://github.com/canaxs/whisper-core

An application where users can make rumor-based news and earn money in return.

mysql panel spring spring-boot whisper

Last synced: 19 Dec 2024

https://github.com/status-im/infra-role-status-go

Ansible role for status-go

ansible-role infra waku whisper

Last synced: 09 Nov 2024

https://github.com/studiowebux/tommygotchi

whisper, piper, llama-gpt, python, fun .. so much fun !

llama-gpt piper python3 whisper whisper-ai

Last synced: 09 Nov 2024

https://github.com/tylim88/Voicefu-back-end

Translate Speech Into Japanese

chatgpt speech-synthesis voicevox whisper

Last synced: 24 Oct 2024

https://gitlab.com/ifrz/asr-multi-lite

Testing of the main ASR frameworks with reduced models for low-resource languages speech recognition

distilhubert wav2vec2 whisper

Last synced: 24 Oct 2024

https://github.com/Franky1/AIAudioTranscriber

A minimalistic web app to generate transciption for audio built using Python

openai python streamlit transcription whisper

Last synced: 24 Oct 2024

https://github.com/userpjm/whisper-youtube

Generate a SubRip subtitle file (srt) using Whisper for the audio of a YouTube video.

faster-whisper openai speech-to-text whisper

Last synced: 24 Oct 2024

https://github.com/arslanex/whisperdemo

A scalable Python module for robust audio transcription using OpenAI's Whisper model. Supports multiple languages, batch processing, and output formats like JSON and SRT.

audio-processing openai openai-whisper python whisper

Last synced: 23 Nov 2024

https://github.com/darienmt/radio-listener

Speech Recognition applied to transcribe amateur radio traffic experiments

python3 radio-amateurs speach-to-text speech-recognition whisper

Last synced: 21 Nov 2024

https://github.com/mottla/speech-to-text

Local and fast speech to text (STT) with speaker recognition. Transcibe your meetings confidentially.

huggingface speech-recognition stt teams transcription translation whisper zoom

Last synced: 21 Nov 2024

https://github.com/xi-rick/captains-log

Captain's Log is your personal AI-powered voice transcription logbook. This innovative web application allows you to transcribe spoken words into text, organize your thoughts, and manage important notes. Built with cutting-edge technology and creative design, Captain's Log sets sail to revolutionize how you capture and manage ideas.

audio-recorder audio-visualizer javascript mongodb mongodb-atlas nextjs once-ui openai react reactjs shadcn-ui tailwindcss typescript voice whisper

Last synced: 21 Nov 2024

https://github.com/mdbecker/whisper_cpp_macos_utils

Automated transcription workflow for macOS: Shell scripts to streamline audio recording, conversion, and transcription using whisper.cpp with macOS utilities like QuickTime Player and BlackHole-2ch.

audio-processing openai shell-scripts speech-to-text transcription whisper whisper-cpp

Last synced: 01 Dec 2024

https://github.com/obay-ismaeel/post-generator

An API that generates social media posts by implementing RAG with Llama-3

ai api fastapi llama llm python retrieval-augmented-generation social-media whisper

Last synced: 12 Oct 2024

https://github.com/kolger/forty-two-transcribe

A Telegram bot that transcribes videos and audio messages to text via OpenAI Whisper API

openai self-hosted telegram whisper

Last synced: 25 Nov 2024

https://github.com/tobybenjaminclark/intermew

👨‍💻 Realistic, generative simulated interviews for Durhack 2024. Built using Webscraping, OpenCV, Deepface, Whisper, OpenAI and Gamemaker.

computer-vision openai-api whisper

Last synced: 25 Nov 2024

https://github.com/teemow/mnote

Generates meeting notes and summaries from video recordings

ai chatgpt google-meet kubeai kubernetes meeting-minutes transcription video-transcription whisper

Last synced: 07 Dec 2024

https://github.com/armaggheddon/whisper2me

whisper2me is a telegram bot written with pyTelegramBotAPI that uses OpenAI's whisper to perform speech2text so you no longer have listen to voice messages 🤫🔇

docker openia pytelegrambotapi python whisper

Last synced: 25 Nov 2024

https://github.com/heng30/vtbox

It is an offline voice to text tool. Using whisper model to transcribe.

rust slint-ui voice2text whisper

Last synced: 21 Nov 2024

https://github.com/willdphan/little-jarvis-whisper

Jarvis, a GPT Voice Assistant made with speech recognition, OpenAI's Whisper, and Gradio

gradio openai voice-assistant voice-recognition whisper

Last synced: 24 Oct 2024

https://github.com/flo-bit/youtube-speaker-separation

simple python script that outputs separate audio files for each speaker in a youtube video, using whisper on replicate

speaker-diarization speech-to-text text-to-speech voice-cloning whisper youtube

Last synced: 19 Dec 2024

https://github.com/iamarunbrahma/smart-voice-assistant

A simple voice assistant to get your queries in speech format and generate answers using ChatGPT API in both text and audio format.

chatgpt tts whisper

Last synced: 07 Dec 2024

https://github.com/hydrol0x/retriever

A new aid for the visually impaired powered by AI

elevenlabs llm palm visual-impairment-aid whisper

Last synced: 14 Nov 2024