Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Whisper

Whisper is an automatic speech recognition (ASR) model developed by OpenAI. It is an encoder-decoder transformer trained on a large corpus of multilingual audio paired with transcripts, and it converts speech into text. Whisper can be used for tasks such as multilingual transcription, speech translation into English, and language identification. OpenAI has released the model code and weights as open source, and the projects listed below build on it for transcription, subtitling, voice assistants, and related applications.

https://github.com/xi-rick/captains-log

Captain's Log is your personal AI-powered voice transcription logbook. This innovative web application allows you to transcribe spoken words into text, organize your thoughts, and manage important notes. Built with cutting-edge technology and creative design, Captain's Log sets sail to revolutionize how you capture and manage ideas.

audio-recorder audio-visualizer javascript mongodb mongodb-atlas nextjs once-ui openai react reactjs shadcn-ui tailwindcss typescript voice whisper

Last synced: 21 Nov 2024

https://github.com/mdbecker/whisper_cpp_macos_utils

Automated transcription workflow for macOS: Shell scripts to streamline audio recording, conversion, and transcription using whisper.cpp with macOS utilities like QuickTime Player and BlackHole-2ch.

audio-processing openai shell-scripts speech-to-text transcription whisper whisper-cpp

Last synced: 01 Dec 2024
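
The conversion step in a workflow like the one above can be sketched in Python. whisper.cpp's README has documented 16 kHz, mono, 16-bit PCM WAV as the expected input, and the ffmpeg flags below follow that. The function name and the `_16k.wav` output naming are hypothetical, chosen only for illustration. This is not taken from the repository's shell scripts.

```python
from pathlib import Path


def whisper_cpp_convert_cmd(src: str) -> list[str]:
    """Build an ffmpeg command that converts `src` to 16 kHz mono 16-bit PCM WAV.

    The output name (<stem>_16k.wav next to the source) is an assumption
    made for this sketch, not a whisper.cpp convention.
    """
    source = Path(src)
    dst = source.with_name(source.stem + "_16k.wav")
    return [
        "ffmpeg",
        "-i", src,            # input recording in any format ffmpeg can read
        "-ar", "16000",       # resample to 16 kHz
        "-ac", "1",           # downmix to mono
        "-c:a", "pcm_s16le",  # encode as 16-bit little-endian PCM
        str(dst),
    ]


if __name__ == "__main__":
    # Print the command instead of running it, so the sketch works without ffmpeg.
    print(" ".join(whisper_cpp_convert_cmd("memo.m4a")))
```

Passing the returned list to `subprocess.run(cmd, check=True)` would perform the conversion on a machine with ffmpeg installed.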

https://github.com/yjg30737/pyqt-simple-whisper-gui

Whisper text-to-speech, speech-to-text example in PyQt5 GUI

openai pyqt pyqt-ai pyqt5 pyqt5-desktop-application pyqt5-examples pyqt5-gui whisper

Last synced: 03 Jan 2025

https://github.com/kolger/forty-two-transcribe

A Telegram bot that transcribes videos and audio messages to text via the OpenAI Whisper API

openai self-hosted telegram whisper

Last synced: 25 Nov 2024

https://github.com/tobybenjaminclark/intermew

👨‍💻 Realistic, generative simulated interviews for Durhack 2024. Built using Webscraping, OpenCV, Deepface, Whisper, OpenAI and Gamemaker.

computer-vision openai-api whisper

Last synced: 25 Nov 2024

https://github.com/teemow/mnote

Generates meeting notes and summaries from video recordings

ai chatgpt google-meet kubeai kubernetes meeting-minutes transcription video-transcription whisper

Last synced: 07 Dec 2024

https://github.com/armaggheddon/whisper2me

whisper2me is a Telegram bot written with pyTelegramBotAPI that uses OpenAI's Whisper to perform speech-to-text, so you no longer have to listen to voice messages 🤫🔇

docker openia pytelegrambotapi python whisper

Last synced: 25 Nov 2024

https://github.com/heng30/vtbox

An offline voice-to-text tool that uses a Whisper model for transcription.

rust slint-ui voice2text whisper

Last synced: 21 Nov 2024

https://github.com/vinayaktalukder17/Youtube-Transcribe-tool-

YouTube transcription tool that uses OpenAI's Whisper

chatgpt chatgpt3 gradio openai python whisper youtube

Last synced: 24 Oct 2024

https://github.com/iamarunbrahma/smart-voice-assistant

A simple voice assistant that takes your queries as speech and generates answers using the ChatGPT API in both text and audio formats.

chatgpt tts whisper

Last synced: 07 Dec 2024

https://github.com/MattCode64/Scriba_Front

SCRIBA is a web application that transcribes audio files. It supports .mp3 files and provides the transcription results in a user-friendly interface.

speech-to-text vite vue vuejs whisper

Last synced: 24 Oct 2024

https://github.com/cnseniorious000/dl-a2t

download, audio-to-text PyPI: https://pypi.org/p/dl-a2t

audio transcription whisper youtube

Last synced: 02 Jan 2025

https://github.com/zahidhasann88/video-summarizer

Summarizes videos by extracting audio and generating summaries based on the audio content.

nodejs openai typescript whisper

Last synced: 07 Jan 2025

https://github.com/notyusheng/transcribe-translate_kubernetes

Local web app for transcription and translation services for audio and video using Whisper models

docker full-stack k8s kubernetes nodejs react reactjs self-hosted speech-to-text transcribe translate whisper

Last synced: 22 Nov 2024

https://github.com/pratikpakhale/terravis

Voice guided GIS system

genai gis lam llm voice whisper

Last synced: 09 Oct 2024

https://github.com/webmural/rewind

rewind mural

mural whisper wind

Last synced: 01 Dec 2024

https://github.com/deepbiolab/customer-complaint-classification

A GenAI-powered pipeline leveraging Whisper, DALL-E, and GPT to transform customer complaints into actionable insights with automated transcription, visualization, and classification.

azure dalle gpt whisper

Last synced: 23 Nov 2024

https://github.com/mrbuslov/reminder_4u_bot

AI Telegram Bot Reminder. You send a free-form text OR voice reminder, the AI bot records it and reminds you at the right time!

ai ai-bot aiogram chatgpt django gpt-3 gpt-4 gpt-models python reminder telegram-bot voice-recognition whisper

Last synced: 12 Nov 2024

https://github.com/ty-martz/audiologic

Python Module to process and predict on music attributes

machine-learning music python whisper

Last synced: 24 Oct 2024

https://github.com/willdphan/little-jarvis-whisper

Jarvis, a GPT Voice Assistant made with speech recognition, OpenAI's Whisper, and Gradio

gradio openai voice-assistant voice-recognition whisper

Last synced: 24 Oct 2024

https://github.com/firefly55lm/bisbigliatorev2

Automatic audio transcriber notebook based on Whisper

colab-notebook speech-to-text whisper

Last synced: 25 Nov 2024

https://github.com/userpjm/whisper-youtube

Generate a SubRip subtitle file (srt) using Whisper for the audio of a YouTube video.

faster-whisper openai speech-to-text whisper

Last synced: 24 Oct 2024
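
As an illustration of the SubRip step, here is a minimal sketch that renders timed segments as `.srt` text. It assumes each segment is a dict with `start` and `end` in seconds plus a `text` field, which is the shape Whisper-style transcribers commonly return. The function names are hypothetical and this is not the repository's code.

```python
def srt_timestamp(seconds: float) -> str:
    """Format a time in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments) -> str:
    """Render segments as SubRip cues: index, time range, text, blank line."""
    blocks = []
    for index, seg in enumerate(segments, start=1):
        blocks.append(
            f"{index}\n"
            f"{srt_timestamp(seg['start'])} --> {srt_timestamp(seg['end'])}\n"
            f"{seg['text'].strip()}\n"
        )
    # Joining with "\n" leaves one blank line between consecutive cues.
    return "\n".join(blocks)


if __name__ == "__main__":
    demo = [
        {"start": 0.0, "end": 2.5, "text": " Hello "},
        {"start": 2.5, "end": 4.0, "text": "World"},
    ]
    print(segments_to_srt(demo))
```

SRT uses a comma before the milliseconds, and cue numbering starts at 1.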

https://gitlab.com/ifrz/asr-multi-lite

Testing of the main ASR frameworks with reduced models for low-resource languages speech recognition

distilhubert wav2vec2 whisper

Last synced: 24 Oct 2024

https://github.com/luizcalaca/transcricao-medica

Full Stack + Whisper Transcription + Node.js REST API + VITE + React.js + Railway deploy

full-stack nodejs openai openai-api railway reactjs sequelize sequelize-orm vite whisper whisper-ai

Last synced: 25 Nov 2024

https://github.com/tylim88/Voicefu-back-end

Translate Speech Into Japanese

chatgpt speech-synthesis voicevox whisper

Last synced: 24 Oct 2024

https://github.com/evil0ctal/whisper-speech-to-text-api

An open source Speech-to-Text API. The project is based on OpenAI's Whisper model and uses the asynchronous features of FastAPI to efficiently wrap it and support more custom functions.

ai api fastapi openai-whisper speech-to-text speech-to-text-api whisper whisper-ai whisper-api

Last synced: 25 Oct 2024

https://github.com/rudrodip/kittyscribe

microservice for transcribing audio/video files to text and transcoding video

docker ffmpeg python whisper

Last synced: 01 Dec 2024

https://github.com/dheison0/subcreator

A subtitle creator, translator, and embedder tool made using AI

ai machine-learning ml python subtitles video-processing whisper

Last synced: 09 Oct 2024

https://github.com/simongino/whisper-fastapi

A FastAPI-based application integrating Whisper for efficient speech recognition and processing.

ai docker fastapi python whisper

Last synced: 09 Oct 2024

https://github.com/vlazic/json-verbose-to-vtt-converter

Transform `json_verbose` transcriptions from OpenAI, Groq, or command-line tools into VTT files with this Deno converter.

converter groq json json-verbose openai vtt webvtt whisper

Last synced: 26 Nov 2024
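
The kind of conversion described above can be sketched in Python (the listed project itself uses Deno). The sketch assumes the verbose transcription JSON has a top-level `segments` list whose items carry `start`, `end` (seconds), and `text`, as in OpenAI-style verbose output. Function names are hypothetical, and other fields in the payload are ignored.

```python
import json


def vtt_timestamp(seconds: float) -> str:
    """Format a time in seconds as a WebVTT timestamp (HH:MM:SS.mmm)."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}.{ms:03d}"


def verbose_json_to_vtt(payload: str) -> str:
    """Convert a verbose transcription JSON string into WebVTT text."""
    data = json.loads(payload)
    lines = ["WEBVTT", ""]  # required header followed by a blank line
    for seg in data.get("segments", []):
        lines.append(f"{vtt_timestamp(seg['start'])} --> {vtt_timestamp(seg['end'])}")
        lines.append(seg["text"].strip())
        lines.append("")  # a blank line terminates each cue
    return "\n".join(lines)


if __name__ == "__main__":
    sample = json.dumps(
        {"text": "Hi there", "segments": [{"start": 0.0, "end": 1.25, "text": " Hi there"}]}
    )
    print(verbose_json_to_vtt(sample))
```

Unlike SRT, WebVTT starts with a `WEBVTT` header line and uses a period before the milliseconds.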

https://github.com/jfgonsalves/scribe

Self-hosted Ollama + Whisper powered AI medical scribe.

medical ollama rag scribe whisper

Last synced: 26 Nov 2024

https://github.com/josemarcosrf/Lexicap-QA

QA retrieval for Lex Fridman's podcast transcriptions

lexicap qa search whisper

Last synced: 24 Oct 2024

https://github.com/ivanrj7j/transcription

This project transcribes audio using whisper and provides an api

ai api flask transcription whisper

Last synced: 09 Oct 2024

https://github.com/deshwalmahesh/whisper-fastapi-realtime

A frontend + backend app that uses openai/whisper-large-v3-turbo on consumer-grade hardware to provide live audio transcription

audio-transcription fastapi huggingface live pyaudio realtime transcription transformers whisper whisper-large

Last synced: 25 Oct 2024

https://github.com/valkryst/whisper_automations

Various scripts for automating tasks using OpenAI's Whisper.

automation openai subtitle subtitle-generator transcription translation whisper

Last synced: 26 Dec 2024

https://github.com/ifeech/subtitler

Creating subtitles from video

subtitles whisper

Last synced: 09 Oct 2024

https://github.com/eva-kaushik/multilingual-transcription-with-openai_whisper

Whisper Automatic Speech Recognition (ASR) Model

openai openai-api transcription webapp whisper

Last synced: 22 Dec 2024

https://github.com/microsoft/azure-ai-foundry-whatsapp-bot

WhatsApp Bot built with Azure Functions and Azure AI Foundry, using Python.

azure-ai-foundry azure-functions azure-openai python whatsapp-api whatsapp-bot whisper

Last synced: 27 Nov 2024

https://github.com/nelzomal/videolens_ai

VideoLens AI is a powerful Chrome extension that enhances your YouTube viewing experience

ai chrome-ai gemini-nano transformers whisper wxt

Last synced: 02 Dec 2024

https://github.com/chloelavrat/speech-to-text-app

Speech-to-text web app based on Streamlit and Whisper that extracts a transcript from audio files or YouTube videos.

audio-processing machine-learning machinelearning speech-to-text streamlit streamlit-webapp stt whisper whisper-ai

Last synced: 02 Jan 2025

https://github.com/tomdewildt/whisper-experiment

Experiments using the Whisper model from OpenAI

colab jupyter python transcribe transformers translate whisper

Last synced: 27 Dec 2024

https://github.com/xawos/owt

🦙🗣️ Ollama and Whisper Telegram bot, with advanced configuration

ai-bots local-ai ollama telegram-aichatbot telegram-bots whisper

Last synced: 08 Jan 2025

https://github.com/LarissaGuder/whisper-datastream

Transcription and NER in streaming environment

bert-ner python spark-streaming whisper

Last synced: 24 Oct 2024

https://github.com/ashot72/answering-questions-about-images

You can upload images, ask questions about images using voice prompts, then listen to the responses in voice

answering-questions blip-2-ai-model gtts large-language-models llm replicate speech-to-text text-to-speech whisper

Last synced: 30 Dec 2024

https://github.com/msrsaditya/speech2speech

A Personal Digital Assistant designed to help you with quick responses.

ollama openai phi3 sox tts whisper

Last synced: 28 Nov 2024

https://github.com/pawelzeja098/whisper-video-transcription

Testing OpenAI Whisper to transcribe videos

mp4 transcription whisper whisper-ai

Last synced: 28 Nov 2024

https://github.com/yuxiang32/Audio-Transcription

Audio transcriber using OpenAI Whisper

openai whisper

Last synced: 24 Oct 2024

https://github.com/concaption/containerized-transcription-api

Containerized Transcription API using Whisper Model and FastAPI

docker fastapi openai transcription whisper

Last synced: 16 Dec 2024

https://github.com/zdwolfe/transcription-tools

Docker video transcriber, wrapper around OpenAI

openai transcription whisper whisper-ai

Last synced: 02 Jan 2025

https://github.com/breadrock1/audio-to-text

A simple backend project that uses whisper-rs.

actix-web audio-to-text rust swagger-ui whisper

Last synced: 11 Nov 2024

https://github.com/devgeekm/chat-it-up

Chat It Up! elevates conversations by transforming YouTube URLs, documents, and audio into text, enabling interactive Q&A and summaries. With one click, turn media into time-saving, knowledge-rich dialogues.

ai azure azure-functions azureservices blob-storage fastapi python rag whisper youtube-dl

Last synced: 20 Dec 2024

https://github.com/seanvelasco/ai

Cloudflare AI challenge submission: Slater - your virtual foreign language friend

ai artificial-intelligence language-learning llama2 llm m2m100 machine-learning whisper

Last synced: 09 Dec 2024

https://github.com/aixerum/faster-whisper

faster-whisper is a reimplementation of OpenAI's Whisper model using CTranslate2, which is a fast inference engine for Transformer models. This implementation is up to 4 times faster than openai/whisper for the same accuracy while using less memory. The efficiency can be further improved with 8-bit quantization on both CPU and GPU.

ctranslate2 gpu transcription whisper

Last synced: 07 Jan 2025

https://github.com/velocitatem/dontlectureme

A program that pays attention to your lectures for you.

ai lectures university whisper

Last synced: 03 Dec 2024

https://github.com/egorsmkv/optimized-whisper-intel

Run quantized Whisper models only on CPU with Intel hardware

intel onnx onnxruntime quantized-neural-networks whisper

Last synced: 19 Dec 2024

https://github.com/jalvarezz13/summarai

SummarAI utilizes PyMovie and Whisper to transcribe videos, enabling you to ask questions about the content using Llama2 and Llama-index for insightful interaction.

llama-index llama2 pymovie whisper

Last synced: 22 Dec 2024

https://github.com/flaviodelgrosso/whisper-transcriber

Use OpenAI's Whisper to transcribe audio files and diarize the speakers in the transcribed text

ai audio-to-text diarization openai torch whisper

Last synced: 19 Dec 2024

https://github.com/huuquyet/phowhisper-small

Converted clone of PhoWhisper: Automatic Speech Recognition for Vietnamese (2024)

onnx-models phowhisper speech-recognition transformersjs vietnamese vinai whisper

Last synced: 06 Dec 2024

https://github.com/zuplyx/subtitle-creator

Add English subtitles to videos using openai/whisper-large-v3

open-ai poetry-python python3 subtitles-generator whisper

Last synced: 09 Dec 2024

https://github.com/danibcorr/university-helper

🧑‍🎓 University Helper streamlines academic and administrative tasks for students, educators, and researchers. It provides tools for managing document metadata, converting PDFs to Markdown, transcribing audio, analyzing grade statistics, and more.

deep-learning documentation-tool metadata ocr open-source pdf python statistics university whisper

Last synced: 19 Dec 2024

https://github.com/brunogaliati/speech2text-investments

This project automates the download, transcription, and summarization of audio from YouTube videos. Using OpenAI's Whisper model, it converts video content into concise text summaries with an investment analyst's perspective, ideal for professionals needing quick insights.

chatgpt investment openai politics python speech-recognition speech-to-text whisper

Last synced: 19 Dec 2024

https://github.com/barrylee111/voicechat-llm

A chatbot with both prompt and voicechat capabilities leveraging LangChain, Elasticsearch, and FastAPI. When using voicechat, the user can immerse themselves in the experience by selecting a narrator, like a pirate for instance.

elasticsearch fastapi langchain largelanguagemodel python react speech-to-text tailwind text-to-speech typescript websocket whisper

Last synced: 19 Dec 2024

https://github.com/thealphamerc/audio-to-text

Transcribe multi-lingual audio clips using whisper model

openai whisper

Last synced: 16 Dec 2024

https://github.com/ajxv/rtstt

Real-time speech-to-text transcription using OpenAI Whisper

live-transcription openai openai-whisper python3 transcription whisper

Last synced: 22 Dec 2024

https://github.com/bluebirdback/groq-subtitles

Batch video subtitle generation using Groq Whisper API

groq speech-to-text subtitles video whisper

Last synced: 21 Dec 2024

https://github.com/lifeosm/whisper

🐳 Docker image with OpenAI Whisper.

docker octolab speech-to-text whisper

Last synced: 24 Oct 2024

https://github.com/arkaniightt/web_app_transcriptor_openai

Automatic audio-to-text transcription tool built with Streamlit and OpenAI, with support for microphone input, video, and audio file uploads.

ai app openai python streamlit tool tools transcript transcription webapp whisper

Last synced: 12 Dec 2024

https://github.com/khushijtrivedi/speech

The Assistive Speech Technology System is designed to enhance communication by analyzing and processing various speech and audio inputs.

ajax bigru-crf bootstrap flask flask-server html-css-javascript librosa python restapi-framework voice-recognition whisper

Last synced: 09 Oct 2024

https://github.com/evilfreelancer/whisper-tests

Collection of experiments on OpenAI Whisper models

api-server docker-compose testing transcription whisper

Last synced: 17 Dec 2024

https://github.com/obay-ismaeel/post-generator

An API that generates social media posts by implementing RAG with Llama-3

ai api fastapi llama llm python retrieval-augmented-generation social-media whisper

Last synced: 12 Oct 2024

https://github.com/flo-bit/youtube-speaker-separation

simple python script that outputs separate audio files for each speaker in a youtube video, using whisper on replicate

speaker-diarization speech-to-text text-to-speech voice-cloning whisper youtube

Last synced: 19 Dec 2024

https://github.com/Franky1/AIAudioTranscriber

A minimalistic web app, built using Python, that generates transcriptions for audio

openai python streamlit transcription whisper

Last synced: 24 Oct 2024

https://github.com/s-emanuilov/whispercpp_kit

A wrapper on whisper.cpp with additional helper features like model management capabilities.

asr whisper

Last synced: 13 Dec 2024

https://github.com/crucials/twaddle

Speech analysis app that collects statistics such as word frequencies alongside the transcribed text

ai audio python python-eel speech-to-text vue whisper

Last synced: 24 Oct 2024

https://github.com/theaussiepom/wyoming-openai

OpenAI STT and TTS support for the Wyoming protocol

home-assistant home-assistant-assist openai sst tts whisper wyoming

Last synced: 21 Dec 2024

https://github.com/jgw96/speech-to-text-web-toolkit

Making Speech-To-Text on the web easy, both local and in the cloud

ai lit transformersjs webcomponents whisper

Last synced: 06 Dec 2024

https://github.com/javi-cc/python-openai-generator-srt

An offline Python application that transcribes and translates audio or video files to generate a subtitle file (.srt), using deep learning libraries such as openai-whisper and argos-translate.

argos-translate docker docker-compose dockerfile offline openai openai-whisper python whisper

Last synced: 18 Dec 2024

https://github.com/hanpham32/react-native-whisper

A simple text transcription web/mobile app

flask ngrok react-native transcribe whisper

Last synced: 24 Dec 2024

https://github.com/tylim88/voicefu-back-end

Translate Speech Into Japanese

chatgpt speech-synthesis voicevox whisper

Last synced: 18 Dec 2024

https://github.com/saamerm/whisperkit-ios15

iOS 15 - On-device Inference of Whisper Speech Recognition Models for Apple Silicon

ios ios15 swiftui whisper whisper-ai

Last synced: 26 Sep 2024

https://github.com/arkapravo-ghosh/speech-to-text

Speech to Text Transcription using OpenAI Whisper v3 and FastAPI

ai fastapi huggingface machine-learning openai python3 speech-to-text transformers whisper

Last synced: 21 Dec 2024

https://github.com/nicknaskida/cog-whisper-diarization

Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote

diarization openai-whisper pyannote replicate speaker-diarization whisper whisper-faster whisperx

Last synced: 27 Sep 2024

https://github.com/miosipof/whisper_inference

OpenAI Whisper ASR inference on CPU with OpenVino, PyTorch or Huggingface

asr inference machine-learning openvino pytorch whisper

Last synced: 07 Jan 2025

https://github.com/bloodworks-io/phlox

Self-hosted Ollama + Whisper powered AI medical scribe.

medical ollama rag scribe whisper

Last synced: 18 Dec 2024

https://github.com/malexandersalazar/casey

Casey is a Voice-Activated AI Companion for Mental Wellbeing & Content Creation #BuildWithAI

agentic-ai content-creation groq large-language-models python wellbeing whisper

Last synced: 18 Dec 2024

https://github.com/same-ou/whisper-speech-recognition

This repository contains a deployment of the Whisper speech recognition model using Flask and Python. Whisper is a cutting-edge speech recognition model designed to accurately transcribe speech input into text.

deep-learning flask machine-learning openai python pytorch whisper

Last synced: 01 Jan 2025