Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.


Whisper

Whisper is an automatic speech recognition (ASR) model developed by OpenAI. It is an encoder-decoder transformer trained on roughly 680,000 hours of multilingual, multitask audio collected from the web. Whisper can transcribe speech in many languages, translate speech into English, and identify the spoken language. OpenAI released the code and model weights as open source in September 2022, so the model is publicly available and can be run locally or accessed through the OpenAI API.

https://github.com/thewh1teagle/pyannote-rs

pyannote audio diarization in rust

asr diarization onnxruntime rust speech-recognition whisper

Last synced: 09 Oct 2024

https://github.com/thinh-vu/ur_audio_sub

Generate text captions for audio files and YouTube videos using OpenAI Whisper on Google Colab. Supports multiple languages.

audio-to-text audio-transcription caption-generator speech-recognition whisper

Last synced: 07 Nov 2024

https://github.com/gha3mi/foropenai

ForOpenAI - A Fortran library for OpenAI API.

api chatgpt dall-e fortran fortran-package-manager gpt openai openai-api whisper

Last synced: 12 Dec 2024

https://github.com/CoHuK/gpt-telegram-bot

GPT/Whisper/DALL-E Telegram Bot with easy deployment using Chalice to AWS Lambda

aws-lambda dall-e gpt gpt-35-turbo telegram telegram-bot whisper

Last synced: 24 Oct 2024

https://github.com/neka-nat/mylangrobot

Language instructions to mycobot using GPT-4V

chatgpt gpt-4-vision gpt-4-vision-preview gpt4v mycobot segment-anything whisper

Last synced: 14 Oct 2024

https://github.com/sloganking/desk-talk

A desktop transcription software

desktop dictation transcription whisper

Last synced: 13 Nov 2024

https://github.com/santima10/resumico

🤖 A WhatsApp bot to transcribe and summarize audio messages.

google-cloud-platform gpt-3 openai speech-to-text whatsapp-api whatsapp-bot whisper

Last synced: 07 Nov 2024

https://github.com/XMuli/ThinkyMatePages

A simple, easy-to-use desktop application for ChatGPT & AI that will support Windows, macOS, and Linux. | A clean, easy-to-use cross-platform client for ChatGPT/Spark large model & AI.

chatgpt cross-platform linux macos openai qt whisper

Last synced: 08 Nov 2024

https://github.com/mazzz1y/matrix-gpt

Chat with ChatGPT directly in your Matrix client

chatgpt dall-e matrix openai synapse whisper

Last synced: 19 Oct 2024

https://github.com/xuegao-tzx/whisper_flutter_new

A Flutter library for offline speech-to-text conversion that uses the whisper.cpp model implementation, for Android, iOS, and macOS.

android flutter ios whisper whisper-cpp

Last synced: 09 Oct 2024

https://github.com/pulijon/sttcast

Transcription from mp3 files to html with or without embedded player

ansible automation aws-ec2 aws-s3 g4dn gpu ia iac puppet python terraform transcription vagrant vosk-engine whisper

Last synced: 14 Oct 2024

https://github.com/AppleHolic/chatgpt-streamlit

Simple demo project with OpenAI's API and TTS

chatgpt openai streamlit tts whisper

Last synced: 24 Oct 2024

https://github.com/m0rf30/shisper

A quick & dirty script to generate and view subtitles and transcriptions for your multimedia files using ggerganov/whisper.cpp

asr bash shisper whisper whispercpp

Last synced: 06 Dec 2024

https://github.com/machinelearningzh/audio-transcription

Transcribe any audio or video file. Edit and view your transcripts in a standalone HTML editor.

audio-transcription machine-learning whisper

Last synced: 02 Nov 2024

https://github.com/ieasybooks/almufarrigh

The graphical interface for the Tafrigh tool on various operating systems

ai audio-processing desktop linux macos python qt subtitles video-processing whisper windows wit

Last synced: 08 Nov 2024

https://github.com/xmuli/thinkymatepages

A simple, easy-to-use desktop application for ChatGPT & AI that will support Windows, macOS, and Linux. | A clean, easy-to-use cross-platform client for ChatGPT/Spark large model & AI.

chatgpt cross-platform linux macos openai qt whisper

Last synced: 25 Nov 2024

https://github.com/miclast/FreePBX-Call-intrusion

Intrusion. Custom Asterisk dial plan for listen, whisper and barge in calls. For Asterisk FreePBX, Issabel, Asterisk based Elastix call centers.

asterisk barge call callcenter intrusion monitoring whisper

Last synced: 24 Oct 2024

https://github.com/ssciwr/vink

A stand-alone application with GUI for OpenAI's Whisper

gui hacktoberfest iwr-hacktoberfest openai pyinstaller speech-to-text transcription whisper whisper-ai

Last synced: 09 Nov 2024

https://github.com/eryk-mazus/sigh

Seamless Voice Interactions with LLMs

llm speech-recognition speech-to-text voice-recognition whisper

Last synced: 24 Oct 2024

https://github.com/gumblex/whisper_vad

Whisper.cpp Speech-to-text with Voice Activity Detection

speech-to-text whisper whisper-cpp

Last synced: 06 Nov 2024

https://github.com/nicolodiamante/notefy

Streamline your note-taking with ChatGPT's AI expertise and Whisper's precise transcription, enabling fast and efficient summarising.

ai-powered apple-notes apple-shortcuts chatgpt chatgpt-api gpt-4 gpt-4-turbo gpt-4o gpt-4o-mini gpt35turbo notes openai openai-api openai-chatgpt openai-whisper siri summarization summary whisper whisper-ai

Last synced: 20 Nov 2024

https://github.com/jjwroeloffs/transcribe_align_textgrid

A small wrapper package around whisper-timestamped. Create force-aligned transcription TextGrids from raw audio!

force-alignment praat speech-recognition speech-to-text textgrid whisper

Last synced: 01 Nov 2024

https://github.com/RoyNkem/SwiftUI-AI-Voice-Assistant

A multi-platform app for voice-based interactions built using SwiftUI with advanced AI capabilities.

gpt-4 ios macos mvvm openai-api swiftui text-to-speech visionos whisper

Last synced: 23 Oct 2024

https://github.com/SanHacks/AiGen

Multi Model Personal Assistant Wrapper in Go: Interact with ChatGPT, Claude or Ollama Cross Platform (Speech & Image generation supported)

chatbot gpt3-turbo openai speech-recognition speech-synthesis speech-to-text text-to-speech tts voice whisper

Last synced: 15 Nov 2024

https://github.com/lazauk/aoai-whisper-gradio

Demos of Whisper model's functionality in Gradio-powered minimalistic Web apps: offline, using Azure OpenAI and Azure AI Speech.

ai azure gradio openai whisper

Last synced: 13 Nov 2024

https://github.com/stayallive/whisper-subtitles

Generate subtitles (.srt and .vtt) from audio files using OpenAI's Whisper models.

cog replicate whisper

Last synced: 24 Oct 2024

https://github.com/mribeirodantas/nf-whisper

Proof-of-concept Nextflow pipeline to interact with OpenAI Whisper

docker nextflow pipeline speech-to-text transcription whisper

Last synced: 15 Oct 2024

https://github.com/t0mer/wassist

Wassist allows you to contact GPT3 directly from WhatsApp and not only that. Wassist also allows you to save your own personal data and later search and retrieve it using GPT3 to generate a response. In the examples folder, you can see several examples of how to use this bot so you don't have to remember anything ever again.

dall-e docker personal-assistant python weather whatsapp whisper

Last synced: 15 Oct 2024

https://github.com/YvesCheung/Whisper

A set of annotations for code inspection

android annotation inspect lint whisper

Last synced: 24 Oct 2024

https://github.com/abus-aikorea/studio-free

youtube download, vocal remover, vocal extraction, karaoke video production, STT, automatic speech recognition, transcription, automatic subtitle, AI, yt-dlp, demucs, whisper, webui, gradio, windows

ai automatic-speech-recognition automatic-subtitle demucs gradio karaoke openai stt transcription video-download vocal-remover webui whisper windows yt-dlp

Last synced: 10 Nov 2024

https://github.com/decryptu/decryptgpt

A multifaceted ChatGPT Discord bot that harnesses discord.js, OpenAI's GPT-4o model, Whisper to understand voice messages, and Dall-E for image generation — engage in smart conversations, get voice messages transcribed, and have images analyzed directly within your Discord community.

chatgpt dall-e dalle discord discord-bot discord-js discordjs gpt gpt-3 gpt-4 nodejs openai whisper

Last synced: 12 Dec 2024

https://github.com/devanshu-17/transcriptiq

TranscriptIQ is a project that lets users transcribe YouTube videos, perform various NLP (Natural Language Processing) tasks on the transcribed text, chat with a YouTube video, and more.

clarifai-python cohere streamlit whisper

Last synced: 22 Dec 2024

https://github.com/mj23978/openserver

Open Server is an OpenAI API Compatible Server for generating text, images, embeddings, and storing them in vector databases. It also includes a chat functionality.

autogen g4f image-generation langchain litellm llamacpp llm llmops openai stable vector-database whisper

Last synced: 14 Dec 2024

https://github.com/redocrepus/ahk-whisper-paste

Allows dictating anywhere in Windows using AutoHotKey and OpenAI's Whisper speech-to-text engine.

dictation openai openai-api text-to-speech voice-typing whisper whisper-ai windows

Last synced: 24 Oct 2024

https://github.com/status-im/status-js-api

Status Javascript Client (WIP)

ethereum javascript shh status-im web3 web3js whisper

Last synced: 01 Nov 2024

https://github.com/sanhacks/aigen

Multi Model Personal Assistant Wrapper in Go: Interact with ChatGPT, Claude or Ollama Cross Platform (Speech & Image generation supported)

chatbot gpt3-turbo openai speech-recognition speech-synthesis speech-to-text text-to-speech tts voice whisper

Last synced: 09 Oct 2024

https://github.com/sepiropht/auto-subtitle

Automatic subtitles in your videos

ffmpeg openai subtitles subtitles-generator whisper

Last synced: 09 Oct 2024

https://github.com/cp3249/splaa

SPLAA is an AI assistant framework that utilizes voice recognition, text-to-speech, and tool-calling capabilities to provide a conversational and interactive experience. It uses LLMs available through Ollama and has capabilities for extending functionalities through a modular tool system.

coqui-tts llm ollama whisper

Last synced: 03 Dec 2024

https://github.com/hay/audio2text

Python command line utility wrappers for Whispercpp and other speech-to-text utilities

speech-recognition speech-to-text stt whisper whisper-cpp

Last synced: 15 Oct 2024

https://github.com/bhattbhavesh91/whisper-youtube

This repository guides you through automatically generating YouTube transcriptions using OpenAI's Whisper

automatic-speech-recognition ffmpeg openai openai-gym python pytube subtitles whisper youtube youtube-dl

Last synced: 16 Nov 2024

https://github.com/hisano/openai-whisper-on-docker

OpenAI Whisper on Docker

docker openai whisper

Last synced: 10 Nov 2024

https://github.com/luquedaniel/whisper2subs

A CLI tool that transcribes audio using openai-whisper and translates it using DeepL.

audio cli deepl subtitle transcribe translate video weekend-project whisper

Last synced: 11 Oct 2024

https://github.com/kurianbenoy/malayalam_asr_benchmarking

A study to benchmark whisper based ASRs in Malayalam

asr benchmarking speech transformers-library whisper

Last synced: 14 Oct 2024

https://github.com/jim60105/aichatassistant

Stream YouTube live to OpenAI, get AI-generated summaries and real-time reply options. (Chrome Extension)

chrome-extension openai typescript whisper youtube

Last synced: 23 Oct 2024

https://github.com/evilfreelancer/docker-whisper-server

whisper.cpp HTTP transcription server with OpenAI-like API in Docker

api api-server asr cuda docker docker-compose dockerfile nvidia openai openai-api whisper whisper-cpp

Last synced: 09 Oct 2024

https://github.com/voidful/whisper-live-asr-demo

run whisper on CPU/GPU server

asr livestream whisper

Last synced: 24 Oct 2024

https://github.com/byigitt/transcriptor

create transcripts with youtube links on google colab using whisper ai

python python3 transcript transcription whisper whisper-ai

Last synced: 22 Dec 2024

https://github.com/redocrepus/Whisper-Paste

Chrome extension that allows dictating anywhere using OpenAI Whisper

chrome-extension dictation openai openai-api text-to-speech voice-recognition voice-typing whisper whisper-ai

Last synced: 24 Oct 2024

https://github.com/nik-kras/live_asr_whisper_gradio

Real Time Speech To Text with corrections powered by Gradio

asr faster-whisper gradio speech-to-text whisper

Last synced: 23 Nov 2024

https://github.com/neka-nat/stenocaptioner

CLI tool for automatic subtitling using whisper.

python subtitles subtitles-generator whisper

Last synced: 14 Oct 2024

https://github.com/legendsort/openAISpeechToDatabase

AI automation to save formatted text with proper title from speech

automation chatgpt dropbox notion openai whisper zapier

Last synced: 22 Nov 2024

https://github.com/egorsmkv/optimized-whisper

Use quantized versions of Whisper to speed up inference

faster-whisper hqq quantization whisper

Last synced: 18 Oct 2024

https://github.com/qqxufo/whisper-nodejs

whisper-nodejs is an npm package for using OpenAI's Whisper API to transcribe and translate audio. With whisper-nodejs, you can easily convert audio files into text and translate them into English or other supported languages.

nodejs openai whisper whisper-nodejs

Last synced: 13 Nov 2024

https://github.com/hiradary/simplewhisper

A simple speech-to-text transcription interface using OpenAI's Whisper API.

openai speech-to-text whisper whisper-ai

Last synced: 09 Oct 2024

https://github.com/mbotsu/mlx_speech2text

Audio transcription using mlx whisper and vad silence processing

mlx silero-vad whisper

Last synced: 09 Oct 2024

https://github.com/oddlama/whisper-overlay

A wayland overlay providing speech-to-text functionality for any application via a global push-to-talk hotkey

faster-whisper hyprland realtime speech-recognition speech-to-text wayland whisper wlroots

Last synced: 09 Oct 2024

https://github.com/niawjunior/vision-speak

CameraVision: Capture, Analyze - Seamlessly integrate image analysis using GPT-4 Vision API and convert text to speech with Whisper AI

camera gpt-4-vision whisper

Last synced: 02 Dec 2024

https://github.com/SrinadhVura/OpenAI-Stack-Hack

Medifix is an AI-powered assistant built on gpt-3.5 turbo (ChatGPT). It is designed to help people by suggesting preventive measures based on the symptoms they mention.

chatgpt gtts streamlit whisper

Last synced: 24 Oct 2024

https://github.com/hoangv97/ai-chatbot

Integrate ChatGPT, Dall-E, Whisper and other AI models in Replicate into Messenger and Telegram bot

bottender chatbot chatgpt dall-e2 messenger-bot replicate telegram-bot typescript whisper

Last synced: 24 Oct 2024

https://github.com/chetanxpro/autosub

Automatically generate and overlay subtitles for any video.

ai ffmpeg nodejs-whisper openai-whisper subtitles subtitles-generator whisper

Last synced: 15 Nov 2024

https://github.com/mharrvic/redhorse-ai-transcriber

Audio transcriber using Openai whisper ML deployed to Banana.dev

banana openai whisper

Last synced: 15 Nov 2024

https://github.com/openvoiceos/ovos-docker-stt

Open Voice OS Speech-to-Text (STT) container images and docker-compose.yml file for x86_64 CPU architecture.

fasterwhisper openvoiceos ovos speech-to-text stt whisper

Last synced: 19 Nov 2024

https://github.com/moebiussurfing/ofxsurfingtextsubtitle

Draws subtitles from an .SRT (or plain text) into a formatted styled paragraph with fading opacity and more.

openframeworks openframeworks-addon whisper whisper-cpp

Last synced: 27 Oct 2024

https://github.com/olololoe110399/mikasa_gpt

🚀 MiksaGPT, part of the 'Miksa' project, is a groundbreaking voice assistant utilizing Claude 3 and APIs from 'anthropic' and 'elevenlabs'. It enables real-time Opus two-way voice chat with seamless interruptibility, built with Flutter and available for free on GitHub.

aivoice artificialintelligence claude claudeai elevenlabs flutterai flutterprogramming flutterprojects openai opensource opensourceai opus speechtotext whisper

Last synced: 22 Dec 2024

https://github.com/gabrielrf/voice2text

Automatic transcription of voice messages in private Telegram chats

automation openai openai-whisper pyrogram telegram transcription whisper

Last synced: 13 Dec 2024

https://github.com/jxxe/murmur

A proof-of-concept transcription app

journalism mac macos transcribe transcription whisper

Last synced: 24 Oct 2024

https://github.com/princejoogie/chunktube

It's YouTube.. but text!

gpt-3 openai react typescript whisper

Last synced: 09 Nov 2024

https://github.com/ebowwa/llm_telecenter

A fastapi wrapper of babca / python-gsmmodem for a waveshare sim7600x. Not an exact copy of the 'python-gsmmodem' so be sure to uninstall that lib or venv to run | Open-source Twilio with LLM batteries

agentgpt deepgram elevenlabs elevenlabs-api gsm gsm-modem gsm-module langchain langchain-python llama2 llamacpp mistral-7b mistralai oai openai openai-api pyserial raspberry-pi salesgpt whisper

Last synced: 29 Nov 2024

https://github.com/BatuhanYilmaz26/Youtube-Transcriber

Input a YouTube video link and get a transcription as a .txt, .vtt or .srt file.

automatic-speech-recognition huggingface openai python speech-recognition streamlit whisper

Last synced: 24 Oct 2024

https://github.com/makaveli10/whisper-tflite

openai/whisper in TFLite

tflite whisper

Last synced: 27 Oct 2024

https://github.com/royceschultz/ComfyUI-TranscriptionTools

ComfyUI nodes for transcription on audio or video input.

comfyui comfyui-nodes openai-whisper transcription whisper

Last synced: 19 Dec 2024

https://github.com/rakshans1/ex-whisper

Elixir speech to text demo

bumblebee elixir nx whisper

Last synced: 27 Oct 2024

https://github.com/ignabelitzky/easy-subber

A Python-based tool that takes video files and generates .srt subtitle files using Whisper for speech recognition, FFmpeg for audio processing, and a simple Tkinter GUI

ffmpeg gui python speech-recognition srt subtitles tkinter transcription video-processing whisper

Last synced: 22 Oct 2024

https://github.com/doctorpok42/subtitle

Create subtitles for your video and translate them in a few clicks

ai ffmpeg groq material-ui multer nextjs openai sass ts whisper

Last synced: 11 Nov 2024

https://github.com/easygithdev/phpopenai

PHPOpenAI is a wrapper to access OpenAI API

ai api curl dall-e dalle gpt-3 gpt-35-turbo openai php whisper

Last synced: 17 Nov 2024

https://github.com/lissettecarlr/AutomaticSpeechRecognition

Various Python wrapper implementations of speech-to-text (paraformer, whisper_online, whisper_offline, funasr), built to serve the kuon repository

ai asr audio audio-processing deepl paraformer python speech-to-text text whisper

Last synced: 24 Oct 2024

https://github.com/lissettecarlr/automaticspeechrecognition

Various Python wrapper implementations of speech-to-text (paraformer, whisper_online, whisper_offline, funasr), built to serve the kuon repository

ai asr audio audio-processing deepl paraformer python speech-to-text text whisper

Last synced: 19 Nov 2024

https://github.com/pablocerdeira/whatsapp-bot

This project is an advanced WhatsApp bot that leverages artificial intelligence for automated audio transcription, document summarization, and scheduling of future messages. It uses Whisper for transcription and offers a choice between OpenAI's API and the Ollama local model for document summarization.

api api-rest artificial-intelligence automation bot ollama openai whatsapp whatsapp-bot whisper

Last synced: 03 Dec 2024

https://github.com/gcoter/extract-keywords-from-youtube-videos

This project combines youtube-dl, whisper, LangChain and ChatGPT to extract keywords from YouTube videos. It was intended as a tool for Lyon Data Science to better reference its videos.

chatgpt langchain whisper youtube-dl

Last synced: 24 Oct 2024

https://github.com/cansik/speech-to-text-osc

Speech to text with OSC output.

osc speech-to-text whisper

Last synced: 13 Dec 2024

https://github.com/coderscreative/faster-whisper-rs

a rust crate for easily implementing faster-whisper stt into your rust programs.

ai faster-whisper rust speech-recognition speech-to-text stt whisper

Last synced: 09 Oct 2024

https://github.com/yjg30737/whisper_transcribe_youtube_video_example_gui

GUI Showcase of using Whisper to transcribe and analyze Youtube video

audio-to-text pyqt pyqt5 pyqt5-desktop-application python pytube qt whisper

Last synced: 06 Dec 2024

https://github.com/kristofferv98/voiceprocessingtoolkit

The VoiceProcessingToolkit is an all-encompassing suite designed for sophisticated voice detection, wake word recognition, text-to-speech synthesis, and advanced audio processing. It offers intuitive interfaces to streamline the integration of voice processing capabilities into your applications

api audio automation elevenlabs gpt-4 multithreading openai picovoice python speech text-to-speech transcription utility voice voice-processing wake-word whisper whisper-api

Last synced: 02 Nov 2024