Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Whisper

Whisper is an autoregressive language model developed by OpenAI. It is trained on a large corpus of text using a transformer architecture and is capable of generating high-quality natural language text. Whisper can be used for tasks such as language modeling, text completion, and text generation. It has shown impressive performance on various benchmarks and has been released by OpenAI to encourage research in the field of language modeling. Whisper is not yet available for public use, but it has the potential to transform the field of natural language processing and generate new opportunities for language-based applications.

https://github.com/m0rf30/shisper

A quick & dirty script to generate and view subtitles and transcriptions for your multimedia files using ggerganov/whisper.cpp

asr bash shisper whisper whispercpp

Last synced: 14 Oct 2024

https://github.com/luquedaniel/whisper2subs

A CLI tool that transcribes audio using openai-whisper and translates it using DeepL.

audio cli deepl subtitle transcribe translate video weekend-project whisper

Last synced: 11 Oct 2024

https://github.com/voidful/whisper-live-asr-demo

run whisper on CPU/GPU server

asr livestream whisper

Last synced: 24 Oct 2024

https://github.com/evilfreelancer/docker-whisper-server

whisper.cpp HTTP transcription server with OpenAI-like API in Docker

api api-server asr cuda docker docker-compose dockerfile nvidia openai openai-api whisper whisper-cpp

Last synced: 09 Oct 2024

https://github.com/redocrepus/Whisper-Paste

Chrome extension that allows dictating anywhere using OpenAI Whisper

chrome-extension dictation openai openai-api text-to-speech voice-recognition voice-typing whisper whisper-ai

Last synced: 24 Oct 2024

https://github.com/jim60105/aichatassistant

Stream YouTube live to OpenAI, get AI-generated summaries and real-time reply options. (Chrome Extension)

chrome-extension openai typescript whisper youtube

Last synced: 23 Oct 2024

https://github.com/kurianbenoy/malayalam_asr_benchmarking

A study to benchmark whisper based ASRs in Malayalam

asr benchmarking speech transformers-library whisper

Last synced: 14 Oct 2024

https://github.com/hisano/openai-whisper-on-docker

OpenAI Whisper on Docker

docker openai whisper

Last synced: 10 Nov 2024

https://github.com/mbotsu/mlx_speech2text

Audio transcription using mlx whisper and vad silence processing

mlx silero-vad whisper

Last synced: 09 Oct 2024

https://github.com/legendsort/openAISpeechToDatabase

AI automation to save formatted text with proper title from speech

automation chatgpt dropbox notion openai whisper zapier

Last synced: 05 Aug 2024

https://github.com/oddlama/whisper-overlay

A wayland overlay providing speech-to-text functionality for any application via a global push-to-talk hotkey

faster-whisper hyprland realtime speech-recognition speech-to-text wayland whisper wlroots

Last synced: 09 Oct 2024

https://github.com/hoangv97/ai-chatbot

Integrate ChatGPT, Dall-E, Whisper and other AI models in Replicate into Messenger and Telegram bot

bottender chatbot chatgpt dall-e2 messenger-bot replicate telegram-bot typescript whisper

Last synced: 24 Oct 2024

https://github.com/qqxufo/whisper-nodejs

whisper-nodejs is an npm package for using OpenAI's Whisper API to transcribe and translate audio. With whisper-nodejs, you can easily convert audio files into text and translate them into English or other supported languages.

nodejs openai whisper whisper-nodejs

Last synced: 13 Nov 2024

https://github.com/egorsmkv/optimized-whisper

Use quantized versions of Whisper to speed up inference

faster-whisper hqq quantization whisper

Last synced: 18 Oct 2024

https://github.com/mharrvic/redhorse-ai-transcriber

Audio transcriber using Openai whisper ML deployed to Banana.dev

banana openai whisper

Last synced: 07 Aug 2024

https://github.com/neka-nat/stenocaptioner

CLI tool for automatic subtitling using whisper.

python subtitles subtitles-generator whisper

Last synced: 14 Oct 2024

https://github.com/hiradary/simplewhisper

A simple speech-to-text transcription interface using OpenAI's Whisper API.

openai speech-to-text whisper whisper-ai

Last synced: 09 Oct 2024

https://github.com/SrinadhVura/OpenAI-Stack-Hack

Our Medifix is an AI powered assistant powered on gpt-3.5 turbo (chatGPT). Medifix is designed to help people by providing preventive measures based on the symptoms mentioned.

chatgpt gtts streamlit whisper

Last synced: 24 Oct 2024

https://github.com/makaveli10/whisper-tflite

openai/whisper in TFLite

tflite whisper

Last synced: 27 Oct 2024

https://github.com/jxxe/murmur

A proof-of-concept transcription app

journalism mac macos transcribe transcription whisper

Last synced: 24 Oct 2024

https://github.com/moebiussurfing/ofxsurfingtextsubtitle

Draws subtitles from an .SRT (or plain text) into a formatted styled paragraph with fading opacity and more.

openframeworks openframeworks-addon whisper whisper-cpp

Last synced: 27 Oct 2024

https://github.com/BatuhanYilmaz26/Youtube-Transcriber

Input a YouTube video link and get a transcription as a .txt, .vtt or .srt file.

automatic-speech-recognition huggingface openai python speech-recognition streamlit whisper

Last synced: 24 Oct 2024

https://github.com/gabrielrf/voice2text

Descrição automática de mensagens de voz em conversas privadas no Telegram

automation openai openai-whisper pyrogram telegram transcription whisper

Last synced: 11 Nov 2024

https://github.com/princejoogie/chunktube

It's YouTube.. but text!

gpt-3 openai react typescript whisper

Last synced: 09 Nov 2024

https://github.com/lissettecarlr/AutomaticSpeechRecognition

语音转文本的各类python封装实现(paraformer、whisper_online、whisper_offline、funasr),用于服务kuon仓库

ai asr audio audio-processing deepl paraformer python speech-to-text text whisper

Last synced: 24 Oct 2024

https://github.com/rakshans1/ex-whisper

Elixir speech to text demo

bumblebee elixir nx whisper

Last synced: 27 Oct 2024

https://github.com/coderscreative/faster-whisper-rs

a rust crate for easily implementing faster-whisper stt into your rust programs.

ai faster-whisper rust speech-recognition speech-to-text stt whisper

Last synced: 09 Oct 2024

https://github.com/cansik/speech-to-text-osc

Speech to text with OSC output.

osc speech-to-text whisper

Last synced: 23 Oct 2024

https://github.com/doctorpok42/subtitle

Create subtitles for your video and traduction in a few clicks

ai ffmpeg groq material-ui multer nextjs openai sass ts whisper

Last synced: 11 Nov 2024

https://github.com/ignabelitzky/easy-subber

A Python-based tool that that takes video files and generates .srt subtitle files using Whisper for speech recognition, FFmpeg for audio processing, and a simple Tkinter GUI

ffmpeg gui python speech-recognition srt subtitles tkinter transcription video-processing whisper

Last synced: 22 Oct 2024

https://github.com/gcoter/extract-keywords-from-youtube-videos

This project combines youtube-dl, whisper, LangChain and ChatGPT to extract keywords from YouTube videos. It was intented as a tool for Lyon Data Science to better reference its videos.

chatgpt langchain whisper youtube-dl

Last synced: 24 Oct 2024

https://github.com/lhr0909/live-subtitles-rokid-ar

通过Rokid AR眼镜和OpenAI Whisper实现现实生活中的字幕

augmented-reality openai real-time rokid subtitles whisper

Last synced: 11 Oct 2024

https://github.com/0x20f/listen-wise

Save the last 30 seconds of audio to text using ai. Send that text to a notion page, readwise, obsidian, or just save it locally in a text file.

ai notion openai speech-to-text transcription whisper

Last synced: 30 Oct 2024

https://github.com/erkara/Rise-of-Transfer-Learning

you will find brief code implementations of some of the latest developments in AI, including Stable Diffusion, Whisper, YOLO and HuggigFace Transformers

gpt-3 huggingface openai stable-diffusion transfer-learning whisper yolov5

Last synced: 24 Oct 2024

https://github.com/ognisty321/whisper-transcription-ui

Whisper Transcription UI is a user-friendly graphical interface for whisper-standalone-win. Transcribe and translate audio/video files effortlessly with customizable settings and saved preferences.

gui python transcription ui whisper whisper-standalone-win

Last synced: 09 Oct 2024

https://github.com/m0wer/aibot

Telegram bot powered by Ollama, capable of handling text and voice messages, with configurable language models and system prompts.

ai assistant llama3 ollama telegram telegram-bot tts whisper

Last synced: 10 Oct 2024

https://github.com/umerarif01/ai-translator

AI Translator: Fast and Accurate Translations with Next.js and OpenAI's Whisper and GPT-3 APIs

gpt-3 nextjs openai whisper

Last synced: 24 Oct 2024

https://github.com/kristofferv98/voiceprocessingtoolkit

The VoiceProcessingToolkit is an all-encompassing suite designed for sophisticated voice detection, wake word recognition, text-to-speech synthesis, and advanced audio processing. It offers intuitive interfaces to streamline the integration of voice processing capabilities into your applications

api audio automation elevenlabs gpt-4 multithreading openai picovoice python speech text-to-speech transcription utility voice voice-processing wake-word whisper whisper-api

Last synced: 02 Nov 2024

https://github.com/tristan-mcinnis/realtime-whisper-console-transcriber

A real-time speech-to-text transcriber using the Whisper model, designed for efficiency and ease of use in the console. This tool leverages the faster_whisper library and Rich to provide a seamless user experience for transcribing audio inputs on the fly.

asr console python real-time speech-recognition speech-to-text terminal transcription whisper

Last synced: 12 Oct 2024

https://github.com/prathamesh-mandavkar/AutoTalker

The project focuses on leveraging technology to create new courses, personalize existing ones, and enhance the assessment process, ultimately contributing to the development of 21st-century skills in students.

ai bark gdsc gdsc-dypsn gemini-api gemini-pro gen-ai ngo python solution-challenge-2024 stt subtitles tts video-creation whisper

Last synced: 24 Oct 2024

https://github.com/yjg30737/whisper_transcribe_youtube_video_example_gui

GUI Showcase of using Whisper to transcribe and analyze Youtube video

audio-to-text pyqt pyqt5 pyqt5-desktop-application python pytube qt whisper

Last synced: 07 Nov 2024

https://github.com/botisan-ai/whisper-aws-stack

Deplay Whisper on AWS Scalably

aws cdk ecs fargate fastapi openai silero-vad whisper

Last synced: 24 Oct 2024

https://github.com/robbinhan/whisper-test

以太坊whisper v6 demo

ethereum whisper

Last synced: 24 Oct 2024

https://github.com/achraf-oujjir/profgpt-smart-vr-professor

👨‍🏫🤖 ProfGPT: AI-powered VR professor with electrical circuits lab table ⚡💡 Built with Unity 🎮 GPT and Whisper APIs 🧠 and AWS Polly 🦜🗣️

ai-education aws-polly chatgpt-api csharp education oculus-quest-2 openai-api openai-whisper speech-to-text text-to-speech unity3d virtual-reality vr whisper

Last synced: 03 Nov 2024

https://github.com/t0mer/telessist

Telessist allows you to contact GPT3 directly from WhatsApp and not only that. Telessist also allows you to save your own personal data and later search and retrieve it using GPT3 to generate a response. In the examples folder, you can see several examples of how to use this bot so you don't have to remember anything ever again.

assistant chatgpt dall-e docker openapi python3 telegram telegram-bot weather whisper

Last synced: 15 Oct 2024

https://github.com/phineas-pta/fine-tune-whisper-vi

jupyter notebooks to fine tune whisper models on Vietnamese using Colab and/or Kaggle and/or AWS EC2

aws docker fine-tuning lora multi-gpu-training speech-recognition speech-to-text vietnamese whisper

Last synced: 14 Oct 2024

https://github.com/abus-aikorea/voice-pro

The best gradio web-ui for ai transcription, translation and TTS. Automatic subtitle creation using faster whisper. Easy one click installation. Fully portable.

asr faster-whisper translation tts whisper

Last synced: 09 Oct 2024

https://github.com/egorsmkv/whisper-ukrainian

Trainer and Evaluation scripts for fine-tuning Whisper models for the Ukrainian language

asr automatic-speech-recognition openai speech-recognition ukrainian whisper

Last synced: 18 Oct 2024

https://github.com/nexuslux/realtime-whisper-console-transcriber

A real-time speech-to-text transcriber using the Whisper model, designed for efficiency and ease of use in the console. This tool leverages the faster_whisper library and Rich to provide a seamless user experience for transcribing audio inputs on the fly.

asr console python real-time speech-recognition speech-to-text terminal transcription whisper

Last synced: 09 Oct 2024

https://github.com/adt109119/whisper-json-to-srt-converter

這是一個為了用來將 Groq 的 Whisper API 回傳的 JSON,轉換為 SRT 字幕而製作的簡單的專案。

ai converter gradio groq whisper

Last synced: 09 Oct 2024

https://github.com/weihanchen/google-colab-python-learn

📚 Learn Google Colab、Python、ML、OpenAI、Whisper、spaCy、NLP、HuggingFace

colab-notebook huggingface matplotlib natural-language-processing nlp openai pandas python spacy whisper

Last synced: 11 Nov 2024

https://github.com/fabio-garavini/ha-groq-whisper-stt-api

HACS custom integration for using GroqCloud speech-to-text (Whisper) API in the Assist pipeline, reducing the workload on the Home Assistant server.

groq-api home-assistant stt whisper

Last synced: 29 Sep 2024

https://github.com/jech/galene-stt

Speech-to-text support for Galene

galene stt videoconference webrtc whisper whisper-cpp

Last synced: 09 Oct 2024

https://github.com/danomation/Voice-Website

Talk back and forth to GPT over browser. Customize to have your own interactive voice assistant!

elevenlabs gpt stt tts whisper

Last synced: 24 Oct 2024

https://github.com/ribartra/call-listener_bot

A bot that downloads, transcribes and analyzes calls to find insights for sales advisors.

api audio-analyser call-bot call-listener drive gcp openai python whisper

Last synced: 09 Oct 2024

https://github.com/gustavz/audio-to-text

streamlit app to transcript audio to text using openai's whisper library

audio-to-text streamlit whisper

Last synced: 24 Oct 2024

https://github.com/CrabAss/dCollab

Decentralized e-Learning Collaboration Platform as a Capstone Project (COMP4913, PolyU)

comp4913 dapp ethereum javascript react whisper

Last synced: 24 Oct 2024

https://github.com/patbqc/thoughtforgeai

Forge your thoughts through an AI powered brainstorming session !

ai anthropic brainstorm brainstorming brainstorms mobile openai reactnative whisper

Last synced: 31 Oct 2024

https://github.com/manucabral/quick-subtitles

An easy way to generate SRT subtitles from a video in Windows.

audio-to-text srt srt-subtitles subtitles subtitles-generator transcription whisper whisper-ai windows

Last synced: 03 Nov 2024

https://github.com/awaisoem/interview-lingo

(Aug 2024) AI assistant which help with interviews, hiring, personality development and communication skills

ai ai71 drizzle-orm falcon neondb nextjs postgresql tailwindcss whisper

Last synced: 09 Oct 2024

https://github.com/sandy1990418/chinesetaiwanesewhisper

This repository focuses on leveraging OpenAI's Whisper model for speech recognition in Chinese (Mandarin) and Taiwanese Hokkien languages. It includes tools and scripts for data preprocessing, model training, and evaluation, tailored to improve speech recognition accuracy for these languages.

asr chinese gradio realtime speech-to-text streaming-audio taiwanese whisper

Last synced: 09 Oct 2024

https://github.com/ckaznable/yt-cli-live

Youtube Text Live Streaming in CLI

asr cli rust silero-vad whisper whisper-cpp youtube

Last synced: 12 Nov 2024

https://github.com/chriamue/whisper-example

Docker compose environment and example for whisper.

docker-compose geth p2p-network shh web3js whisper

Last synced: 24 Oct 2024

https://github.com/milkyskies/line-chatgpt

A LINE ChatGPT bot with text and AI audio generation / transcription.

chatgpt go golang surrealdb whisper

Last synced: 09 Oct 2024

https://github.com/my-north-ai/semantic_audio_filtering

Synthetic data augmentation technique via LLM for Automatic Speech Recognition fine tuning.

automatic-speech-recognition fine-tuning synthetic-dataset-generation text-to-speech whisper

Last synced: 24 Oct 2024

https://github.com/olololoe110399/mikasa_gpt

🚀 MiksaGPT, part of the 'Miksa' project, is a groundbreaking voice assistant utilizing Claude 3 and APIs from 'anthropic' and 'elevenlabs'. It enables real-time Opus two-way voice chat with seamless interruptibility, built with Flutter and available for free on GitHub.

aivoice artificialintelligence claude claudeai elevenlabs flutterai flutterprogramming flutterprojects openai opensource opensourceai opus speechtotext whisper

Last synced: 05 Nov 2024

https://github.com/TheGuysBrushes/Whisper

Secured chat application

android chat socket whisper

Last synced: 24 Oct 2024

https://github.com/daisyyedda/whisper-large-v2-atcosim_corpus

A fine-tuned Whisper model (whisper-large-v2) for aviation audio transcription. WER < 5%.

asr-model nlp whisper whisper-ai

Last synced: 09 Oct 2024

https://github.com/williamwa/mssmith

A Telegram bot that utilizes the ChatGPT API and can communicate through voice.

chatpgt-api telegram-bot tts whisper

Last synced: 08 Nov 2024

https://github.com/otonomee/mic2transcript

CLI tool that continuously transcribes audio from the device's built-in microphone to a text file. Runs in the background, providing an ongoing log of ambient audio as text.

audio cli cli-tool openai speech speech-transcription transcription whisper

Last synced: 09 Oct 2024

https://github.com/t-h-chung/note-taker

Note-taking app for online/local video/audio using Whisper transcription, ChatGPT, and Notion

chatgpt notes notion transcription whisper youtube

Last synced: 09 Oct 2024

https://github.com/JoSuru/speeka

Speeaka is an open-source project that uses the Whisper model of OpenAI to transcribe audio into text. Its intuitive web interface makes it easy to use. Contributions are welcome.

open-source python python3 speech-to-text streamlit whisper

Last synced: 24 Oct 2024

https://github.com/alessioborgi/stylealigned_multireference-multimodal

Novel framework for Zero-Shot Style Alignment in Text-to-Image generation, incorporating Multi-Modal Context-Awareness and Multi-Reference Style Alignment, using minimal attention sharing, ensuring consistent style transfer without fine-tuning.

adain blip clap context-awareness multi-modal multi-style-transfer no-fine-tuning shared-attention-heads style-aligned text-to-image-generation whisper zero-shot-learning

Last synced: 18 Oct 2024

https://github.com/water25234/ChatREP

Summary on Youtube By ChatGPT & whisper

chatgpt-api openai python python3 video whisper youtube

Last synced: 24 Oct 2024

https://github.com/zaneh/heybilly

🗣️ It's like Alexa, but for your computer. Highly modular, real-time voice assistant. Built using self-assembling graphs.

contributions-welcome graph python3 rabbitmq self-hosted tts voice-assistant whisper

Last synced: 19 Oct 2024

https://github.com/bharathajjarapu/voicecipher

Local Speech transcription

transformerjs whisper

Last synced: 09 Oct 2024

https://github.com/i4ds/whisper-prep

Data preparation utility for the finetuning of OpenAI's Whisper model.

fine-tuning nlp speech-to-text whisper

Last synced: 09 Nov 2024

https://github.com/limdongjin/ignkafasr

Real-Time In-memory Speaker Verification and Speech Recognition Project using apache ignite, apache kafka, speechbrain, whisper, stomp, spring webflux, kubernetes(k8s)

apache-ignite apache-kafka asr audio-recorder google-kubernetes-engine k8s kubernetes speaker-recognition speaker-verification speech-recognition speechbrain springframework stomp stompwebsocket webflux whisper

Last synced: 24 Oct 2024

https://github.com/marty1885/useful-whisper-server

Whisper server based on useful-transformers for the RK3588

npu rk3588 rockchip useful-transformers whisper

Last synced: 15 Oct 2024

https://github.com/paulocoutinhox/py-transcriptor-ai

PyTranscriptorAi - Transcript videos to text with Ai and add subtitles - OpenAi

ai openai subtitles transcript video whisper

Last synced: 09 Nov 2024

https://github.com/seitzquest/RavenWhisperer

Listens to your voice and queries a language model for answers when a question is detected

rwkv whisper

Last synced: 05 Aug 2024

https://github.com/astrologos/py-speakeasy

Speakeasy GPT is a Jupyter notebook that utilizes several natural language processing utilities to provide a seamless and low-latency speech interface to ChatGPT and other large language models.

automatic-speech-recognition chat-gpt coqui-ai coqui-tts elevenlabs-api mimic mycroftai text-to-speech whisper

Last synced: 24 Oct 2024

https://github.com/ndjenkins85/afkode

Personal voice command interface for iPhone on pythonista powered by Whisper and ChatGPT.

chatgpt openai python-packaging quick-start whisper

Last synced: 12 Oct 2024

https://github.com/tensoraws/yuisub

Auto translation of new anime episodes based on Yui-MHCP001

anime chatgpt llm openai pysubs2 subtitle translation whisper

Last synced: 09 Oct 2024

https://github.com/ksylvest/omniai-openai

An implementation of the OmniAI interface for OpenAI.

chatgpt omniai openai ruby whisper

Last synced: 09 Oct 2024