Whisper | Ecosyste.ms: Awesome

https://github.com/kristofferv98/voiceprocessingtoolkit

The VoiceProcessingToolkit is an all-encompassing suite designed for sophisticated voice detection, wake word recognition, text-to-speech synthesis, and advanced audio processing. It offers intuitive interfaces to streamline the integration of voice processing capabilities into your applications

api audio automation elevenlabs gpt-4 multithreading openai picovoice python speech text-to-speech transcription utility voice voice-processing wake-word whisper whisper-api

Last synced: 02 Nov 2024

https://github.com/botisan-ai/whisper-aws-stack

Deplay Whisper on AWS Scalably

aws cdk ecs fargate fastapi openai silero-vad whisper

Last synced: 24 Oct 2024

https://github.com/0x20f/listen-wise

Save the last 30 seconds of audio to text using ai. Send that text to a notion page, readwise, obsidian, or just save it locally in a text file.

ai notion openai speech-to-text transcription whisper

Last synced: 30 Oct 2024

https://github.com/erkara/Rise-of-Transfer-Learning

you will find brief code implementations of some of the latest developments in AI, including Stable Diffusion, Whisper, YOLO and HuggigFace Transformers

gpt-3 huggingface openai stable-diffusion transfer-learning whisper yolov5

Last synced: 24 Oct 2024

https://github.com/yjg30737/whisper_transcribe_youtube_video_example_gui

GUI Showcase of using Whisper to transcribe and analyze Youtube video

audio-to-text pyqt pyqt5 pyqt5-desktop-application python pytube qt whisper

Last synced: 06 Dec 2024

https://github.com/prathamesh-mandavkar/AutoTalker

The project focuses on leveraging technology to create new courses, personalize existing ones, and enhance the assessment process, ultimately contributing to the development of 21st-century skills in students.

ai bark gdsc gdsc-dypsn gemini-api gemini-pro gen-ai ngo python solution-challenge-2024 stt subtitles tts video-creation whisper

Last synced: 24 Oct 2024

https://github.com/semyon-dev/whissage

the backend of blockchain-based messenger

blockchain blockchain-messenger ethereum geth messenger whisper whisper-protocol

Last synced: 30 Oct 2024

https://github.com/m0wer/aibot

Telegram bot powered by Ollama, capable of handling text and voice messages, with configurable language models and system prompts.

ai assistant llama3 ollama telegram telegram-bot tts whisper

Last synced: 10 Oct 2024

https://github.com/dbpprt/whispr

🎙️ Privacy-focused menubar app for local voice-to-text transcription on macOS, powered by Whisper.cpp - no cloud required

ai macos rust transcription whisper

Last synced: 26 Dec 2024

https://github.com/umerarif01/ai-translator

AI Translator: Fast and Accurate Translations with Next.js and OpenAI's Whisper and GPT-3 APIs

gpt-3 nextjs openai whisper

Last synced: 24 Oct 2024

https://github.com/robbinhan/whisper-test

以太坊whisper v6 demo

ethereum whisper

Last synced: 24 Oct 2024

https://github.com/ognisty321/whisper-transcription-ui

Whisper Transcription UI is a user-friendly graphical interface for whisper-standalone-win. Transcribe and translate audio/video files effortlessly with customizable settings and saved preferences.

gui python transcription ui whisper whisper-standalone-win

Last synced: 09 Oct 2024

https://github.com/tristan-mcinnis/realtime-whisper-console-transcriber

A real-time speech-to-text transcriber using the Whisper model, designed for efficiency and ease of use in the console. This tool leverages the faster_whisper library and Rich to provide a seamless user experience for transcribing audio inputs on the fly.

asr console python real-time speech-recognition speech-to-text terminal transcription whisper

Last synced: 12 Oct 2024

https://github.com/zaneh/heybilly

🗣️ It's like Alexa, but for your computer. Highly modular, real-time voice assistant. Built using self-assembling graphs.

contributions-welcome graph python3 rabbitmq self-hosted tts voice-assistant whisper

Last synced: 08 Dec 2024

https://github.com/achraf-oujjir/profgpt-smart-vr-professor

👨‍🏫🤖 ProfGPT: AI-powered VR professor with electrical circuits lab table ⚡💡 Built with Unity 🎮 GPT and Whisper APIs 🧠 and AWS Polly 🦜🗣️

ai-education aws-polly chatgpt-api csharp education oculus-quest-2 openai-api openai-whisper speech-to-text text-to-speech unity3d virtual-reality vr whisper

Last synced: 03 Nov 2024

https://github.com/200ok-ch/voice_vault

voice_vault enables you to record and archive all your meetings and conversations with ease. Later, search through them with lightning speed using full-text search.

ffmpeg transcription whisper

Last synced: 19 Nov 2024

https://github.com/chidiwilliams/talks

ai whisper

Last synced: 27 Oct 2024

https://github.com/cgbur/whisp

A lightweight and minimal desktop speech-to-text tool.

accessibility speech-to-text whisper

Last synced: 07 Dec 2024

https://github.com/walkswithaswagger/whisperforge

WhisperForge is a Python tool that leverages OpenAI's Whisper model to transcribe large audio files. It automatically splits files into manageable chunks, processes them, and combines the transcriptions into a single document. Ideal for handling lengthy recordings and generating clear, organized transcriptions.

audio-transcription openai python whisper

Last synced: 21 Nov 2024

https://github.com/adt109119/whisper-json-to-srt-converter

這是一個為了用來將 Groq 的 Whisper API 回傳的 JSON，轉換為 SRT 字幕而製作的簡單的專案。

ai converter gradio groq whisper

Last synced: 09 Oct 2024

https://github.com/sandy1990418/chinesetaiwanesewhisper

This repository focuses on leveraging OpenAI's Whisper model for speech recognition in Chinese (Mandarin) and Taiwanese Hokkien languages. It includes tools and scripts for data preprocessing, model training, and evaluation, tailored to improve speech recognition accuracy for these languages.

asr chinese gradio realtime speech-to-text streaming-audio taiwanese whisper

Last synced: 09 Oct 2024

https://github.com/fabio-garavini/ha-groq-whisper-stt-api

HACS custom integration for using GroqCloud speech-to-text (Whisper) API in the Assist pipeline, reducing the workload on the Home Assistant server.

groq-api home-assistant stt whisper

Last synced: 29 Sep 2024

https://github.com/fly-apps/cog-whisper

Run OpenAI Whisper as a Cog model on Fly GPUs

ai cog gpu whisper

Last synced: 17 Nov 2024

https://github.com/lbrndnr/nutshell-macos

An AI-powered note-taking app for your meetings. Built for macOS using SwiftUI.

ai llm swiftui whisper

Last synced: 13 Nov 2024

https://github.com/CrabAss/dCollab

Decentralized e-Learning Collaboration Platform as a Capstone Project (COMP4913, PolyU)

comp4913 dapp ethereum javascript react whisper

Last synced: 24 Oct 2024

https://github.com/ckaznable/yt-cli-live

Youtube Text Live Streaming in CLI

asr cli rust silero-vad whisper whisper-cpp youtube

Last synced: 12 Nov 2024

https://github.com/nexuslux/realtime-whisper-console-transcriber

A real-time speech-to-text transcriber using the Whisper model, designed for efficiency and ease of use in the console. This tool leverages the faster_whisper library and Rich to provide a seamless user experience for transcribing audio inputs on the fly.

asr console python real-time speech-recognition speech-to-text terminal transcription whisper

Last synced: 09 Oct 2024

https://github.com/micartey/karl-the-voice-assistant

Voice Assistant with the power of OpenAI's ChatGPT

ai chatgpt home-assistant karl openai raspberry-pi voice-assistant whisper

Last synced: 16 Nov 2024

https://github.com/gustavz/audio-to-text

streamlit app to transcript audio to text using openai's whisper library

audio-to-text streamlit whisper

Last synced: 19 Nov 2024

https://github.com/jorgeandrespadilla/avtools

AV Tools - A collection of CLI tools for audio and video processing (powered by AI). Audio transcription, Video to Audio conversion, YouTube downloader.

ai python pytorch whisper

Last synced: 13 Nov 2024

https://github.com/driftingruby/395-transcribing-with-artificial-intelligence

In this episode, we look at creating an audio transcription service which allows files uploaded from Active Storage to be transcribed with Artificial Intelligence. However, there are a lot of considerations around the approach from both a performance and thread safety perspectives.

artificial-intelligence openai ruby ruby-on-rails whisper

Last synced: 23 Dec 2024

https://github.com/schibsted/sum

Sum, a powerful tool for enhancing your articles with the help of ChatGPT.

chatgpt nextjs nrk openai tailwindcss vg whisper

Last synced: 13 Nov 2024

https://github.com/ribartra/call-listener_bot

A bot that downloads, transcribes and analyzes calls to find insights for sales advisors.

api audio-analyser call-bot call-listener drive gcp openai python whisper

Last synced: 09 Oct 2024

https://github.com/patbqc/thoughtforgeai

Forge your thoughts through an AI powered brainstorming session !

ai anthropic brainstorm brainstorming brainstorms mobile openai reactnative whisper

Last synced: 31 Oct 2024

https://github.com/awaisoem/interview-lingo

(Aug 2024) AI assistant which help with interviews, hiring, personality development and communication skills

ai ai71 drizzle-orm falcon neondb nextjs postgresql tailwindcss whisper

Last synced: 09 Oct 2024

https://github.com/hrehfeld/archlinux-whisper.cpp-model

PKGBUILD generation for whisper.cpp models

archlinux aur pkgbuild whisper whisper-cpp

Last synced: 14 Dec 2024

https://github.com/sslava/ai-voice-chat

AI Voice Chat

nodejs openai tts voice-recognition whisper

Last synced: 20 Oct 2024

https://github.com/weihanchen/google-colab-python-learn

📚 Learn Google Colab、Python、ML、OpenAI、Whisper、spaCy、NLP、HuggingFace

colab-notebook huggingface matplotlib natural-language-processing nlp openai pandas python spacy whisper

Last synced: 11 Nov 2024

https://github.com/phineas-pta/fine-tune-whisper-vi

jupyter notebooks to fine tune whisper models on Vietnamese using Colab and/or Kaggle and/or AWS EC2

aws docker fine-tuning lora multi-gpu-training speech-recognition speech-to-text vietnamese whisper

Last synced: 14 Oct 2024

https://github.com/t0mer/telessist

Telessist allows you to contact GPT3 directly from WhatsApp and not only that. Telessist also allows you to save your own personal data and later search and retrieve it using GPT3 to generate a response. In the examples folder, you can see several examples of how to use this bot so you don't have to remember anything ever again.

assistant chatgpt dall-e docker openapi python3 telegram telegram-bot weather whisper

Last synced: 06 Dec 2024

https://github.com/milkyskies/line-chatgpt

A LINE ChatGPT bot with text and AI audio generation / transcription.

chatgpt go golang surrealdb whisper

Last synced: 17 Dec 2024

https://github.com/jech/galene-stt

Speech-to-text support for Galene

galene stt videoconference webrtc whisper whisper-cpp

Last synced: 09 Oct 2024

https://github.com/danomation/Voice-Website

Talk back and forth to GPT over browser. Customize to have your own interactive voice assistant!

elevenlabs gpt stt tts whisper

Last synced: 24 Oct 2024

https://github.com/yanivhaliwa/linux-stuff

ai arp automation bash cyber device-discovery gpt linux monitoring openai package-manager python scanning scripts subtitle utilities whisper

Last synced: 09 Dec 2024

https://github.com/chriamue/whisper-example

Docker compose environment and example for whisper.

docker-compose geth p2p-network shh web3js whisper

Last synced: 15 Dec 2024

https://github.com/egorsmkv/whisper-ukrainian

Trainer and Evaluation scripts for fine-tuning Whisper models for the Ukrainian language

asr automatic-speech-recognition openai speech-recognition ukrainian whisper

Last synced: 18 Oct 2024

https://github.com/manucabral/quick-subtitles

An easy way to generate SRT subtitles from a video in Windows.

audio-to-text srt srt-subtitles subtitles subtitles-generator transcription whisper whisper-ai windows

Last synced: 03 Nov 2024

https://github.com/redocrepus/arkode

Code in VS Code, using your voice, fmedia, WhisperAI and ChatGPT

accessibility chatgpt chatgpt-api code-assistant coding-assistant coding-by-voice developer-tools openai openai-api programming-assistant programming-by-voice visual-studio-code visual-studio-code-extension visualstudiocode voice-coding voicecode voicecoding vscode-extension whisper whisper-api

Last synced: 24 Oct 2024

https://github.com/marketcalls/openalgo-voice-based-orders

OpenAlgo Voice Based Orders

flask groq openai python speech-to-text whisper

Last synced: 19 Dec 2024

https://github.com/tonywu71/distilling-and-forgetting-in-large-pre-trained-models

Code for my dissertation on "Distilling and Forgetting in Large Pre-Trained Models" for the MPhil in Machine Learning and Machine Intelligence (MLMI) at the University of Cambridge.

continual-learning distillation speech-recognition whisper

Last synced: 04 Dec 2024

https://github.com/knot-inc/john

John is a web app that records video, analyzes audio with AI, and identifies the speaker's native language from their English accent, simplifying language assessment.

audio-analysis machine-learning whisper

Last synced: 17 Nov 2024

https://github.com/datarabbit-ai/transcription_service

System/service with REST API for extracting text transcriptions from movies and audio recordings in most popular video formats.

containers datarabbit rest-api speech-to-text stt transcription transcription-services whisper

Last synced: 09 Oct 2024

https://github.com/jacoblincool/wft

Run Whisper fine-tuning with ease—it works on MPS, CUDA, and CPU without code changes.

fine-tuning whisper

Last synced: 11 Dec 2024

https://github.com/tensoraws/yuisub

Auto translation of new anime episodes based on Yui-MHCP001

anime chatgpt llm openai pysubs2 subtitle translation whisper

Last synced: 09 Oct 2024

https://github.com/kazkozdev/video-analyser

⚡ The YouTube Video Analyzer Pro brings AI-powered analysis capabilities to your fingertips, offering deep insights for content creators and marketers.

ai content-analytics fastapi llama3 llm ollama-api python3 video-analysis video-analysis-client whisper youtube youtube-analytics youtube-api youtube-subscribers

Last synced: 22 Dec 2024

https://github.com/water25234/ChatREP

Summary on Youtube By ChatGPT & whisper

chatgpt-api openai python python3 video whisper youtube

Last synced: 24 Oct 2024

https://github.com/my-north-ai/semantic_audio_filtering

Synthetic data augmentation technique via LLM for Automatic Speech Recognition fine tuning.

automatic-speech-recognition fine-tuning synthetic-dataset-generation text-to-speech whisper

Last synced: 24 Oct 2024

https://github.com/TheGuysBrushes/Whisper

Secured chat application

android chat socket whisper

Last synced: 24 Oct 2024

https://github.com/JoSuru/speeka

Speeaka is an open-source project that uses the Whisper model of OpenAI to transcribe audio into text. Its intuitive web interface makes it easy to use. Contributions are welcome.

open-source python python3 speech-to-text streamlit whisper

Last synced: 24 Oct 2024

https://github.com/bharathajjarapu/voicecipher

Local Speech transcription

transformerjs whisper

Last synced: 09 Oct 2024

https://github.com/limdongjin/ignkafasr

Real-Time In-memory Speaker Verification and Speech Recognition Project using apache ignite, apache kafka, speechbrain, whisper, stomp, spring webflux, kubernetes(k8s)

apache-ignite apache-kafka asr audio-recorder google-kubernetes-engine k8s kubernetes speaker-recognition speaker-verification speech-recognition speechbrain springframework stomp stompwebsocket webflux whisper

Last synced: 24 Oct 2024

https://github.com/upes-open/osoc-24-the-content-forge

The Content Hub Is a online platform which acts as a all in one solution helping content creators develop and generate short form video image content utilising genai models and cloud to maximize their efficiency and benefit from the ever-growing developments in ai models

aws docker fastapi genai microservices nodejs react whisper

Last synced: 09 Oct 2024

https://github.com/astrologos/py-speakeasy

Speakeasy GPT is a Jupyter notebook that utilizes several natural language processing utilities to provide a seamless and low-latency speech interface to ChatGPT and other large language models.

automatic-speech-recognition chat-gpt coqui-ai coqui-tts elevenlabs-api mimic mycroftai text-to-speech whisper

Last synced: 24 Oct 2024

https://github.com/vimwei/whispertranscriber

Whisper Transcribe and srt Resegment

speech-to-text subtitle whisper

Last synced: 17 Oct 2024

https://github.com/andreabak/whispersubs

Generate subtitles for your video or audio files using the power of AI

ai cuda deep-learning gpu-acceleration machine-learning srt subtitles transcribe transcription translate whisper

Last synced: 16 Nov 2024

https://github.com/seitzquest/RavenWhisperer

Listens to your voice and queries a language model for answers when a question is detected

rwkv whisper

Last synced: 22 Nov 2024

https://github.com/sanket-poojary-03/fine-tuning-whisper

Fine tuning Whisper-Small LLM for Hinglish Audio dataset

audio-dataset audio-to-text deep-learning fine-tuning huggingface-transformers python speech-recognition speech-to-text whisper whisper-ai

Last synced: 09 Oct 2024

https://github.com/daisyyedda/whisper-large-v2-atcosim_corpus

A fine-tuned Whisper model (whisper-large-v2) for aviation audio transcription. WER < 5%.

asr-model nlp whisper whisper-ai

Last synced: 09 Oct 2024

https://github.com/Lord-Haji/ChatAudio

chatbot gpt-3-5-turbo gpt-4 langchain langchain-python speech-recognition whisper whisper-api

Last synced: 24 Oct 2024

https://github.com/t-h-chung/note-taker

Note-taking app for online/local video/audio using Whisper transcription, ChatGPT, and Notion

chatgpt notes notion transcription whisper youtube

Last synced: 09 Oct 2024

https://github.com/gurpreetkaurjethra/multimodal-ai-app-using-llava-7b

Multimodal AI App using Llava 7B and Gradio

ai generative-ai gradio large-language-models llava llavacpp llm multimodal voice-assistant whisper

Last synced: 22 Nov 2024

https://github.com/paulocoutinhox/py-transcriptor-ai

PyTranscriptorAi - Transcript videos to text with Ai and add subtitles - OpenAi

ai openai subtitles transcript video whisper

Last synced: 09 Nov 2024

https://github.com/alessioborgi/stylealigned_multireference-multimodal

Novel framework for Zero-Shot Style Alignment in Text-to-Image generation, incorporating Multi-Modal Context-Awareness and Multi-Reference Style Alignment, using minimal attention sharing, ensuring consistent style transfer without fine-tuning.

adain blip clap context-awareness multi-modal multi-style-transfer no-fine-tuning shared-attention-heads style-aligned text-to-image-generation whisper zero-shot-learning

Last synced: 18 Oct 2024

https://github.com/williamwa/mssmith

A Telegram bot that utilizes the ChatGPT API and can communicate through voice.

chatpgt-api telegram-bot tts whisper

Last synced: 08 Nov 2024

https://github.com/sovit-123/sam_molmo_whisper

An integration of Segment Anything Model, Molmo, and, Whisper to segment objects using voice and natural language.

molmo segment-anything-model segmentanythingmodel vlm whisper

Last synced: 18 Oct 2024

https://github.com/i4ds/whisper-prep

Data preparation utility for the finetuning of OpenAI's Whisper model.

fine-tuning nlp speech-to-text whisper

Last synced: 09 Nov 2024

https://github.com/otonomee/mic2transcript

CLI tool that continuously transcribes audio from the device's built-in microphone to a text file. Runs in the background, providing an ongoing log of ambient audio as text.

audio cli cli-tool openai speech speech-transcription transcription whisper

Last synced: 09 Oct 2024

https://github.com/bhattbhavesh91/neo4j-palm2-makersuite

Explore how to build a Q&A system on Neo4j using Google's Palm2 model with MakerSuite in this repository.

google google-api google-palm maker-suite neo4j-driver neo4j-python-scripts palm2 python table-qa voice-assistant whisper

Last synced: 16 Nov 2024

https://github.com/jemtaly/whispering

A real-time transcription and translation tool implemented in Python based on the fast-whisper library.

live-caption python real-time-transcription real-time-translation tkinter transcription translation whisper

Last synced: 11 Nov 2024

https://github.com/sonhm3029/realtime-vietnamese-asr-react-native-and-whisper

This project implement end to end realtime vietnamese speech recognition with PhoWhisper in Backend and frontend in React Native

asr phowhiper react-native realtime realtime-speech-recognition speech-recognition speech-to-text vietnamese whisper

Last synced: 16 Nov 2024

https://github.com/imsanjoykb/speech-nlp-bootcamp

Speech NLP Bootcamp

asr audio-analysis audio-applications bangla-nlp huggingface-transformers seq2seq speech speech-recognition tts wav2vec2 whisper

Last synced: 17 Nov 2024

https://github.com/ksylvest/omniai-openai

An implementation of the OmniAI interface for OpenAI.

chatgpt omniai openai ruby whisper

Last synced: 06 Dec 2024

https://github.com/abhishtagatya/polly

☎️ Language Learning Chatbot

chatbot chatgpt python telegram whisper

Last synced: 17 Nov 2024

https://github.com/sakurajimamai-1202/stream-translator-gpt-webui

A web ui application that utilizes the stream-translator-gpt

faster-whisper gemini gpt transcribe translate translation translator webui whisper yt-dlp

Last synced: 11 Oct 2024

https://github.com/ndjenkins85/afkode

Personal voice command interface for iPhone on pythonista powered by Whisper and ChatGPT.

chatgpt openai python-packaging quick-start whisper

Last synced: 12 Oct 2024

https://github.com/Shtirmann/V2T

Telegram bot which automatically transcribes all voice and video messages to text.

ai aiogram faster-whisper python telegram-bot telegram-bot-python voice-to-text whisper

Last synced: 24 Oct 2024

https://github.com/julienvincent/whalker

Whisper talker

whisper whisper-ai whisper-cpp

Last synced: 07 Nov 2024

https://github.com/notyusheng/transcribe-translate

Local web app for transcription and translation services for audio and video using Whisper models

docker full-stack nodejs react reactjs self-hosted speech-to-text transcribe translate whisper

Last synced: 11 Oct 2024

https://github.com/pdcalado/waste

Whisper Audio Service for Transcription and Ergonomics

productivity rofi transcription tts whisper

Last synced: 20 Nov 2024

https://github.com/jojasadventure/whisper-client

Very simple Python based client for Whisper compatible endpoint

desktop-app dictation faster-whisper macos productivity python speech-to-text stt whisper

Last synced: 09 Oct 2024

https://github.com/aws-samples/amazon-ivs-webgpu-captions-demo

This repository contains an experimental demo application that shows how you can add client-side auto-generated captions to Amazon IVS Real-time and Low-latency streams using transformers.js and WebGPU.

ai amazon-ivs aws captions experimental ivs-lowlatency ivs-realtime lambda lowlatency lvl-300 realtime serverless transformersjs web webgpu webrtc whisper

Last synced: 09 Oct 2024

https://github.com/chaoticbyte/audio-summarize

An audio summarizer (faster-whisper and BART glued together)

ai ai-summarizer audio bart ctranslate2 faster-whisper nlp speech-to-text summarization whisper

Last synced: 09 Oct 2024

https://github.com/vi-ssh-al/auto-caption-generator

flask genai whisper

Last synced: 12 Oct 2024

https://github.com/adisol07/sharpspeech

SharpSpeech is free, local and open source way to speech and wake word recognition.

audio speech speech-recognition speech-to-text wake-word-detection wakeword whisper whisper-ai

Last synced: 19 Dec 2024

https://github.com/rhysdg/whisper-onnx-python

A low-footprint GPU accelerated Speech to Text Python package for the Jetpack 5 era bolstered by an optimized graph

ai chatbot cuda machine-learning onnxruntime speech-to-text whisper

Last synced: 09 Oct 2024

https://github.com/bigyaa/transcription-system

This versatile tool is designed for anyone in need of a robust solution for transcribing and diarizing large volumes of audio files. Whether you are dealing with terabytes or even larger quantities, our tool ensures efficient and accurate processing. Ideal for researchers, content creators, and businesses.

accessibility diarization speech-to-text storytelling-with-data transcription whisper