Projects in Awesome Lists tagged with whisper-api
A curated list of projects in awesome lists tagged with whisper-api .
https://github.com/adithya-s-k/omniparse
Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks
ingestion-api ocr omniparser parse-server parser-library vision-transformer web-crawler whisper-api
Last synced: 13 May 2025
https://github.com/mallorbc/whisper_mic
Project that allows one to use a microphone with OpenAI whisper.
microphone speech-recognition speech-to-text whisper whisper-ai whisper-api
Last synced: 16 May 2025
https://github.com/Evil0ctal/Fast-Powerful-Whisper-AI-Services-API
⚡ 一款用于自动语音识别 (ASR)、翻译的高性能异步 API。不需要购买Whisper API,使用本地运行的Whisper模型进行推理,并支持多GPU并发,针对分布式部署进行设计。还内置了包括TikTok、抖音等社交媒体平台的爬虫,可实现来自多个社交平台的无缝媒体处理,为媒体内容数据自动化处理提供了强大且可扩展的解决方案。
asr crawler douyin-api fastapi faster-whisper openai-whisper speech-recognition speech-to-text speech-to-text-api tiktok-analytics tiktok-api tiktok-crawler video-analysis whisper-ai whisper-api whisperbot
Last synced: 05 Apr 2025
https://github.com/evil0ctal/fast-powerful-whisper-ai-services-api
⚡ 一款用于自动语音识别 (ASR)、翻译的高性能异步 API。不需要购买Whisper API,使用本地运行的Whisper模型进行推理,并支持多GPU并发,针对分布式部署进行设计。还内置了包括TikTok、抖音等社交媒体平台的爬虫,可实现来自多个社交平台的无缝媒体处理,为媒体内容数据自动化处理提供了强大且可扩展的解决方案。
asr crawler douyin-api fastapi faster-whisper openai-whisper speech-recognition speech-to-text speech-to-text-api tiktok-analytics tiktok-api tiktok-crawler video-analysis whisper-ai whisper-api whisperbot
Last synced: 16 May 2025
https://github.com/Carleslc/AudioToText
Transcribe and translate audio to text using Whisper and DeepL.
audio audio-processing captions colab-notebook deepl ffmpeg google-colab jupyter-notebook language openai-whisper python speech-to-text subtitles text transcribe transcription translate translation whisper whisper-api
Last synced: 13 Apr 2025
https://github.com/carleslc/audiototext
Transcribe and translate audio to text using Whisper and DeepL.
audio audio-processing captions colab-notebook deepl ffmpeg google-colab jupyter-notebook language openai-whisper python speech-to-text subtitles text transcribe transcription translate translation whisper whisper-api
Last synced: 06 Apr 2025
https://github.com/mouredev/tggenerator
Generador de logotipos de eSports por IA (con fines académicos durante el evento Tenerife GG)
android android-app androidstudio dall-e dalle2 gpt-3-5-turbo jetpack-compose openai openai-api whisper whisper-ai whisper-api
Last synced: 25 Jan 2025
https://github.com/themanyone/whisper_dictation
Private voice keyboard, AI chat, images, webcam, recordings, voice control with >= 4 GiB of VRAM.
ai assistant-chat-bots assistive-technology client client-server coding continuous dictation hands-free launcher server speech-recognition stable-diffusion stable-diffusion-webui star-trek voice-assistant voice-control voice-recognition whisper-api whisper-cpp
Last synced: 04 Apr 2025
https://github.com/carloscdias/whisper-cpp-python
whisper.cpp bindings for python
python python3 whisper whisper-api whisper-cpp
Last synced: 11 Mar 2025
https://github.com/flyingfathead/telegrambot-openai-api
A feature-rich Python-based Telegram bot for OpenAI API & Perplexity API
bot bot-framework chatbot gpt-3-5-turbo gpt-35-turbo gpt-4 gpt-4-api gpt-4o-mini gpt4-api openai openai-api openai-api-chatbot perplexity-api telegram telegram-bot telegram-bot-api telegram-bot-app whisper whisper-ai whisper-api
Last synced: 01 May 2025
https://github.com/natehouk/flow-ai-hackathon-2023
YASS.ai - Team Orange's entry to the Flow AI Hackathon 2023
ai chatgpt chatgpt-api django gpt-3-5-turbo gpt-4 marketaux-api newsapi openai openai-api python3 whisper whisper-ai whisper-api
Last synced: 12 Apr 2025
https://github.com/gurpreetkaurjethra/youtube-video-transcribe-summarizer-llm-app
YouTube Video Summarization App built using open source LLM and Framework like Llama 2, Haystack, Whisper, and Streamlit. This app smoothly runs on CPU as Llama 2 model is in GGUF format loaded through Llama.cpp.
generative-ai haystack haystack-ai large-language-models llama2 llamacpp llm streamlit whisper-api
Last synced: 03 Dec 2024
https://github.com/goktugcy/noteai
An artificial intelligence supported NodeJS application that allows the audio file to be displayed as pdf after converting it to text with the Whisper tool.
adonisjs whisper whisper-ai whisper-api
Last synced: 15 Jan 2025
https://github.com/shaadclt/groq-whisper-transcription-app
A Streamlit-based web application that transcribes audio files using OpenAI's Whisper API. You can either upload an MP3 file or input a YouTube URL to convert video audio into text within seconds.
groq streamlit transcription whisper-api
Last synced: 11 Apr 2025
https://github.com/ayushsoni1010/textify
🎙️Seamlessly transcribing the world, one spoken word at a time, in any language you desire.
ai audio nextjs openai openai-api radix-ui shadcn-ui speech-to-text tailwind-css tailwindcss transcribe translation typescript video whisper-api
Last synced: 07 May 2025
https://github.com/kristofferv98/voiceprocessingtoolkit
The VoiceProcessingToolkit is an all-encompassing suite designed for sophisticated voice detection, wake word recognition, text-to-speech synthesis, and advanced audio processing. It offers intuitive interfaces to streamline the integration of voice processing capabilities into your applications
api audio automation elevenlabs gpt-4 multithreading openai picovoice python speech text-to-speech transcription utility voice voice-processing wake-word whisper whisper-api
Last synced: 12 May 2025
https://github.com/codeonthespectrum/aisubs
A subtitle generator for videos up to 10GB, automatically transcribing and translating spoken content into Brazilian Portuguese. Ideal for multilingual content, this tool creates accurate `.srt` files for seamless integration with video players.
automation ffmpeg language-detection moviepy multilingual-translations openai python speech-to-text subtitles-generator translation video-processing video-subtitles video-transcription whisper-api
Last synced: 09 Apr 2025
https://github.com/redocrepus/arkode
Code in VS Code, using your voice, fmedia, WhisperAI and ChatGPT
accessibility chatgpt chatgpt-api code-assistant coding-assistant coding-by-voice developer-tools openai openai-api programming-assistant programming-by-voice visual-studio-code visual-studio-code-extension visualstudiocode voice-coding voicecode voicecoding vscode-extension whisper whisper-api
Last synced: 11 Mar 2025
https://github.com/Lord-Haji/ChatAudio
chatbot gpt-3-5-turbo gpt-4 langchain langchain-python speech-recognition whisper whisper-api
Last synced: 11 Mar 2025
https://github.com/cedpoilly/parrot
Ced's parrot! Speech-to-text (Whisper API from OpenAI) and text-to-speech (Narakeet API) demo.
formidable narakeet nuxt3 openai whisper-api
Last synced: 03 Mar 2025
https://github.com/bruceunx/video-maestro
A powerful desktop app built with Tauri and ReactJS to manage videos from YouTube or similar platforms. Features include audio-to-text transcription, translation, summarization, and a user-friendly interface. Perfect for creators, researchers, and video enthusiasts!
Last synced: 28 Mar 2025
https://github.com/itz-fork/vrappy
Summarize videos using AI
collaborate openai summarizer video-summarization whisper-api
Last synced: 18 Feb 2025
https://github.com/jk-oster/voice-to-text-extension
A web extension to use your voice as input for any webpage
chrome-extension speech-to-text transcription voice-recognition webextension whisper-api
Last synced: 12 Apr 2025
https://github.com/niqifan007/openai-tts-stt-streamlit
A gui interface for tts (text-to-speech) and stt (speech-to-text) interfaces using the openai api developed by Streamlit, with a history function一个使用Streamlit开发的openai的api接口的tts(文字转语音)和stt(语音转文字)接口的gui界面,带有历史记录功能
openai openai-api streamlit stt-gui tts tts-gui whisper whisper-api
Last synced: 25 Mar 2025
https://github.com/maninhouse/huh
「Huh(蛤)?」是一個使用 Flask 和 OpenAI API 建立的 LINE 聊天機器人。它可以接收並處理來自 LINE 的語音訊息,並利用 OpenAI 的語音識別技術將語音轉換為文字,同時將文字訊息回傳給用戶。
chatbot flask linebot openai-api voice-recognition whisper-api
Last synced: 02 Apr 2025
https://github.com/danielrosehill/thought-pad
Linux desktop application that provides a two-stage process for creating notes from dictated speech (first stage, transcription via Whisper API; second stage light text formatting). Exports to markdown docs.
notes notes-app openai openai-whisper voice-to-text whisper whisper-api
Last synced: 24 Feb 2025
https://github.com/youknow2509/real-time-speech-to-text
Speech To Text in Real-Time
blackhole speech-recognition speech-to-text whisper whisper-api
Last synced: 06 Apr 2025
https://github.com/aznironman/pyscribe
PyScribe is a command-line tool to transcribe audio files. It uses `ffmpeg` for audio conversion and `pywhisper` for transcription.
audio audio-conversion audio-transcription clarktribegames ffmpeg local-model python pywhisper transcribe transcriber transcription whisper whisper-api
Last synced: 12 Mar 2025
https://github.com/chidwi-commits/host-client-for-whisper-ai
A simple Python host-client setup for audio transcription using OpenAI's Whisper AI model.
how-to python sample whisper whisper-ai whisper-api
Last synced: 17 Mar 2025
https://github.com/jacintogomez/whisper-ai-translation
Multilingual verbal conversation with an AI bot
langchain openai openai-api pygame python whisper-ai whisper-api
Last synced: 30 Mar 2025
https://github.com/satoryu/video_description_generator
gpt-35-turbo openai-api transcribe whisper-api youtube
Last synced: 25 Mar 2025
https://github.com/loginchik/audio-to-text
Audio transcriber based on Whisper by OpenAI
Last synced: 22 Feb 2025
https://github.com/ubos-tech/node-red-contrib-speech-to-text-ubos
Learn how to turn audio into text.
ai low-code lowcode node-red node-red-contrib node-red-flow openai openai-api openai-whisper speech-to-text whisper whisper-ai whisper-api
Last synced: 13 Mar 2025