An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with whisper-api

A curated list of projects in awesome lists tagged with whisper-api .

https://github.com/adithya-s-k/omniparse

Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks

ingestion-api ocr omniparser parse-server parser-library vision-transformer web-crawler whisper-api

Last synced: 13 May 2025

https://github.com/mallorbc/whisper_mic

Project that allows one to use a microphone with OpenAI whisper.

microphone speech-recognition speech-to-text whisper whisper-ai whisper-api

Last synced: 16 May 2025

https://github.com/Evil0ctal/Fast-Powerful-Whisper-AI-Services-API

⚡ 一款用于自动语音识别 (ASR)、翻译的高性能异步 API。不需要购买Whisper API,使用本地运行的Whisper模型进行推理,并支持多GPU并发,针对分布式部署进行设计。还内置了包括TikTok、抖音等社交媒体平台的爬虫,可实现来自多个社交平台的无缝媒体处理,为媒体内容数据自动化处理提供了强大且可扩展的解决方案。

asr crawler douyin-api fastapi faster-whisper openai-whisper speech-recognition speech-to-text speech-to-text-api tiktok-analytics tiktok-api tiktok-crawler video-analysis whisper-ai whisper-api whisperbot

Last synced: 05 Apr 2025

https://github.com/evil0ctal/fast-powerful-whisper-ai-services-api

⚡ 一款用于自动语音识别 (ASR)、翻译的高性能异步 API。不需要购买Whisper API,使用本地运行的Whisper模型进行推理,并支持多GPU并发,针对分布式部署进行设计。还内置了包括TikTok、抖音等社交媒体平台的爬虫,可实现来自多个社交平台的无缝媒体处理,为媒体内容数据自动化处理提供了强大且可扩展的解决方案。

asr crawler douyin-api fastapi faster-whisper openai-whisper speech-recognition speech-to-text speech-to-text-api tiktok-analytics tiktok-api tiktok-crawler video-analysis whisper-ai whisper-api whisperbot

Last synced: 16 May 2025

https://github.com/mouredev/tggenerator

Generador de logotipos de eSports por IA (con fines académicos durante el evento Tenerife GG)

android android-app androidstudio dall-e dalle2 gpt-3-5-turbo jetpack-compose openai openai-api whisper whisper-ai whisper-api

Last synced: 25 Jan 2025

https://github.com/carloscdias/whisper-cpp-python

whisper.cpp bindings for python

python python3 whisper whisper-api whisper-cpp

Last synced: 11 Mar 2025

https://github.com/gurpreetkaurjethra/youtube-video-transcribe-summarizer-llm-app

YouTube Video Summarization App built using open source LLM and Framework like Llama 2, Haystack, Whisper, and Streamlit. This app smoothly runs on CPU as Llama 2 model is in GGUF format loaded through Llama.cpp.

generative-ai haystack haystack-ai large-language-models llama2 llamacpp llm streamlit whisper-api

Last synced: 03 Dec 2024

https://github.com/goktugcy/noteai

An artificial intelligence supported NodeJS application that allows the audio file to be displayed as pdf after converting it to text with the Whisper tool.

adonisjs whisper whisper-ai whisper-api

Last synced: 15 Jan 2025

https://github.com/shaadclt/groq-whisper-transcription-app

A Streamlit-based web application that transcribes audio files using OpenAI's Whisper API. You can either upload an MP3 file or input a YouTube URL to convert video audio into text within seconds.

groq streamlit transcription whisper-api

Last synced: 11 Apr 2025

https://github.com/ayushsoni1010/textify

🎙️Seamlessly transcribing the world, one spoken word at a time, in any language you desire.

ai audio nextjs openai openai-api radix-ui shadcn-ui speech-to-text tailwind-css tailwindcss transcribe translation typescript video whisper-api

Last synced: 07 May 2025

https://github.com/kristofferv98/voiceprocessingtoolkit

The VoiceProcessingToolkit is an all-encompassing suite designed for sophisticated voice detection, wake word recognition, text-to-speech synthesis, and advanced audio processing. It offers intuitive interfaces to streamline the integration of voice processing capabilities into your applications

api audio automation elevenlabs gpt-4 multithreading openai picovoice python speech text-to-speech transcription utility voice voice-processing wake-word whisper whisper-api

Last synced: 12 May 2025

https://github.com/codeonthespectrum/aisubs

A subtitle generator for videos up to 10GB, automatically transcribing and translating spoken content into Brazilian Portuguese. Ideal for multilingual content, this tool creates accurate `.srt` files for seamless integration with video players.

automation ffmpeg language-detection moviepy multilingual-translations openai python speech-to-text subtitles-generator translation video-processing video-subtitles video-transcription whisper-api

Last synced: 09 Apr 2025

https://github.com/cedpoilly/parrot

Ced's parrot! Speech-to-text (Whisper API from OpenAI) and text-to-speech (Narakeet API) demo.

formidable narakeet nuxt3 openai whisper-api

Last synced: 03 Mar 2025

https://github.com/bruceunx/video-maestro

A powerful desktop app built with Tauri and ReactJS to manage videos from YouTube or similar platforms. Features include audio-to-text transcription, translation, summarization, and a user-friendly interface. Perfect for creators, researchers, and video enthusiasts!

openai-api tauri2 whisper-api

Last synced: 28 Mar 2025

https://github.com/jk-oster/voice-to-text-extension

A web extension to use your voice as input for any webpage

chrome-extension speech-to-text transcription voice-recognition webextension whisper-api

Last synced: 12 Apr 2025

https://github.com/niqifan007/openai-tts-stt-streamlit

A gui interface for tts (text-to-speech) and stt (speech-to-text) interfaces using the openai api developed by Streamlit, with a history function一个使用Streamlit开发的openai的api接口的tts(文字转语音)和stt(语音转文字)接口的gui界面,带有历史记录功能

openai openai-api streamlit stt-gui tts tts-gui whisper whisper-api

Last synced: 25 Mar 2025

https://github.com/maninhouse/huh

「Huh(蛤)?」是一個使用 Flask 和 OpenAI API 建立的 LINE 聊天機器人。它可以接收並處理來自 LINE 的語音訊息,並利用 OpenAI 的語音識別技術將語音轉換為文字,同時將文字訊息回傳給用戶。

chatbot flask linebot openai-api voice-recognition whisper-api

Last synced: 02 Apr 2025

https://github.com/danielrosehill/thought-pad

Linux desktop application that provides a two-stage process for creating notes from dictated speech (first stage, transcription via Whisper API; second stage light text formatting). Exports to markdown docs.

notes notes-app openai openai-whisper voice-to-text whisper whisper-api

Last synced: 24 Feb 2025

https://github.com/aznironman/pyscribe

PyScribe is a command-line tool to transcribe audio files. It uses `ffmpeg` for audio conversion and `pywhisper` for transcription.

audio audio-conversion audio-transcription clarktribegames ffmpeg local-model python pywhisper transcribe transcriber transcription whisper whisper-api

Last synced: 12 Mar 2025

https://github.com/chidwi-commits/host-client-for-whisper-ai

A simple Python host-client setup for audio transcription using OpenAI's Whisper AI model.

how-to python sample whisper whisper-ai whisper-api

Last synced: 17 Mar 2025

https://github.com/jacintogomez/whisper-ai-translation

Multilingual verbal conversation with an AI bot

langchain openai openai-api pygame python whisper-ai whisper-api

Last synced: 30 Mar 2025

https://github.com/loginchik/audio-to-text

Audio transcriber based on Whisper by OpenAI

whisper-api

Last synced: 22 Feb 2025