Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Whisper
Whisper is an automatic speech recognition (ASR) model developed by OpenAI. It is an encoder-decoder transformer trained on a large, multilingual corpus of audio paired with transcripts. Whisper can be used for tasks such as multilingual speech transcription, speech translation into English, and spoken-language identification. OpenAI has released the model weights and inference code as open source to encourage research on robust speech processing, and the model is publicly available through the repository listed below.
- GitHub: https://github.com/topics/whisper
- Repo: https://github.com/openai/whisper
- Created by: OpenAI
- Released: September 2022
- Related Topics: machine-learning, artificial-intelligence, language-modeling
- Last updated: 2024-12-26 00:29:40 UTC
https://github.com/vlazic/json-verbose-to-vtt-converter
Transform `json_verbose` transcriptions from OpenAI, Groq, or command-line tools into VTT files with this Deno converter.
converter groq json json-verbose openai vtt webvtt whisper
Last synced: 26 Nov 2024
https://github.com/eva-kaushik/multilingual-transcription-with-openai_whisper
Whisper Automatic Speech Recognition (ASR) Model
openai openai-api transcription webapp whisper
Last synced: 22 Dec 2024
https://github.com/microsoft/azure-ai-foundry-whatsapp-bot
WhatsApp Bot built with Azure Functions and Azure AI Foundry, using Python.
azure-ai-foundry azure-functions azure-openai python whatsapp-api whatsapp-bot whisper
Last synced: 27 Nov 2024
https://github.com/nelzomal/videolens_ai
VideoLens AI is a powerful Chrome extension that enhances your YouTube viewing experience
ai chrome-ai gemini-nano transformers whisper wxt
Last synced: 02 Dec 2024
https://github.com/pawelzeja098/whisper-video-transcription
Testing OpenAI Whisper for transcribing videos
mp4 transcription whisper whisper-ai
Last synced: 28 Nov 2024
https://github.com/dheison0/subcreator
A subtitle creator, translator, and embedder tool made using AI
ai machine-learning ml python subtitles video-processing whisper
Last synced: 09 Oct 2024
https://github.com/concaption/containerized-transcription-api
Containerized Transcription API using Whisper Model and FastAPI
docker fastapi openai transcription whisper
Last synced: 16 Dec 2024
https://github.com/ivanrj7j/transcription
This project transcribes audio using Whisper and provides an API
ai api flask transcription whisper
Last synced: 09 Oct 2024
https://github.com/seanvelasco/ai
Cloudflare AI challenge submission: Slater - your virtual foreign language friend
ai artificial-intelligence language-learning llama2 llm m2m100 machine-learning whisper
Last synced: 09 Dec 2024
https://github.com/velocitatem/dontlectureme
A program that pays attention to your lectures for you.
ai lectures university whisper
Last synced: 03 Dec 2024
https://github.com/breadrock1/audio-to-text
A simple backend project that uses whisper-rs.
actix-web audio-to-text rust swagger-ui whisper
Last synced: 11 Nov 2024
https://github.com/zuplyx/subtitle-creator
Add English subtitles to videos using openai/whisper-large-v3
open-ai poetry-python python3 subtitles-generator whisper
Last synced: 09 Dec 2024
https://github.com/jalvarezz13/summarai
SummarAI utilizes PyMovie and Whisper to transcribe videos, enabling you to ask questions about the content using Llama2 and Llama-index for insightful interaction.
llama-index llama2 pymovie whisper
Last synced: 22 Dec 2024
https://github.com/thealphamerc/audio-to-text
Transcribe multilingual audio clips using the Whisper model
Last synced: 16 Dec 2024
https://github.com/brucewind/localwhisperapiservice
openai-whisper transcribe whisper
Last synced: 19 Nov 2024
https://github.com/ajxv/rtstt
Real-time speech-to-text transcription using OpenAI Whisper
live-transcription openai openai-whisper python3 transcription whisper
Last synced: 22 Dec 2024
https://github.com/obay-ismaeel/post-generator
An API that generates social media posts by implementing RAG with Llama-3
ai api fastapi llama llm python retrieval-augmented-generation social-media whisper
Last synced: 12 Oct 2024
https://github.com/crucials/twaddle
Speech analysis app that collects statistics such as word frequencies, along with the transcribed text
ai audio python python-eel speech-to-text vue whisper
Last synced: 24 Oct 2024
https://github.com/leafyeexyz/counselorleaf
An AI psychological counselor that keeps you company at any time
cloudflare-api cloudflare-pages cloudflare-workers counselling counselor javascript psychology qwen react reactjs whisper
Last synced: 11 Dec 2024
https://github.com/arkaniightt/web_app_transcriptor_openai
Automatic audio-to-text transcription tool built with Streamlit and OpenAI, with support for microphone input, video, and audio file uploads.
ai app openai python streamlit tool tools transcript transcription webapp whisper
Last synced: 12 Dec 2024
https://github.com/evilfreelancer/whisper-tests
Collection of experiments on OpenAI Whisper models
api-server docker-compose testing transcription whisper
Last synced: 17 Dec 2024
https://github.com/s-emanuilov/whispercpp_kit
A wrapper on whisper.cpp with additional helper features like model management capabilities.
Last synced: 13 Dec 2024
https://github.com/whisper-666/TikTok-Login
TikTok Login With No Captcha No Proxy (unlimited requests)
api combo combo-checker proxyless tiktok tiktok-api tiktok-followers tiktok-followers-generator tiktok-followers-software tiktok-login tiktok-views whisper
Last synced: 24 Oct 2024
https://github.com/RingoMar/whisper-devcontainer
OpenAI Whisper inside a VS Code Docker devcontainer, with example files
ai devcontainer docker openapi python whisper
Last synced: 24 Oct 2024
https://github.com/javi-cc/python-openai-generator-srt
Offline Python application that transcribes and translates audio or video files to generate a subtitle file (.srt), using deep learning libraries such as openai-whisper and argos-translate.
argos-translate docker docker-compose dockerfile offline openai openai-whisper python whisper
Last synced: 18 Dec 2024
https://github.com/hanpham32/react-native-whisper
A simple text transcription web/mobile app
flask ngrok react-native transcribe whisper
Last synced: 24 Dec 2024
https://github.com/MattCode64/Scriba
SCRIBA is a web application that transcribes audio files. It supports .mp3 files and provides the transcription results in a user-friendly interface.
fastapi python speech-to-text whisper
Last synced: 24 Oct 2024
https://github.com/tylim88/voicefu-back-end
Translate Speech Into Japanese
chatgpt speech-synthesis voicevox whisper
Last synced: 18 Dec 2024
https://github.com/malexandersalazar/casey
Casey is a Voice-Activated AI Companion for Mental Wellbeing & Content Creation #BuildWithAI
agentic-ai content-creation groq large-language-models python wellbeing whisper
Last synced: 18 Dec 2024
https://github.com/ty-martz/audiologic
Python Module to process and predict on music attributes
machine-learning music python whisper
Last synced: 24 Oct 2024
https://github.com/LarissaGuder/whisper-datastream
Transcription and NER in streaming environment
bert-ner python spark-streaming whisper
Last synced: 24 Oct 2024
https://github.com/rishabhmathur06/fine-tuning-whisper-small-for-asr-
This repository contains a notebook that shows how to fine-tune OpenAI's Whisper model on a custom Hindi dataset.
artificial-intelligence asr automatic-speech-recognition fine-tuning openai python whisper whisper-model
Last synced: 19 Dec 2024
https://github.com/akhkim/babel
Real-time internal audio translator and transcriber that uses the Whisper model
ai internal-audio real-time transcription translation whisper
Last synced: 19 Dec 2024
https://github.com/youknow2509/real-time-speech-to-text
Speech To Text in Real-Time
blackhole speech-recognition speech-to-text whisper whisper-api
Last synced: 19 Dec 2024
https://github.com/heyfoz/python-openai-whisper
This Python script provides a simple interface to transcribe audio files using the OpenAI API's speech-to-text functionality, powered by the Whisper model. The result is printed to the console as plain text or in WebVTT (VTT) format.
ai api audio-transcription openai python speech-to-text whisper
Last synced: 19 Dec 2024
https://github.com/geo-y20/enhanced-learning-experience
IntelliLearn is a FastAPI-based application designed to process and transcribe audio and video files into text using the Whisper model. The application also supports processing PDF files to extract and summarize their content.
chat-application chatgpt educational-project fastapi groq-api huggingface lama llm pdf-files platform python speech-to-text text-summarization transformer whisper word2vec wordembedding
Last synced: 19 Dec 2024
https://github.com/yuxiang32/Audio-Transcription
Audio transcriber using OpenAI Whisper
Last synced: 24 Oct 2024
https://github.com/lifeosm/whisper
🐳 Docker image with OpenAI Whisper.
docker octolab speech-to-text whisper
Last synced: 24 Oct 2024
https://github.com/Franky1/AIAudioTranscriber
A minimalistic web app that generates transcriptions for audio, built using Python
openai python streamlit transcription whisper
Last synced: 24 Oct 2024
https://github.com/saamerm/whisperkit-ios15
iOS 15 - On-device Inference of Whisper Speech Recognition Models for Apple Silicon
ios ios15 swiftui whisper whisper-ai
Last synced: 26 Sep 2024
https://github.com/lukasbach/whisper-cpp-static
Static build of whisper.cpp by ggerganov
ai asr audio ml model recognition speech whisper
Last synced: 22 Nov 2024
https://github.com/samliebl/ai-whisper
Simple Node.js app: speech-to-text via whisper by OpenAI with file download.
nodejs openai speect-to-text transcription whisper whisper-ai
Last synced: 19 Dec 2024
https://github.com/fatma-moanes/voice-assistant
Voice Assistant for FM-Clinic: A multilingual AI-powered voice assistant for booking doctor appointments, leveraging advanced speech-to-text, text-to-speech, and large language models for seamless, natural user interactions.
ai-assistant arabic arabic-nlp aws-polly chatbot gpt groq langchain langsmith llm mongodb multilingual openai speech-recognition speech-to-text streamlit text-to-speech transcription voice-assistant whisper
Last synced: 26 Dec 2024
https://github.com/neiltron/autocap
ALL CAPS
closedcaptions ml subtitles transcription whisper
Last synced: 19 Dec 2024