Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Whisper

Whisper is an autoregressive language model developed by OpenAI. It is trained on a large corpus of text using a transformer architecture and is capable of generating high-quality natural language text. Whisper can be used for tasks such as language modeling, text completion, and text generation. It has shown impressive performance on various benchmarks and has been released by OpenAI to encourage research in the field of language modeling. Whisper is not yet available for public use, but it has the potential to transform the field of natural language processing and generate new opportunities for language-based applications.

https://github.com/vlazic/json-verbose-to-vtt-converter

Transform `json_verbose` transcriptions from OpenAI, Groq, or command-line tools into VTT files with this Deno converter.

converter groq json json-verbose openai vtt webvtt whisper

Last synced: 26 Nov 2024

https://github.com/jfgonsalves/scribe

Self-hosted Ollama + Whisper powered AI medical scribe.

medical ollama rag scribe whisper

Last synced: 26 Nov 2024

https://github.com/pratikpakhale/terravis

Voice guided GIS system

genai gis lam llm voice whisper

Last synced: 09 Oct 2024

https://github.com/eva-kaushik/multilingual-transcription-with-openai_whisper

Whisper Automatic Speech Recognition (ASR) Model

openai openai-api transcription webapp whisper

Last synced: 22 Dec 2024

https://github.com/microsoft/azure-ai-foundry-whatsapp-bot

WhatsApp Bot built with Azure Functions and Azure AI Foundry, using Python.

azure-ai-foundry azure-functions azure-openai python whatsapp-api whatsapp-bot whisper

Last synced: 27 Nov 2024

https://github.com/nelzomal/videolens_ai

VideoLens AI is a powerful Chrome extension that enhances your YouTube viewing experience

ai chrome-ai gemini-nano transformers whisper wxt

Last synced: 02 Dec 2024

https://github.com/msrsaditya/speech2speech

A Personal Digital Assistant designed to help you with quick responses.

ollama openai phi3 sox tts whisper

Last synced: 28 Nov 2024

https://github.com/pawelzeja098/whisper-video-transcription

Testing whisper Open-AI to transcribe videos

mp4 transcription whisper whisper-ai

Last synced: 28 Nov 2024

https://github.com/dheison0/subcreator

A subtitle creator, translator and embeder tool made using AI

ai machine-learning ml python subtitles video-processing whisper

Last synced: 09 Oct 2024

https://github.com/simongino/whisper-fastapi

A FastAPI-based application integrating Whisper for efficient speech recognition and processing.

ai docker fastapi python whisper

Last synced: 09 Oct 2024

https://github.com/concaption/containerized-transcription-api

Containerized Transcription API using Whisper Model and FastAPI

docker fastapi openai transcription whisper

Last synced: 16 Dec 2024

https://github.com/ivanrj7j/transcription

This project transcribes audio using whisper and provides an api

ai api flask transcription whisper

Last synced: 09 Oct 2024

https://github.com/seanvelasco/ai

Cloudflare AI challenge submission: Slater - your virtual foreign language friend

ai artificial-intelligence language-learning llama2 llm m2m100 machine-learning whisper

Last synced: 09 Dec 2024

https://github.com/velocitatem/dontlectureme

A program that pays attention to your lectures for you.

ai lectures university whisper

Last synced: 03 Dec 2024

https://github.com/ifeech/subtitler

Creating subtitles from video

subtitles whisper

Last synced: 09 Oct 2024

https://github.com/breadrock1/audio-to-text

There is simple backend project to use whisper-rs.

actix-web audio-to-text rust swagger-ui whisper

Last synced: 11 Nov 2024

https://github.com/zuplyx/subtitle-creator

Add english subtitles to videos using openai/whisper-large-v3

open-ai poetry-python python3 subtitles-generator whisper

Last synced: 09 Dec 2024

https://github.com/jalvarezz13/summarai

SummarAI utilizes PyMovie and Whisper to transcribe videos, enabling you to ask questions about the content using Llama2 and Llama-index for insightful interaction.

llama-index llama2 pymovie whisper

Last synced: 22 Dec 2024

https://github.com/thealphamerc/audio-to-text

Transcribe multi-lingual audio clips using whisper model

openai whisper

Last synced: 16 Dec 2024

https://github.com/ajxv/rtstt

Real time speech to text transcription using OpenAi whisper

live-transcription openai openai-whisper python3 transcription whisper

Last synced: 22 Dec 2024

https://github.com/obay-ismaeel/post-generator

An API that generates social media posts by implementing RAG with Llama-3

ai api fastapi llama llm python retrieval-augmented-generation social-media whisper

Last synced: 12 Oct 2024

https://github.com/crucials/twaddle

speech analysis app that collects statistics like words frequencies and transcribed text

ai audio python python-eel speech-to-text vue whisper

Last synced: 24 Oct 2024

https://github.com/arkaniightt/web_app_transcriptor_openai

Ferramenta de transcrição automática de áudio para texto, utilizando Streamlit e OpenAI, com suporte a microfone, vídeo e upload de arquivos de áudio.

ai app openai python streamlit tool tools transcript transcription webapp whisper

Last synced: 12 Dec 2024

https://github.com/evilfreelancer/whisper-tests

Collection of experiments on OpenAI Whisper models

api-server docker-compose testing transcription whisper

Last synced: 17 Dec 2024

https://github.com/s-emanuilov/whispercpp_kit

A wrapper on whisper.cpp with additional helper features like model management capabilities.

asr whisper

Last synced: 13 Dec 2024

https://github.com/RingoMar/whisper-devcontainer

Openai whisper inside of vscode docker devcontainer using example files

ai devcontainer docker openapi python whisper

Last synced: 24 Oct 2024

https://github.com/javi-cc/python-openai-generator-srt

Application that works offline written in python that transcribes and translates either audio or video files into text to generate a subtitle file (.srt) using deep learning libraries such as openai-whisper and argos-translate.

argos-translate docker docker-compose dockerfile offline openai openai-whisper python whisper

Last synced: 18 Dec 2024

https://github.com/hanpham32/react-native-whisper

A simple text transcription web/mobile app

flask ngrok react-native transcribe whisper

Last synced: 24 Dec 2024

https://github.com/MattCode64/Scriba

SCRIBA is a web application that transcribes audio files. It supports .mp3 files and provides the transcription results in a user-friendly interface.

fastapi python speech-to-text whisper

Last synced: 24 Oct 2024

https://github.com/tylim88/voicefu-back-end

Translate Speech Into Japanese

chatgpt speech-synthesis voicevox whisper

Last synced: 18 Dec 2024

https://github.com/bloodworks-io/phlox

Self-hosted Ollama + Whisper powered AI medical scribe.

medical ollama rag scribe whisper

Last synced: 18 Dec 2024

https://github.com/malexandersalazar/casey

Casey is a Voice-Activated AI Companion for Mental Wellbeing & Content Creation #BuildWithAI

agentic-ai content-creation groq large-language-models python wellbeing whisper

Last synced: 18 Dec 2024

https://github.com/ty-martz/audiologic

Python Module to process and predict on music attributes

machine-learning music python whisper

Last synced: 24 Oct 2024

https://github.com/LarissaGuder/whisper-datastream

Transcription and NER in streaming environment

bert-ner python spark-streaming whisper

Last synced: 24 Oct 2024

https://github.com/rishabhmathur06/fine-tuning-whisper-small-for-asr-

This repository contains notebook that shows how to fine-tune OpenAI's Whisper model on custom Hindi dataset.

artificial-intelligence asr automatic-speech-recognition fine-tuning openai python whisper whisper-model

Last synced: 19 Dec 2024

https://github.com/akhkim/babel

Real-time Internal Audio Translate and Transcriber that uses Whisper model

ai internal-audio real-time transcription translation whisper

Last synced: 19 Dec 2024

https://github.com/heyfoz/python-openai-whisper

This Python script provides a simple interface to transcribe audio files using the OpenAI API's speech-to-text functionality, powered by the Whisper model. The result is returned to the console as text or VTT (WebVTT) format.

ai api audio-transcription openai python speech-to-text whisper

Last synced: 19 Dec 2024

https://github.com/geo-y20/enhanced-learning-experience

IntelliLearn is a FastAPI-based application designed to process and transcribe audio and video files into text using the Whisper model. The application also supports processing PDF files to extract and summarize their content.

chat-application chatgpt educational-project fastapi groq-api huggingface lama llm pdf-files platform python speech-to-text text-summarization transformer whisper word2vec wordembedding

Last synced: 19 Dec 2024

https://github.com/yuxiang32/Audio-Transcription

Audio transcriber using OpenAI Whisper

openai whisper

Last synced: 24 Oct 2024

https://github.com/lifeosm/whisper

🐳 Docker image with OpenAI Whisper.

docker octolab speech-to-text whisper

Last synced: 24 Oct 2024

https://github.com/Franky1/AIAudioTranscriber

A minimalistic web app to generate transciption for audio built using Python

openai python streamlit transcription whisper

Last synced: 24 Oct 2024

https://github.com/saamerm/whisperkit-ios15

iOS 15 - On-device Inference of Whisper Speech Recognition Models for Apple Silicon

ios ios15 swiftui whisper whisper-ai

Last synced: 26 Sep 2024

https://github.com/lukasbach/whisper-cpp-static

Static build of whisper.cpp by ggerganov

ai asr audio ml model recognition speech whisper

Last synced: 22 Nov 2024

https://github.com/samliebl/ai-whisper

Simple Node.js app: speech-to-text via whisper by OpenAI with file download.

nodejs openai speect-to-text transcription whisper whisper-ai

Last synced: 19 Dec 2024

https://github.com/fatma-moanes/voice-assistant

Voice Assistant for FM-Clinic: A multilingual AI-powered voice assistant for booking doctor appointments, leveraging advanced speech-to-text, text-to-speech, and large language models for seamless, natural user interactions.

ai-assistant arabic arabic-nlp aws-polly chatbot gpt groq langchain langsmith llm mongodb multilingual openai speech-recognition speech-to-text streamlit text-to-speech transcription voice-assistant whisper

Last synced: 26 Dec 2024