Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Whisper

Whisper is an autoregressive language model developed by OpenAI. It is trained on a large corpus of text using a transformer architecture and is capable of generating high-quality natural language text. Whisper can be used for tasks such as language modeling, text completion, and text generation. It has shown impressive performance on various benchmarks and has been released by OpenAI to encourage research in the field of language modeling. Whisper is not yet available for public use, but it has the potential to transform the field of natural language processing and generate new opportunities for language-based applications.

https://github.com/flo-bit/youtube-speaker-separation

simple python script that outputs separate audio files for each speaker in a youtube video, using whisper on replicate

speaker-diarization speech-to-text text-to-speech voice-cloning whisper youtube

Last synced: 19 Dec 2024

https://github.com/jfgonsalves/scribe

Self-hosted Ollama + Whisper powered AI medical scribe.

medical ollama rag scribe whisper

Last synced: 26 Nov 2024

https://github.com/escarrie/transcriptaudio

This is a script that can be used to transcript audio file into text file using Whisper AI

ai transcription whisper

Last synced: 17 Jan 2025

https://github.com/meain/raus

Record audio until silence (RAUS)

audio hammerspoon transcription whisper whisper-cpp

Last synced: 17 Jan 2025

https://github.com/baomeomeo/speech

A Speech-To-Text (with translation) library for Go; currently uses Whisper (runs locally if needed; no need in any API keys)

ai converter go golang library module package speech speech-recognition speech-to-text text whisper

Last synced: 13 Jan 2025

https://github.com/evil0ctal/whisper-speech-to-text-api

An open source Speech-to-Text API. The project is based on OpenAI's Whisper model and uses the asynchronous features of FastAPI to efficiently wrap it and support more custom functions.

ai api fastapi openai-whisper speech-to-text speech-to-text-api whisper whisper-ai whisper-api

Last synced: 25 Oct 2024

https://github.com/dtbuchholz/yt-timestamps-subtitles

Generate YouTube timestamps and subtitles from a video file with OpenAI Whisper and GPT-4

gpt-4 subtitles timestamp whisper youtube

Last synced: 15 Dec 2024

https://github.com/eva-kaushik/multilingual-transcription-with-openai_whisper

Whisper Automatic Speech Recognition (ASR) Model

openai openai-api transcription webapp whisper

Last synced: 22 Dec 2024

https://github.com/isladot/speech-to-text-whisper

A speech-to-text converter powered by OpenAI's Whisper model. Easy-to-use tool for transcribing audio into text with high accuracy.

ai python s2t speech-to-text whisper

Last synced: 19 Jan 2025

https://github.com/tristan-mcinnis/simultaneous-interpretation

Simultaneous-Interpretation is an advanced tool for real-time simultaneous interpretation. It transcribes and translates spoken language from a microphone input instantaneously, continually refining translations for accuracy. Ideal for business meetings, educational settings, and live events, it enhances multilingual communication effortlessly.

agents asr faster-whisper openai pyaudio simultaneous-intepreting simultaneous-translation speech-recognition speech-to-text transcription translation whisper

Last synced: 17 Jan 2025

https://github.com/levysantiago/upload-ai

Este é um sistema que utiliza Whisper e ChatGPT da OpenAI para gerar títulos e descrições a partir da análise de vídeos submetidos.

ai artificial-intelligence axios chatgpt fastify ffmpeg nlw-13 node openai prisma react rocketseat tailwindcss typescript vite whisper zod

Last synced: 12 Jan 2025

https://github.com/wa-lead/audio2md

Summarizes audio using openai Whisper-1 model and GPT-Turbo3.5

audio-processing gpt-3 openai python whisper

Last synced: 26 Jan 2025

https://github.com/nanext21/vidcraft

VidCraft is an AI-driven backend application that generates videos from user-defined topics and backgrounds. It combines text, audio, and visuals using advanced AI services, making video creation accessible and efficient for developers and content creators alike.

elevenlabs fastapi ffmpgeg full-stack-web-development gemini-ai github-config image-generation machine-learning mern-project subtitles typescript video-generation whisper whisper-ai

Last synced: 19 Jan 2025

https://github.com/status-im/infra-role-status-go

Ansible role for status-go

ansible-role infra waku whisper

Last synced: 05 Jan 2025

https://github.com/msrsaditya/speech2speech

A Personal Digital Assistant designed to help you with quick responses.

ollama openai phi3 sox tts whisper

Last synced: 27 Jan 2025

https://github.com/concaption/containerized-transcription-api

Containerized Transcription API using Whisper Model and FastAPI

docker fastapi openai transcription whisper

Last synced: 16 Dec 2024

https://github.com/seanvelasco/ai

Cloudflare AI challenge submission: Slater - your virtual foreign language friend

ai artificial-intelligence language-learning llama2 llm m2m100 machine-learning whisper

Last synced: 09 Dec 2024

https://github.com/jt-427/whisper-ui

A minimalist and elegant UI for OpenAI's Whisper speech-to-text model, built with React + Vite and Flask

flask openai react speech-to-text transcription vite whisper

Last synced: 19 Jan 2025