Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Whisper

Whisper is an autoregressive language model developed by OpenAI. It is trained on a large corpus of text using a transformer architecture and is capable of generating high-quality natural language text. Whisper can be used for tasks such as language modeling, text completion, and text generation. It has shown impressive performance on various benchmarks and has been released by OpenAI to encourage research in the field of language modeling. Whisper is not yet available for public use, but it has the potential to transform the field of natural language processing and generate new opportunities for language-based applications.

https://github.com/msrsaditya/speech2speech

A Personal Digital Assistant designed to help you with quick responses.

ollama openai phi3 sox tts whisper

Last synced: 27 Jan 2025

https://github.com/wa-lead/audio2md

Summarizes audio using openai Whisper-1 model and GPT-Turbo3.5

audio-processing gpt-3 openai python whisper

Last synced: 26 Jan 2025

https://github.com/arkaniightt/web_app_transcriptor_openai

Ferramenta de transcrição automática de áudio para texto, utilizando Streamlit e OpenAI, com suporte a microfone, vídeo e upload de arquivos de áudio.

ai app openai python streamlit tool tools transcript transcription webapp whisper

Last synced: 06 Feb 2025

https://github.com/EvilFreelancer/whisper-tests

Collection of experiments on OpenAI Whisper models

api-server docker-compose testing transcription whisper

Last synced: 24 Oct 2024

https://github.com/eva-kaushik/multilingual-transcription-with-openai_whisper

Whisper Automatic Speech Recognition (ASR) Model

openai openai-api transcription webapp whisper

Last synced: 22 Dec 2024

https://github.com/jfgonsalves/scribe

Self-hosted Ollama + Whisper powered AI medical scribe.

medical ollama rag scribe whisper

Last synced: 26 Nov 2024

https://github.com/rudrodip/kittyscribe

microservice for transcribing audio/video files to text and transcoding video

docker ffmpeg python whisper

Last synced: 29 Jan 2025

https://github.com/nelzomal/videolens_ai

VideoLens AI is a powerful Chrome extension that enhances your YouTube viewing experience

ai chrome-ai gemini-nano transformers whisper wxt

Last synced: 30 Jan 2025

https://github.com/lukasbach/whisper-cpp-static

Static build of whisper.cpp by ggerganov

ai asr audio ml model recognition speech whisper

Last synced: 23 Jan 2025

https://github.com/paszkoo/real_time_whisper_iot

Real time voice transcription from default audio input using faster-whisper

ai iot-application iot-device smart-home voice-assistant voice-recognition whisper

Last synced: 17 Jan 2025

https://github.com/mario-huang/whisper-desktop

A desktop app for easy subtitle using whisper model.

ai desktop gradio open-source python pytorch tauri web-ui whisper

Last synced: 17 Jan 2025

https://github.com/heng30/vtbox

It is an offline voice to text tool. Using whisper model to transcribe.

rust slint-ui voice2text whisper

Last synced: 21 Nov 2024

https://github.com/tristan-mcinnis/Simultaneous-Interpretation

Simultaneous-Interpretation is an advanced tool for real-time simultaneous interpretation. It transcribes and translates spoken language from a microphone input instantaneously, continually refining translations for accuracy. Ideal for business meetings, educational settings, and live events, it enhances multilingual communication effortlessly.

agents asr faster-whisper openai pyaudio simultaneous-intepreting simultaneous-translation speech-recognition speech-to-text transcription translation whisper

Last synced: 08 Feb 2025

https://github.com/tobybenjaminclark/intermew

👨‍💻 Realistic, generative simulated interviews for Durhack 2024. Built using Webscraping, OpenCV, Deepface, Whisper, OpenAI and Gamemaker.

computer-vision openai-api whisper

Last synced: 25 Jan 2025

https://github.com/webmural/rewind

rewind mural

mural whisper wind

Last synced: 29 Jan 2025

https://github.com/mdbecker/whisper_cpp_macos_utils

Automated transcription workflow for macOS: Shell scripts to streamline audio recording, conversion, and transcription using whisper.cpp with macOS utilities like QuickTime Player and BlackHole-2ch.

audio-processing openai shell-scripts speech-to-text transcription whisper whisper-cpp

Last synced: 29 Jan 2025

https://github.com/stefanangelovski/voice_to_tweet

Tweet with your Voice using Whisper STT from OpenAI and Twitter4J flow to connect and talk with any account.

ai frontend openai twitter website whisper x

Last synced: 15 Dec 2024

https://github.com/deshwalmahesh/interview-help-cheat-live

As the name suggests, it helps you cheat in your live interviews or video calls. It transcribes your audio and provides answers to your query in real time. Supports equation rendering, custom prompts, text selection and editing. It's basically chatGPT for cheating in interviews

audio-transcription chatgpt fastapi huggingface interview interviews live openai pyaudio realtime transcription transformers whisper whisper-large

Last synced: 31 Dec 2024

https://github.com/notyusheng/transcribe-translate_kubernetes

Local web app for transcription and translation services for audio and video using Whisper models

docker full-stack k8s kubernetes nodejs react reactjs self-hosted speech-to-text transcribe translate whisper

Last synced: 23 Jan 2025

https://github.com/sixiaolong1117/whisperpythonscript

一个简单的 Whisper Python 脚本,可以将媒体文件的音频通过 whisper 识别成文字,并通过 pysrt 保存为字幕。

pysrt python python3 whisper whisper-ai

Last synced: 16 Jan 2025

https://github.com/s-emanuilov/whispercpp_kit

A wrapper on whisper.cpp with additional helper features like model management capabilities.

asr whisper

Last synced: 13 Dec 2024

https://github.com/televisionninja/chat

Chat with an AI Vtuber

ai chatbot llama llm tts vtube-studio vtuber whisper

Last synced: 20 Nov 2024

https://github.com/ts-azure-services/batch-transcription-examples

A repo to archive some code related to batch transcription for animation movies.

batch-transcription speech-to-text whisper

Last synced: 28 Jan 2025

https://github.com/escarrie/transcriptaudio

This is a script that can be used to transcript audio file into text file using Whisper AI

ai transcription whisper

Last synced: 17 Jan 2025

https://github.com/codewithdark-git/talktube

A powerful Streamlit application that allows users to analyze and interact with YouTube video content through natural language questions.

agents genai genai-domain groq groq-api langchain langchain-python llm lvlm lvlms pyhton3 python rag streamlit webapp whisper youtube youtube-bot

Last synced: 10 Feb 2025

https://github.com/meain/raus

Record audio until silence (RAUS)

audio hammerspoon transcription whisper whisper-cpp

Last synced: 17 Jan 2025

https://github.com/man2dev/whisper-cpp

dev fork of https://src.fedoraproject.org/rpms/whisper-cpp

fedora fedora-repository linux whisper whisper-cpp whispercpp

Last synced: 10 Feb 2025

https://github.com/hanpham32/react-native-whisper

A simple text transcription web/mobile app

flask ngrok react-native transcribe whisper

Last synced: 24 Dec 2024

https://github.com/tristan-mcinnis/simultaneous-interpretation

Simultaneous-Interpretation is an advanced tool for real-time simultaneous interpretation. It transcribes and translates spoken language from a microphone input instantaneously, continually refining translations for accuracy. Ideal for business meetings, educational settings, and live events, it enhances multilingual communication effortlessly.

agents asr faster-whisper openai pyaudio simultaneous-intepreting simultaneous-translation speech-recognition speech-to-text transcription translation whisper

Last synced: 17 Jan 2025

https://github.com/tylim88/voicefu-back-end

Translate Speech Into Japanese

chatgpt speech-synthesis voicevox whisper

Last synced: 10 Feb 2025

https://github.com/levysantiago/upload-ai

Este é um sistema que utiliza Whisper e ChatGPT da OpenAI para gerar títulos e descrições a partir da análise de vídeos submetidos.

ai artificial-intelligence axios chatgpt fastify ffmpeg nlw-13 node openai prisma react rocketseat tailwindcss typescript vite whisper zod

Last synced: 12 Jan 2025