Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Whisper

Whisper is an autoregressive language model developed by OpenAI. It is trained on a large corpus of text using a transformer architecture and is capable of generating high-quality natural language text. Whisper can be used for tasks such as language modeling, text completion, and text generation. It has shown impressive performance on various benchmarks and has been released by OpenAI to encourage research in the field of language modeling. Whisper is not yet available for public use, but it has the potential to transform the field of natural language processing and generate new opportunities for language-based applications.

https://github.com/samliebl/ai-whisper

Simple Node.js app: speech-to-text via whisper by OpenAI with file download.

nodejs openai speect-to-text transcription whisper whisper-ai

Last synced: 19 Dec 2024

https://github.com/LarissaGuder/whisper-datastream

Transcription and NER in streaming environment

bert-ner python spark-streaming whisper

Last synced: 24 Oct 2024

https://github.com/javi-cc/python-openai-generator-srt

Application that works offline written in python that transcribes and translates either audio or video files into text to generate a subtitle file (.srt) using deep learning libraries such as openai-whisper and argos-translate.

argos-translate docker docker-compose dockerfile offline openai openai-whisper python whisper

Last synced: 18 Dec 2024

https://github.com/hanpham32/react-native-whisper

A simple text transcription web/mobile app

flask ngrok react-native transcribe whisper

Last synced: 24 Dec 2024

https://github.com/yuxiang32/Audio-Transcription

Audio transcriber using OpenAI Whisper

openai whisper

Last synced: 24 Oct 2024

https://github.com/tylim88/voicefu-back-end

Translate Speech Into Japanese

chatgpt speech-synthesis voicevox whisper

Last synced: 18 Dec 2024

https://github.com/nexuslux/simultaneous-interpretation

Simultaneous-Interpretation is an advanced tool for real-time simultaneous interpretation. It transcribes and translates spoken language from a microphone input instantaneously, continually refining translations for accuracy. Ideal for business meetings, educational settings, and live events, it enhances multilingual communication effortlessly.

agents asr faster-whisper openai pyaudio simultaneous-intepreting simultaneous-translation speech-recognition speech-to-text transcription translation whisper

Last synced: 09 Oct 2024

https://github.com/khushijtrivedi/speech

The Assistive Speech Technology System is designed to enhance communication by analyzing and processing various speech and audio inputs.

ajax bigru-crf bootstrap flask flask-server html-css-javascript librosa python restapi-framework voice-recognition whisper

Last synced: 09 Oct 2024

https://github.com/luluw8071/whisper-tune

Finetuning Whisper on your own voice

whisper

Last synced: 14 Dec 2024

https://github.com/doctorpok42/pheere

Pheere is a simple virtual assistant

ai chatgpt elevenlabs ts virtual-assistant whisper

Last synced: 11 Nov 2024

https://github.com/dtbuchholz/yt-timestamps-subtitles

Generate YouTube timestamps and subtitles from a video file with OpenAI Whisper and GPT-4

gpt-4 subtitles timestamp whisper youtube

Last synced: 15 Dec 2024

https://github.com/meain/raus

Record audio until silence (RAUS)

audio hammerspoon transcription whisper whisper-cpp

Last synced: 17 Nov 2024

https://github.com/chinese-soup/cbot-telegram-whisper

Simple bot that transcribes Telegram voice messages. Powered by go-telegram-bot-api & whisper.cpp Go bindings.

bot cpu-inference golang openai speech-recognition speech-to-text whisper whisper-cpp whispercpp

Last synced: 16 Nov 2024

https://github.com/goktugcy/noteai

An artificial intelligence supported NodeJS application that allows the audio file to be displayed as pdf after converting it to text with the Whisper tool.

adonisjs whisper whisper-ai whisper-api

Last synced: 15 Nov 2024

https://github.com/escarrie/transcriptaudio

This is a script that can be used to transcript audio file into text file using Whisper AI

ai transcription whisper

Last synced: 17 Nov 2024

https://github.com/lifeosm/whisper

🐳 Docker image with OpenAI Whisper.

docker octolab speech-to-text whisper

Last synced: 24 Oct 2024

https://github.com/Franky1/AIAudioTranscriber

A minimalistic web app to generate transciption for audio built using Python

openai python streamlit transcription whisper

Last synced: 24 Oct 2024

https://github.com/cp3249/athena_project

Athena is an AI assistant project that utilizes voice recognition, text-to-speech, and tool-calling capabilities to provide a conversational and interactive experience. It uses LLMs available through Ollama and provides a basic framework for extending functionalities through a modular tool system.

coqui-tts llm ollama whisper

Last synced: 03 Dec 2024

https://github.com/bloodworks-io/phlox

Self-hosted Ollama + Whisper powered AI medical scribe.

medical ollama rag scribe whisper

Last synced: 18 Dec 2024

https://github.com/tristan-mcinnis/simultaneous-interpretation

Simultaneous-Interpretation is an advanced tool for real-time simultaneous interpretation. It transcribes and translates spoken language from a microphone input instantaneously, continually refining translations for accuracy. Ideal for business meetings, educational settings, and live events, it enhances multilingual communication effortlessly.

agents asr faster-whisper openai pyaudio simultaneous-intepreting simultaneous-translation speech-recognition speech-to-text transcription translation whisper

Last synced: 16 Nov 2024

https://github.com/malexandersalazar/casey

Casey is a Voice-Activated AI Companion for Mental Wellbeing & Content Creation #BuildWithAI

agentic-ai content-creation groq large-language-models python wellbeing whisper

Last synced: 18 Dec 2024

https://github.com/televisionninja/chat

Chat with an AI Vtuber

ai chatbot llama llm tts vtube-studio vtuber whisper

Last synced: 20 Nov 2024

https://github.com/sixiaolong1117/whisperpythonscript

一个简单的 Whisper Python 脚本,可以将媒体文件的音频通过 whisper 识别成文字,并通过 pysrt 保存为字幕。

pysrt python python3 whisper whisper-ai

Last synced: 15 Nov 2024

https://github.com/levysantiago/upload-ai

Este é um sistema que utiliza Whisper e ChatGPT da OpenAI para gerar títulos e descrições a partir da análise de vídeos submetidos.

ai artificial-intelligence axios chatgpt fastify ffmpeg nlw-13 node openai prisma react rocketseat tailwindcss typescript vite whisper zod

Last synced: 13 Nov 2024

https://github.com/waikato-llm/whisper

Docker images for the whisper audio transcription library and variants.

audio transcription whisper

Last synced: 13 Nov 2024

https://github.com/lazauk/aoai-entraidauth-sdkv1

Authenticating with Entra ID (former Azure AD) to access Azure OpenAI models in Python SDK v1.x

ai authentication azure azure-active-directory dall-e embeddings entra-id gpt openai whisper

Last synced: 13 Nov 2024

https://github.com/charlot-dedjinou/hackathon-ia-multimodal-multilingue

Lors de ce hackathon, nous avons développé la solution Smart VT, une application web basée sur l'IA conçue pour sous-titrer et doubler n'importe quelle vidéo d'une langue à une autre (selon votre choix). Le projet s'appuie sur un frontend en React, des API Python pour le traitement des vidéos, et Node.js pour la gestion des sous-titres vidéo.

api dubble fastapi ffmpeg googletranslator mongodb moviepy nodejs openia reactjs subtitles whisper

Last synced: 12 Nov 2024

https://github.com/ainoya/chrome-extension-web-transcriptor-ai

Privacy-focused Chrome extension that transcribes audio from browser tabs locally using transformers.js and the TabCapture API. All processing occurs within the browser, ensuring that audio data is never sent to external servers.

chrome-extension chrome-extensions transformersjs whisper

Last synced: 11 Nov 2024

https://github.com/mariatepei/vt_thesis_mtepei

This repository accompanies my MSc Thesis for the degree Voice Technology, storing all referenced data and other relevant resources.

data-augmentation fastspeech2 speech-recognition whisper

Last synced: 09 Oct 2024

https://github.com/kitschpatrol/ambient-novel

An interface for nonlinear interactive exploration of a novel.

ambient book fiction interactive novel svelte whisper

Last synced: 19 Nov 2024

https://github.com/ts-azure-services/batch-transcription-examples

A repo to archive some code related to batch transcription for animation movies.

batch-transcription speech-to-text whisper

Last synced: 30 Nov 2024

https://github.com/fer14/videoseek

Intelligent video search tool powered by AI

bert timestamp video whisper youtube-api

Last synced: 14 Nov 2024

https://github.com/njorogemaurice/speech-recognition-openai-whisper

This project is a web-based application that utilizes OpenAI's Whisper for speech-to-text conversion. The application allows users to upload audio files or record audio directly from their browser, and then converts the speech in these audio files to text using the Whisper model.

openai speech-recognition speech-to-text whisper

Last synced: 14 Nov 2024

https://github.com/kristofferv98/semanthavoiceassistant

A comprehensive AI companion leveraging advanced semantic analysis, sentiment detection, and voice processing to provide personalized and context-aware interactions using Autogen, semantic-router, and VoiceProcessingToolkit.

ai-companion autogen elevenlabs intent-detection local-llm natural-language-processing openai personalized-interactions picovoice python rag semantic-routing sentiment-analysis text-to-speech voice-activity-detection voice-assistant voice-processing voice-recognition websearch whisper

Last synced: 11 Nov 2024

https://github.com/pkarpovich/kira-client

An AI-powered voice automation tool for IoT, integrating voice-triggered commands, OpenAI-driven intent recognition, and HTTP server management for seamless control of smart devices

ai-assistant intent-classification porcupine trigger-word-detection whisper

Last synced: 14 Nov 2024

https://github.com/aquibali01/voice-to-text-and-voice-chatbot

Voice-to-Voice Chatbot using Whisper, LLaMA, and Groq API

chatbot gtts llama8b llm opeai python voice whisper

Last synced: 19 Dec 2024

https://github.com/datvm/openaiwhisperclient

A HTML page for using OpenAI Whisper API for transcripting, including making subtitles. JSON is also supported.

client-side openai subtitle timestamp transcript transcription whisper whisper-ai

Last synced: 15 Dec 2024

https://github.com/stefanangelovski/voice_to_tweet

Tweet with your Voice using Whisper STT from OpenAI and Twitter4J flow to connect and talk with any account.

ai frontend openai twitter website whisper x

Last synced: 15 Dec 2024

https://github.com/flyingfathead/youwhisper-cli

A streamlined CLI tool combining `yt-dlp` and `whisperx` (or `openai-whisper`) for quick and efficient audio transcription from various video platforms.

cli cli-app python transcribe transcriber transcription whisper whisper-ai whisperx youtube-downloader yt-dlp yt-dlp-wrapper

Last synced: 12 Nov 2024

https://github.com/status-im/infra-role-status-go

Ansible role for status-go

ansible-role infra waku whisper

Last synced: 09 Nov 2024

https://github.com/huuquyet/phowhisper-small

Converted clone of PhoWhisper: Automatic Speech Recognition for Vietnamese (2024)

onnx-models phowhisper speech-recognition transformersjs vietnamese vinai whisper

Last synced: 06 Dec 2024

https://github.com/studiowebux/tommygotchi

whisper, piper, llama-gpt, python, fun .. so much fun !

llama-gpt piper python3 whisper whisper-ai

Last synced: 09 Nov 2024

https://github.com/jgw96/speech-to-text-web-toolkit

Making Speech-To-Text on the web easy, both local and in the cloud

ai lit transformersjs webcomponents whisper

Last synced: 06 Dec 2024