Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Whisper

Whisper is an automatic speech recognition (ASR) model developed by OpenAI. It is an encoder-decoder transformer trained on a large corpus of multilingual audio paired with transcripts, and it converts speech into text. Whisper can be used for tasks such as multilingual transcription, speech translation into English, and spoken language identification. OpenAI released the code and model weights as open source in 2022, and the projects listed below build on it for subtitling, voice assistants, transcription APIs, and fine-tuning.

https://github.com/status-im/infra-role-status-go

Ansible role for status-go

ansible-role infra waku whisper

Last synced: 09 Nov 2024

https://github.com/barrylee111/voicechat-llm

A chatbot with both prompt and voicechat capabilities leveraging LangChain, Elasticsearch, and FastAPI. When using voicechat, the user can immerse themselves in the experience by selecting a narrator, like a pirate for instance.

elasticsearch fastapi langchain largelanguagemodel python react speech-to-text tailwind text-to-speech typescript websocket whisper

Last synced: 19 Dec 2024

https://github.com/xawos/owt

🦙🗣️ Ollama and Whisper Telegram bot, with advanced configuration

ai-bots local-ai ollama telegram-aichatbot telegram-bots whisper

Last synced: 03 Dec 2024

https://github.com/studiowebux/tommygotchi

whisper, piper, llama-gpt, python, fun .. so much fun !

llama-gpt piper python3 whisper whisper-ai

Last synced: 09 Nov 2024

https://github.com/rishabhmathur06/fine-tuning-whisper-small-for-asr-

This repository contains a notebook that shows how to fine-tune OpenAI's Whisper model on a custom Hindi dataset.

artificial-intelligence asr automatic-speech-recognition fine-tuning openai python whisper whisper-model

Last synced: 19 Dec 2024

https://github.com/webmural/rewind

rewind mural

mural whisper wind

Last synced: 01 Dec 2024

https://github.com/brunogaliati/speech2text-investments

This project automates the download, transcription, and summarization of audio from YouTube videos. Using OpenAI's Whisper model, it converts video content into concise text summaries with an investment analyst's perspective, ideal for professionals needing quick insights.

chatgpt investment openai politics python speech-recognition speech-to-text whisper

Last synced: 19 Dec 2024

https://github.com/danibcorr/university-helper

🧑‍🎓 University Helper streamlines academic and administrative tasks for students, educators, and researchers. It provides tools for managing document metadata, converting PDFs to Markdown, transcribing audio, analyzing grade statistics, and more.

deep-learning documentation-tool metadata ocr open-source pdf python statistics university whisper

Last synced: 19 Dec 2024

https://github.com/akhkim/babel

Real-time internal audio translator and transcriber that uses the Whisper model

ai internal-audio real-time transcription translation whisper

Last synced: 19 Dec 2024

https://github.com/educa-ch/educa24-speech-to-summary

Demonstrator for an open-source speech-to-summary workflow

langchain ollama open-source open-weight speech-to-text summarization whisper

Last synced: 11 Oct 2024

https://github.com/firefly55lm/bisbigliatorev2

Automatic audio transcriber notebook based on Whisper

colab-notebook speech-to-text whisper

Last synced: 25 Nov 2024

https://github.com/luizcalaca/transcricao-medica

Full Stack + Whisper Transcription + Node.js REST API + VITE + React.js + Railway deploy

full-stack nodejs openai openai-api railway reactjs sequelize sequelize-orm vite whisper whisper-ai

Last synced: 25 Nov 2024

https://github.com/flaviodelgrosso/whisper-transcriber

Use OpenAI's Whisper to transcribe audio files and diarize the speakers in the transcribed text

ai audio-to-text diarization openai torch whisper

Last synced: 19 Dec 2024

https://github.com/rudrodip/kittyscribe

microservice for transcribing audio/video files to text and transcoding video

docker ffmpeg python whisper

Last synced: 01 Dec 2024

https://github.com/javi-cc/python-openai-generator-srt

Offline Python application that transcribes and translates audio or video files into text and generates a subtitle file (.srt), using deep learning libraries such as openai-whisper and argos-translate.

argos-translate docker docker-compose dockerfile offline openai openai-whisper python whisper

Last synced: 18 Dec 2024

https://github.com/vlazic/json-verbose-to-vtt-converter

Transform `json_verbose` transcriptions from OpenAI, Groq, or command-line tools into VTT files with this Deno converter.

converter groq json json-verbose openai vtt webvtt whisper

Last synced: 26 Nov 2024

https://github.com/jfgonsalves/scribe

Self-hosted Ollama + Whisper powered AI medical scribe.

medical ollama rag scribe whisper

Last synced: 26 Nov 2024

https://github.com/doctorpok42/pheere

Pheere is a simple virtual assistant

ai chatgpt elevenlabs ts virtual-assistant whisper

Last synced: 11 Nov 2024

https://github.com/luluw8071/whisper-tune

Finetuning Whisper on your own voice

whisper

Last synced: 14 Dec 2024

https://github.com/arslanex/whisperdemo

A scalable Python module for robust audio transcription using OpenAI's Whisper model. Supports multiple languages, batch processing, and output formats like JSON and SRT.

audio-processing openai openai-whisper python whisper

Last synced: 23 Nov 2024

https://github.com/eva-kaushik/multilingual-transcription-with-openai_whisper

Whisper Automatic Speech Recognition (ASR) Model

openai openai-api transcription webapp whisper

Last synced: 22 Dec 2024

https://github.com/microsoft/azure-ai-foundry-whatsapp-bot

WhatsApp Bot built with Azure Functions and Azure AI Foundry, using Python.

azure-ai-foundry azure-functions azure-openai python whatsapp-api whatsapp-bot whisper

Last synced: 27 Nov 2024

https://github.com/kristofferv98/whisper_turboapi

An optimized FastAPI server for OpenAI's Whisper model whisper-large-v3-turbo, using MLX turbo optimization

ai api asynchronous audio audio-processing fastapi huggingface machine-learning macos mlx model-serving nlp openai optimization python speech-to-text synchronous transcription whisper whisper-turbo

Last synced: 14 Dec 2024

https://github.com/hanpham32/react-native-whisper

A simple text transcription web/mobile app

flask ngrok react-native transcribe whisper

Last synced: 24 Dec 2024

https://github.com/nelzomal/videolens_ai

VideoLens AI is a powerful Chrome extension that enhances your YouTube viewing experience

ai chrome-ai gemini-nano transformers whisper wxt

Last synced: 02 Dec 2024

https://github.com/RingoMar/whisper-devcontainer

OpenAI Whisper inside a VS Code Docker dev container, with example files

ai devcontainer docker openapi python whisper

Last synced: 24 Oct 2024

https://github.com/egorsmkv/optimized-whisper-intel

Run quantized Whisper models only on CPU with Intel hardware

intel onnx onnxruntime quantized-neural-networks whisper

Last synced: 19 Dec 2024

https://github.com/tylim88/voicefu-back-end

Translate Speech Into Japanese

chatgpt speech-synthesis voicevox whisper

Last synced: 18 Dec 2024

https://github.com/sivakumar-mahalingam/subtitle-generator

🎞️ Automatically generating subtitles for video files using Whisper ASR model in Python

ai audio-model audio-processing automatic-speech-recognition openai-whisper python speech-recognition speech-to-text subtitle-generator whisper

Last synced: 09 Oct 2024

https://github.com/godmode2k/whisper.cpp.android

whisper.cpp.android with CLBlast(OpenCL), Translation (Google ML-Kit) and TTS

android clblast ggml kotlin ml-kit openai-whisper opencl tts whisper whisper-ai whisper-cpp

Last synced: 10 Nov 2024

https://github.com/valkryst/whisper_automations

Various scripts for automating tasks using OpenAI's Whisper.

automation openai subtitle subtitle-generator transcription translation whisper

Last synced: 06 Nov 2024

https://github.com/MattCode64/Scriba

SCRIBA is a web application that transcribes audio files. It supports .mp3 files and provides the transcription results in a user-friendly interface.

fastapi python speech-to-text whisper

Last synced: 24 Oct 2024

https://github.com/msrsaditya/speech2speech

A Personal Digital Assistant designed to help you with quick responses.

ollama openai phi3 sox tts whisper

Last synced: 28 Nov 2024

https://github.com/pawelzeja098/whisper-video-transcription

Testing OpenAI's Whisper to transcribe videos

mp4 transcription whisper whisper-ai

Last synced: 28 Nov 2024

https://github.com/devgeekm/chat-it-up

Chat It Up! elevates conversations by transforming YouTube URLs, documents, and audio into text, enabling interactive Q&A and summaries. With one click, turn media into time-saving, knowledge-rich dialogues.

ai azure azure-functions azureservices blob-storage fastapi python rag whisper youtube-dl

Last synced: 20 Dec 2024

https://github.com/jalvarezz13/summarai

SummarAI utilizes PyMovie and Whisper to transcribe videos, enabling you to ask questions about the content using Llama2 and Llama-index for insightful interaction.

llama-index llama2 pymovie whisper

Last synced: 22 Dec 2024

https://github.com/mrbuslov/reminder_4u_bot

AI Telegram Bot Reminder. You send a free-form text OR voice reminder, the AI bot records it and reminds you at the right time!

ai ai-bot aiogram chatgpt django gpt-3 gpt-4 gpt-models python reminder telegram-bot voice-recognition whisper

Last synced: 12 Nov 2024

https://github.com/zahidhasann88/video-summarizer

Summarizes videos by extracting audio and generating summaries based on the audio content.

nodejs openai typescript whisper

Last synced: 10 Nov 2024

https://github.com/concaption/containerized-transcription-api

Containerized Transcription API using Whisper Model and FastAPI

docker fastapi openai transcription whisper

Last synced: 16 Dec 2024

https://github.com/asai95/speech-recognition-api

Simple but extensible API for Speech Recognition.

speech-recognition whisper

Last synced: 08 Nov 2024

https://github.com/miosipof/asr_train

Fine-tuning OpenAI Whisper for ASR tasks on small datasets

asr machine-learning nlp whisper

Last synced: 10 Nov 2024

https://github.com/seanvelasco/ai

Cloudflare AI challenge submission: Slater - your virtual foreign language friend

ai artificial-intelligence language-learning llama2 llm m2m100 machine-learning whisper

Last synced: 09 Dec 2024

https://github.com/senkita/gabriel

Video summarization tool.

summarizer whisper

Last synced: 09 Oct 2024

https://github.com/ty-martz/audiologic

Python Module to process and predict on music attributes

machine-learning music python whisper

Last synced: 24 Oct 2024

https://github.com/man2dev/whisper-cpp

dev fork of https://src.fedoraproject.org/rpms/whisper-cpp

fedora fedora-repository linux whisper whisper-cpp whispercpp

Last synced: 09 Oct 2024

https://github.com/velocitatem/dontlectureme

A program that pays attention to your lectures for you.

ai lectures university whisper

Last synced: 03 Dec 2024

https://github.com/aquibali01/voice-to-text-and-voice-chatbot

Voice-to-Voice Chatbot using Whisper, LLaMA, and Groq API

chatbot gtts llama8b llm opeai python voice whisper

Last synced: 19 Dec 2024

https://github.com/nexuslux/simultaneous-interpretation

Simultaneous-Interpretation is an advanced tool for real-time simultaneous interpretation. It transcribes and translates spoken language from a microphone input instantaneously, continually refining translations for accuracy. Ideal for business meetings, educational settings, and live events, it enhances multilingual communication effortlessly.

agents asr faster-whisper openai pyaudio simultaneous-intepreting simultaneous-translation speech-recognition speech-to-text transcription translation whisper

Last synced: 09 Oct 2024

https://github.com/jplhughes/whisper_logit_lens

This Alignment Jam Hackathon project explores whether the concept of "logit lens" applies to the encoder and decoder layers in Whisper, an end-to-end speech recognition model.

alignment-jam asr interpretability interpretability-jam logitlens whisper

Last synced: 24 Oct 2024

https://github.com/zuplyx/subtitle-creator

Add English subtitles to videos using openai/whisper-large-v3

open-ai poetry-python python3 subtitles-generator whisper

Last synced: 09 Dec 2024

https://github.com/zdwolfe/transcription-tools

Docker video transcriber, wrapper around OpenAI

openai transcription whisper whisper-ai

Last synced: 08 Nov 2024

https://github.com/hydrol0x/retriever

A new aid for the visually impaired powered by AI

elevenlabs llm palm visual-impairment-aid whisper

Last synced: 14 Nov 2024

https://github.com/miosipof/whisper_inference

OpenAI Whisper ASR inference on CPU with OpenVino, PyTorch or Huggingface

asr inference machine-learning openvino pytorch whisper

Last synced: 10 Nov 2024

https://github.com/bloodworks-io/phlox

Self-hosted Ollama + Whisper powered AI medical scribe.

medical ollama rag scribe whisper

Last synced: 18 Dec 2024

https://github.com/malexandersalazar/casey

Casey is a Voice-Activated AI Companion for Mental Wellbeing & Content Creation #BuildWithAI

agentic-ai content-creation groq large-language-models python wellbeing whisper

Last synced: 18 Dec 2024

https://github.com/thealphamerc/audio-to-text

Transcribe multilingual audio clips using the Whisper model

openai whisper

Last synced: 16 Dec 2024

https://github.com/ashot72/speech-to-text-to-speech

Node.js app where you can ask questions to ChatGPT using voice prompts, see the ChatGPT-like word-by-word answer, and then listen to the responses with voice

chatgpt langchain large-language-models llm speech-to-text speech-to-text-to-speech text-to-speech whisper

Last synced: 08 Nov 2024

https://github.com/LarissaGuder/whisper-datastream

Transcription and NER in streaming environment

bert-ner python spark-streaming whisper

Last synced: 24 Oct 2024

https://github.com/yuxiang32/Audio-Transcription

Audio transcriber using OpenAI Whisper

openai whisper

Last synced: 24 Oct 2024

https://github.com/saadkh1/docqa-textsummarization-app

A Streamlit app for document question answering and text summarization.

langchain llama-2 llamacpp pytesseract question-answering streamlit summarization whisper

Last synced: 10 Nov 2024

https://github.com/josemarcosrf/Lexicap-QA

QA retrieval for Lex Fridman's podcast transcriptions

lexicap qa search whisper

Last synced: 24 Oct 2024

https://github.com/cnseniorious000/dl-a2t

download, audio-to-text PyPI: https://pypi.org/p/dl-a2t

audio transcription whisper youtube

Last synced: 09 Nov 2024

https://github.com/charlot-dedjinou/hackathon-ia-multimodal-multilingue

During this hackathon, we developed Smart VT, an AI-based web application designed to subtitle and dub any video from one language into another of your choice. The project relies on a React frontend, Python APIs for video processing, and Node.js for managing video subtitles.

api dubble fastapi ffmpeg googletranslator mongodb moviepy nodejs openia reactjs subtitles whisper

Last synced: 12 Nov 2024

https://github.com/nicknaskida/cog-whisper-diarization

Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote

diarization openai-whisper pyannote replicate speaker-diarization whisper whisper-faster whisperx

Last synced: 27 Sep 2024

https://github.com/lazauk/aoai-entraidauth-sdkv1

Authenticating with Entra ID (formerly Azure AD) to access Azure OpenAI models in Python SDK v1.x

ai authentication azure azure-active-directory dall-e embeddings entra-id gpt openai whisper

Last synced: 13 Nov 2024

https://github.com/ivanrj7j/transcription

This project transcribes audio using whisper and provides an api

ai api flask transcription whisper

Last synced: 09 Oct 2024

https://github.com/simongino/whisper-fastapi

A FastAPI-based application integrating Whisper for efficient speech recognition and processing.

ai docker fastapi python whisper

Last synced: 09 Oct 2024

https://github.com/waikato-llm/whisper

Docker images for the whisper audio transcription library and variants.

audio transcription whisper

Last synced: 13 Nov 2024

https://github.com/dheison0/subcreator

A subtitle creator, translator, and embedder tool made using AI

ai machine-learning ml python subtitles video-processing whisper

Last synced: 09 Oct 2024

https://github.com/obay-ismaeel/post-generator

An API that generates social media posts by implementing RAG with Llama-3

ai api fastapi llama llm python retrieval-augmented-generation social-media whisper

Last synced: 12 Oct 2024

https://github.com/lifeosm/whisper

🐳 Docker image with OpenAI Whisper.

docker octolab speech-to-text whisper

Last synced: 24 Oct 2024

https://github.com/heyfoz/python-openai-whisper

This Python script provides a simple interface to transcribe audio files using the OpenAI API's speech-to-text functionality, powered by the Whisper model. The result is returned to the console as text or VTT (WebVTT) format.

ai api audio-transcription openai python speech-to-text whisper

Last synced: 19 Dec 2024

https://github.com/theaussiepom/wyoming-openai

OpenAI STT and TTS support for the Wyoming protocol

home-assistant home-assistant-assist openai sst tts whisper wyoming

Last synced: 21 Dec 2024

https://github.com/geo-y20/enhanced-learning-experience

IntelliLearn is a FastAPI-based application designed to process and transcribe audio and video files into text using the Whisper model. The application also supports processing PDF files to extract and summarize their content.

chat-application chatgpt educational-project fastapi groq-api huggingface lama llm pdf-files platform python speech-to-text text-summarization transformer whisper word2vec wordembedding

Last synced: 19 Dec 2024

https://github.com/Franky1/AIAudioTranscriber

A minimalistic web app that generates transcriptions for audio, built using Python

openai python streamlit transcription whisper

Last synced: 24 Oct 2024

https://github.com/ainoya/chrome-extension-web-transcriptor-ai

Privacy-focused Chrome extension that transcribes audio from browser tabs locally using transformers.js and the TabCapture API. All processing occurs within the browser, ensuring that audio data is never sent to external servers.

chrome-extension chrome-extensions transformersjs whisper

Last synced: 11 Nov 2024

https://github.com/arkapravo-ghosh/speech-to-text

Speech to Text Transcription using OpenAI Whisper v3 and FastAPI

ai fastapi huggingface machine-learning openai python3 speech-to-text transformers whisper

Last synced: 21 Dec 2024

https://gitlab.com/ifrz/asr-multi-lite

Testing of the main ASR frameworks with reduced models for low-resource languages speech recognition

distilhubert wav2vec2 whisper

Last synced: 24 Oct 2024

https://github.com/ekito-station/whisper-api-unity

Sample that performs transcription in Unity using the OpenAI Whisper API

unity whisper

Last synced: 20 Dec 2024

https://github.com/levysantiago/upload-ai

This is a system that uses OpenAI's Whisper and ChatGPT to generate titles and descriptions from the analysis of submitted videos.

ai artificial-intelligence axios chatgpt fastify ffmpeg nlw-13 node openai prisma react rocketseat tailwindcss typescript vite whisper zod

Last synced: 13 Nov 2024

https://github.com/tristan-mcinnis/simultaneous-interpretation

Simultaneous-Interpretation is an advanced tool for real-time simultaneous interpretation. It transcribes and translates spoken language from a microphone input instantaneously, continually refining translations for accuracy. Ideal for business meetings, educational settings, and live events, it enhances multilingual communication effortlessly.

agents asr faster-whisper openai pyaudio simultaneous-intepreting simultaneous-translation speech-recognition speech-to-text transcription translation whisper

Last synced: 16 Nov 2024