Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Whisper
Whisper is an automatic speech recognition (ASR) model developed by OpenAI. It is an encoder-decoder transformer trained on a large, multilingual corpus of audio paired with transcripts, and it can transcribe speech in many languages, translate speech into English, and identify the spoken language. OpenAI has released the code and model weights as open source, so Whisper is publicly available and can be run locally. It underpins many of the transcription, subtitling, and voice-interface projects listed below.
- GitHub: https://github.com/topics/whisper
- Repo: https://github.com/openai/whisper
- Created by: OpenAI
- Released: September 2022
- Related Topics: machine-learning, artificial-intelligence, language-modeling
- Last updated: 2025-01-28 00:32:47 UTC
https://github.com/juanestban/whisper-tnode
cli ts typescript whisper whisper-cpp whisper-ia whisper-node whisper-node-ts
Last synced: 21 Dec 2024
https://github.com/pdcalado/waste
Whisper Audio Service for Transcription and Ergonomics
productivity rofi transcription tts whisper
Last synced: 21 Jan 2025
https://github.com/notyusheng/transcribe-translate
Local web app for transcription and translation services for audio and video using Whisper models
docker full-stack nodejs react reactjs self-hosted speech-to-text transcribe translate whisper
Last synced: 11 Oct 2024
https://github.com/aspadax/subtitlegenerator
Automatically generate a subtitle for your video.
gpt machine-learning openai rust streamlit subtitles-generator whisper
Last synced: 09 Oct 2024
https://github.com/schnoddelbotz/whisper-ui
Transcribe audio/video to text, locally on macOS, Linux and Windows. A simple whisper.cpp wrapper/UI built with Go/Fyne.
ffmpeg ffmpeg-wrapper fyne gui local privacy speech-to-text transcription whisper whisper-cpp
Last synced: 27 Jan 2025
https://github.com/jowadev/interview
Interview is an interactive application crafted to empower both students and professionals in honing their skills for job interviews.
interview-preparation job-interviews nextjs professional students whisper
Last synced: 14 Dec 2024
https://github.com/mikeesto/whispercpp-android
An Android app using whisper.cpp to do voice-to-text transcriptions
android kotlin speech-to-text whisper whisper-cpp
Last synced: 17 Dec 2024
https://github.com/volkansah/text-to-speech-pygui-for-whisper
This is a simple Python-based GUI application that allows users to generate speech from text using the OpenAI API. The application provides a user-friendly interface for inputting text and selecting from different voices to create personalized audio output.
openai openai-api python-gui-tkinter python3 whisper whisper-ai
Last synced: 27 Jan 2025
https://github.com/team-mansumugang/mansumugang-backend
The Spring Boot application for the Mansumugang (만수무강) service.
aws github-actions jpa jpa-hibernate spring-boot whisper
Last synced: 09 Oct 2024
https://github.com/huuquyet/phowhisper-next
Demo using PhoWhisper models of VinAI built with Transformers.js + Next.js
nextjs onnx-models phowhisper speech-recognition transformersjs vietnamese vinai whisper
Last synced: 19 Dec 2024
https://github.com/platput/pysubs
API to get audio transcriptions for video files from YouTube, AWS S3, and similar sources, using OpenAI Whisper.
Last synced: 24 Oct 2024
https://github.com/slinusc/speaker_identification_evaluation
Evaluating the Effectiveness of Transformer Layers in Wav2Vec 2.0, XLS-R, and Whisper for Speaker Identification Tasks
Last synced: 09 Oct 2024
https://github.com/fukuro-kun/wortweber
Wortweber is an open-source project under development that explores real-time speech transcription with AI technology. It serves as a learning and experimentation platform for speech recognition in German and English.
Last synced: 17 Jan 2025
https://github.com/adamelkholyy/whisper-yt
Toolkit for using Whisper to transcribe YouTube videos. Includes Whisper transcription of YouTube videos, conversion of YouTube video into HuggingFace dataset (using audio and subtitles) and evaluation of Whisper transcription against YouTube subtitles
asr diarization huggingface-datasets pyannote transcription whisper word-error-rate youtube
Last synced: 10 Dec 2024
https://github.com/gamut73/quizinator
Generating quizzes, on Android, from YouTube videos.
kotlin-android llm python whisper
Last synced: 19 Dec 2024
https://github.com/toomore/whisper
🔐📦📜🔑🍞 Write notes encrypted with GPG.
gpg notes pgp quickstart whisper
Last synced: 23 Jan 2025
https://github.com/ayeshaaaaaaaaa/ai-powered-video-analysis-with-object-detection-and-detailed-scene-narratives
AI-driven video analysis system that extracts and transcribes audio with Whisper, detects objects using YOLO, and generates comprehensive scene descriptions with GPT-2. The project combines transcriptions and object detections to produce detailed, context-aware video narratives.
bart gpt2 video-analysis whisper yolov8
Last synced: 02 Jan 2025
https://github.com/jpzinn654/speaker-diarization-portuguese
This project implements speaker diarization for Portuguese audio using WhisperX for transcription and pyannote.audio's Speaker-Diarization 3.1 for speaker separation. It includes a Flask UI for easy file upload, transcription, and speaker identification.
flask gender-detection portuguese-language speaker-diarization speaker-recognition speech-recognition transcription whisper
Last synced: 28 Jan 2025
https://github.com/carlosulisesochoa/whisper-ai-transcription-audio-to-text-file
A Python tool that uses OpenAI's Whisper model to batch transcribe audio files with GPU acceleration. Features include multi-language support, timestamp-based output, automatic file status checking, and CUDA support for faster processing. Perfect for transcribing lectures, interviews, or any audio content with high accuracy.
ai audio-to-text transcription whisper
Last synced: 28 Jan 2025
https://github.com/winstxnhdw/capgen
A fast CPU-first video/audio transcriber for generating caption files with Whisper and CTranslate2, hosted on Hugging Face Spaces.
asr automatic-speech-recognition caddy ctranslate2 docker fastapi huggingface huggingface-spaces uvicorn-gunicorn whisper
Last synced: 23 Oct 2024
https://github.com/bigyaa/transcription-system
This versatile tool is designed for anyone in need of a robust solution for transcribing and diarizing large volumes of audio files. Whether you are dealing with terabytes or even larger quantities, our tool ensures efficient and accurate processing. Ideal for researchers, content creators, and businesses.
accessibility diarization speech-to-text storytelling-with-data transcription whisper
Last synced: 19 Dec 2024
https://github.com/valiantlynx/custom-whisper-api
This project provides a custom API wrapper for the open-source Whisper model using FastAPI. It allows you to integrate Whisper into your applications for automatic speech recognition (ASR) tasks.
ai docker-compose fastapi python whisper
Last synced: 10 Jan 2025
https://github.com/tranbavinhson/eth-decentralized-chat
Decentralized chat app by Ethereum Whisper protocol + Vuejs
ethereum vue vuejs whisper whisper-protocol
Last synced: 26 Dec 2024
https://github.com/marquesafonso/multilang-asr-captioner
A multilingual automatic speech recognition and video captioning tool using faster-whisper. Supports real-time translation to English. Runs on consumer-grade CPUs.
automatic-speech-recognition captioning-videos faster-whisper whisper
Last synced: 24 Oct 2024
https://github.com/rhysdg/whisper-onnx-python
A low-footprint GPU accelerated Speech to Text Python package for the Jetpack 5 era bolstered by an optimized graph
ai chatbot cuda machine-learning onnxruntime speech-to-text whisper
Last synced: 09 Oct 2024
https://github.com/bbc-esq/whisper-solo-with-gui
OpenAI's Whisper program with a simple lightweight GUI.
pyqt pyqt6 pyqt6-gui transcribe transcribe-audio-files translate whisper
Last synced: 11 Jan 2025
https://github.com/maylad31/colab-codes
some useful colab files
clip colab-notebook speech-recognition whisper zero-shot-classification
Last synced: 11 Jan 2025
https://github.com/cris-m/langgraph_examples
duckduckgo kokoro langgraph llama3-2 whisper
Last synced: 18 Jan 2025
https://github.com/antoniosbarotsis/telegram-transcriber
A Telegram bot for transcribing voice messages
telegram transcribe voice whisper
Last synced: 26 Dec 2024
https://github.com/Op27/meeting_minutes_generator
This Python application automates the process of generating meeting minutes from an audio recording. It uses the Whisper library for transcription and the OpenAI GPT models for summarizing content, then outputs the result in a Word document.
ai audio-processing document-automation meeting-minutes openai python speech-recognition text-summarization transcription whisper
Last synced: 24 Oct 2024
https://github.com/nerdimite/meetsy-backend
AI Backend for the Workshop on Building an End-to-End AI Meeting Assistant
gpt-3 nextjs sentence-transformers tailwindcss whisper
Last synced: 24 Oct 2024
https://github.com/adisol07/sharpspeech
SharpSpeech is a free, local, and open-source approach to speech and wake-word recognition.
audio speech speech-recognition speech-to-text wake-word-detection wakeword whisper whisper-ai
Last synced: 19 Dec 2024
https://github.com/lazauk/aoai-entraidauth-sdkv1
Authenticating with Entra ID (former Azure AD) to access Azure OpenAI models in Python SDK v1.x
ai authentication azure azure-active-directory dall-e embeddings entra-id gpt openai whisper
Last synced: 12 Jan 2025
https://github.com/TranBaVinhSon/eth-decentralized-chat
Decentralized chat app by Ethereum Whisper protocol + Vuejs
ethereum vue vuejs whisper whisper-protocol
Last synced: 24 Oct 2024
https://github.com/niqifan007/openai-tts-stt-streamlit
A GUI for OpenAI's TTS (text-to-speech) and STT (speech-to-text) APIs, built with Streamlit, with a history function.
openai openai-api streamlit stt-gui tts tts-gui whisper whisper-api
Last synced: 09 Oct 2024
https://github.com/bluebirdback/groq-subtitles
Batch video subtitle generation using Groq Whisper API
groq speech-to-text subtitles video whisper
Last synced: 21 Dec 2024
https://github.com/aixerum/faster-whisper
faster-whisper is a reimplementation of OpenAI's Whisper model using CTranslate2, which is a fast inference engine for Transformer models. This implementation is up to 4 times faster than openai/whisper for the same accuracy while using less memory. The efficiency can be further improved with 8-bit quantization on both CPU and GPU.
ctranslate2 gpu transcription whisper
Last synced: 07 Jan 2025
https://github.com/theaussiepom/wyoming-openai
OpenAI STT and TTS support for the Wyoming protocol
home-assistant home-assistant-assist openai sst tts whisper wyoming
Last synced: 21 Dec 2024
https://github.com/zdwolfe/transcription-tools
Docker video transcriber, wrapper around OpenAI
openai transcription whisper whisper-ai
Last synced: 02 Jan 2025
https://github.com/arkapravo-ghosh/speech-to-text
Speech to Text Transcription using OpenAI Whisper v3 and FastAPI
ai fastapi huggingface machine-learning openai python3 speech-to-text transformers whisper
Last synced: 21 Dec 2024
https://github.com/miosipof/whisper_inference
OpenAI Whisper ASR inference on CPU with OpenVino, PyTorch or Huggingface
asr inference machine-learning openvino pytorch whisper
Last synced: 07 Jan 2025
https://github.com/ashot72/answering-questions-about-images
You can upload images, ask questions about images using voice prompts, then listen to the responses in voice
answering-questions blip-2-ai-model gtts large-language-models llm replicate speech-to-text text-to-speech whisper
Last synced: 30 Dec 2024
https://github.com/miosipof/asr_train
Fine-tuning OpenAI Whisper for ASR tasks on low-size datasets
asr machine-learning nlp whisper
Last synced: 07 Jan 2025
https://github.com/chloelavrat/speech-to-text-app
Speech-to-text web app based on Streamlit and Whisper that extracts a transcript from audio files or YouTube videos.
audio-processing machine-learning machinelearning speech-to-text streamlit streamlit-webapp stt whisper whisper-ai
Last synced: 02 Jan 2025
https://github.com/ekito-station/whisper-api-unity
A sample that performs transcription in Unity using the OpenAI Whisper API.
Last synced: 20 Dec 2024
https://github.com/vifill/audio-recorder-and-summarizer
This project is a Python script that records system audio on macOS using BlackHole, transcribes the audio using OpenAI's Whisper API, and summarizes the transcription using OpenAI's GPT models
ai audio blackhole gpt openai records summarize system whisper
Last synced: 20 Dec 2024
https://github.com/valkryst/whisper_automations
Various scripts for automating tasks using OpenAI's Whisper.
automation openai subtitle subtitle-generator transcription translation whisper
Last synced: 26 Dec 2024
https://github.com/soenneker/soenneker.libraries.whisper.ctranslate
Simply adds the Whisper_CTranslate2 Windows executable, updated daily (if available)
ai csharp ctranslate ctranslate2 dotnet faster libraries library whisper whisperctranslate
Last synced: 29 Dec 2024
https://github.com/bilelouahmed/vocal-assistant
Python voice assistant (based on SpeechRecognition, Whisper and XTTS models) designed to transcribe speech to text, translate across languages, engage in chat mode, and ultimately respond vocally.
chatbot llm mistral-7b neo4j python rag speech-recognition text-to-speech transcription whisper xtts
Last synced: 21 Dec 2024
https://github.com/huuquyet/phowhisper-tiny
Converted clone of PhoWhisper: Automatic Speech Recognition for Vietnamese (2024)
onnx-models phowhisper speech-recognition transformersjs vietnamese vinai whisper
Last synced: 06 Dec 2024
https://github.com/zahidhasann88/video-summarizer
Summarizes videos by extracting audio and generating summaries based on the audio content.
nodejs openai typescript whisper
Last synced: 07 Jan 2025
https://github.com/cnseniorious000/dl-a2t
download, audio-to-text PyPI: https://pypi.org/p/dl-a2t
audio transcription whisper youtube
Last synced: 02 Jan 2025
https://github.com/evil0ctal/whisper-speech-to-text-api
An open source Speech-to-Text API. The project is based on OpenAI's Whisper model and uses the asynchronous features of FastAPI to efficiently wrap it and support more custom functions.
ai api fastapi openai-whisper speech-to-text speech-to-text-api whisper whisper-ai whisper-api
Last synced: 25 Oct 2024
https://github.com/deshwalmahesh/whisper-fastapi-realtime
A front-end and back-end app that uses openai/whisper-large-v3-turbo on consumer-grade hardware to provide real-time audio transcription.
audio-transcription fastapi huggingface live pyaudio realtime transcription transformers whisper whisper-large
Last synced: 25 Oct 2024
https://github.com/cp3249/athena_project
Athena is an AI assistant project that utilizes voice recognition, text-to-speech, and tool-calling capabilities to provide a conversational and interactive experience. It uses LLMs available through Ollama and provides a basic framework for extending functionalities through a modular tool system.
Last synced: 15 Jan 2025
https://github.com/ubos-tech/node-red-contrib-speech-to-text-ubos
Learn how to turn audio into text.
ai low-code lowcode node-red node-red-contrib node-red-flow openai openai-api openai-whisper speech-to-text whisper whisper-ai whisper-api
Last synced: 20 Jan 2025
https://github.com/mottla/speech-to-text
Local and fast speech-to-text (STT) with speaker recognition. Transcribe your meetings confidentially.
huggingface speech-recognition stt teams transcription translation whisper zoom
Last synced: 21 Jan 2025
https://github.com/xi-rick/captains-log
Captain's Log is your personal AI-powered voice transcription logbook. This innovative web application allows you to transcribe spoken words into text, organize your thoughts, and manage important notes. Built with cutting-edge technology and creative design, Captain's Log sets sail to revolutionize how you capture and manage ideas.
audio-recorder audio-visualizer javascript mongodb mongodb-atlas nextjs once-ui openai react reactjs shadcn-ui tailwindcss typescript voice whisper
Last synced: 21 Jan 2025
https://github.com/luizcalaca/transcricao-medica
Full Stack + Whisper Transcription + Node.js REST API + VITE + React.js + Railway deploy
full-stack nodejs openai openai-api railway reactjs sequelize sequelize-orm vite whisper whisper-ai
Last synced: 25 Jan 2025
https://github.com/sbadulin/obsidian-dictation-plugin
Obsidian dictation plugin
dictation gpt-35-turbo obsidian obsidian-plugin openai speech-to-text whisper
Last synced: 07 Dec 2024
https://github.com/huuquyet/phowhisper-small
Converted clone of PhoWhisper: Automatic Speech Recognition for Vietnamese (2024)
onnx-models phowhisper speech-recognition transformersjs vietnamese vinai whisper
Last synced: 06 Dec 2024
https://github.com/pjarbas/azure-ai
Examples using Azure AI services (DALLE3, Text to Speech, Whisper)
azure-openai dalle-3 image-generation-ai speech-synthesis text-to-speech whisper
Last synced: 21 Jan 2025
https://github.com/jgw96/speech-to-text-web-toolkit
Making Speech-To-Text on the web easy, both local and in the cloud
ai lit transformersjs webcomponents whisper
Last synced: 06 Dec 2024
https://github.com/joaobraganca555/extractionanalysistool
Cloud-based tool for multimedia data extraction and analysis, focusing on influencer content. Utilizes YOLOv8 for object/logo detection, Whisper.AI for speech recognition, and EasyOCR for OCR. Includes sentiment analysis with a scalable microservice architecture for content monitoring.
aws-s3 content-monitor docker easyocr fastapi image-classification logo-detection microservices multimedia-data-analysis object-detection ocr python rabbitmq sentiment-analysis speech-recognition streamlit whisper yolov8
Last synced: 28 Jan 2025
https://github.com/arslanex/whisperdemo
A scalable Python module for robust audio transcription using OpenAI's Whisper model. Supports multiple languages, batch processing, and output formats like JSON and SRT.
audio-processing openai openai-whisper python whisper
Last synced: 23 Nov 2024
https://github.com/fkiller/whispertranscript
Transcribe voice from mic input using OpenAI Whisper API.
llm openai transcribe transcript transcription webaudio whisper
Last synced: 06 Jan 2025
https://github.com/tomdewildt/whisper-experiment
Experiments using the Whisper model from OpenAI
colab jupyter python transcribe transformers translate whisper
Last synced: 27 Dec 2024
https://github.com/philogicae/docker-faster-whisper-fr-api
Docker - Faster Whisper FR - RunPod Serverless API
ctranslate2 docker faster-whisper french runpod serverless whisper
Last synced: 08 Jan 2025
https://github.com/bilalhameed248/whisper-fine-tuning-for-pronunciation-learning
Fine Tuning of Whisper Speech To Text Base Model For Pronunciation Learning
deep-learning deep-neural-networks dnn fine-tuning openai pronunciation python seq2seq speech speech-recognition speech-synthesis speech-to-text whisper whisper-ai
Last synced: 16 Jan 2025
https://github.com/educa-ch/educa24-speech-to-summary
Demonstrator for an open-source speech-to-summary workflow
langchain ollama open-source open-weight speech-to-text summarization whisper
Last synced: 11 Oct 2024
https://github.com/neiltron/autocap
ALL CAPS
closedcaptions ml subtitles transcription whisper
Last synced: 19 Dec 2024
https://github.com/njorogemaurice/speech-recognition-openai-whisper
This project is a web-based application that utilizes OpenAI's Whisper for speech-to-text conversion. The application allows users to upload audio files or record audio directly from their browser, and then converts the speech in these audio files to text using the Whisper model.
openai speech-recognition speech-to-text whisper
Last synced: 14 Jan 2025
https://github.com/a-iceberg/whisper-timestamped
Timestamped ASR microservice
asr audio-to-text automatic-speech-recognition data-analysis data-science deep-learning docker fastapi mlops monitoring mssqlserver openai prompt-engineering python resource-management timestamps uvicorn-gunicorn whisper
Last synced: 18 Jan 2025
https://github.com/sudiptab2100/waku-user-chat
Waku Chat using Usernames
communication-protocol decentralised-application decentralized ethereum ipfs libp2p waku waku-connect web3 whisper zk-snarks zkp
Last synced: 20 Dec 2024
https://github.com/luluw8071/whisper-tune
Finetuning Whisper on your own voice
Last synced: 14 Dec 2024
https://github.com/kristofferv98/whisper_turboapi
An optimized FastAPI server for OpenAI's Whisper whisper-large-v3-turbo model using MLX turbo optimization
ai api asynchronous audio audio-processing fastapi huggingface machine-learning macos mlx model-serving nlp openai optimization python speech-to-text synchronous transcription whisper whisper-turbo
Last synced: 14 Dec 2024
https://github.com/kitschpatrol/ambient-novel
An interface for nonlinear interactive exploration of a novel.
ambient book fiction interactive novel svelte whisper
Last synced: 20 Jan 2025
https://github.com/soenneker/soenneker.runners.whisper.ctranslate
Automatically updates the Soenneker.Whisper.CTranslate package
ai csharp ctranslate ctranslate2 dotnet faster library runner runners whisper whisperctranslate
Last synced: 28 Dec 2024
https://github.com/sivakumar-mahalingam/subtitle-generator
🎞️ Automatically generating subtitles for video files using Whisper ASR model in Python
ai audio-model audio-processing automatic-speech-recognition openai-whisper python speech-recognition speech-to-text subtitle-generator whisper
Last synced: 09 Oct 2024
https://github.com/orhancavus/transcribe_video
Extract Subtitles from YouTube Videos with OpenAI Whisper and Insanely Fast Whisper
insanely-fast speach-to-text whisper
Last synced: 09 Jan 2025
https://github.com/homelab-00/longformstt
A python script that utilizes faster-whisper and pytorch for long form transcription. Uses silence detection with RMS/peak value. Has global hotkeys for easy use.
faster-whisper python speech-to-text whisper
Last synced: 09 Jan 2025
https://github.com/aidayang/faster-whisper-oneclick
One-click launcher bundle for faster-whisper with a GUI.
deep-learning faster-whisper inference openai quantization speech-recognition speech-to-text transformer whisper
Last synced: 09 Jan 2025
https://github.com/mickekring/top-of-mind-beromfabriken
Giving praise to a colleague can feel a little awkward, but research has shown that it can make us feel better at work and even make us more productive. Hearing that colleagues value and notice you simply improves your well-being.
api gpt openai python transcription whisper
Last synced: 16 Jan 2025
https://github.com/man2dev/whisper-cpp
dev fork of https://src.fedoraproject.org/rpms/whisper-cpp
fedora fedora-repository linux whisper whisper-cpp whispercpp
Last synced: 09 Oct 2024
https://github.com/yousofss/speechtotext
Speech-to-Text using OpenAI's Whisper model
audio-to-text openai openai-whisper speech-to-text transcription whisper whisper-ai
Last synced: 09 Oct 2024
https://github.com/nexuslux/simultaneous-interpretation
Simultaneous-Interpretation is an advanced tool for real-time simultaneous interpretation. It transcribes and translates spoken language from a microphone input instantaneously, continually refining translations for accuracy. Ideal for business meetings, educational settings, and live events, it enhances multilingual communication effortlessly.
agents asr faster-whisper openai pyaudio simultaneous-intepreting simultaneous-translation speech-recognition speech-to-text transcription translation whisper
Last synced: 09 Oct 2024
https://github.com/vlazic/json-verbose-to-vtt-converter
Transform `json_verbose` transcriptions from OpenAI, Groq, or command-line tools into VTT files with this Deno converter.
converter groq json json-verbose openai vtt webvtt whisper
Last synced: 26 Jan 2025
https://github.com/levysantiago/upload-ai
This is a system that uses OpenAI's Whisper and ChatGPT to generate titles and descriptions from the analysis of submitted videos.
ai artificial-intelligence axios chatgpt fastify ffmpeg nlw-13 node openai prisma react rocketseat tailwindcss typescript vite whisper zod
Last synced: 12 Jan 2025
https://github.com/tristan-mcinnis/simultaneous-interpretation
Simultaneous-Interpretation is an advanced tool for real-time simultaneous interpretation. It transcribes and translates spoken language from a microphone input instantaneously, continually refining translations for accuracy. Ideal for business meetings, educational settings, and live events, it enhances multilingual communication effortlessly.
agents asr faster-whisper openai pyaudio simultaneous-intepreting simultaneous-translation speech-recognition speech-to-text transcription translation whisper
Last synced: 17 Jan 2025
https://github.com/meain/raus
Record audio until silence (RAUS)
audio hammerspoon transcription whisper whisper-cpp
Last synced: 17 Jan 2025
https://github.com/escarrie/transcriptaudio
A script for transcribing an audio file into a text file using Whisper AI.
Last synced: 17 Jan 2025
https://github.com/flo-bit/youtube-speaker-separation
Simple Python script that outputs separate audio files for each speaker in a YouTube video, using Whisper on Replicate.
speaker-diarization speech-to-text text-to-speech voice-cloning whisper youtube
Last synced: 19 Dec 2024
https://github.com/khushijtrivedi/speech
The Assistive Speech Technology System is designed to enhance communication by analyzing and processing various speech and audio inputs.
ajax bigru-crf bootstrap flask flask-server html-css-javascript librosa python restapi-framework voice-recognition whisper
Last synced: 09 Oct 2024