Projects in Awesome Lists tagged with diarization
A curated list of projects in awesome lists tagged with diarization .
https://github.com/argmaxinc/argmax-oss-swift
On-device Speech AI for Apple Silicon
diarization inference ios macos pyannote qwen3-tts speech-recognition speech-to-text swift text-to-speech transformers visionos watchos whisper
Last synced: 19 Apr 2026
https://github.com/purfview/whisper-standalone-win
Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
asr ctranslate2 diarization faster-whisper openai speaker-diarization speech-recognition speech-to-text subtitles transcriber uvr vocal-extractor whisper whisper-faster whisperx
Last synced: 14 May 2025
https://github.com/Purfview/whisper-standalone-win
Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
asr ctranslate2 diarization faster-whisper openai speaker-diarization speech-recognition speech-to-text subtitles transcriber uvr vocal-extractor whisper whisper-faster whisperx
Last synced: 28 Mar 2025
https://github.com/r3gm/sonitranslate
Synchronized Translation for Videos. Video dubbing
asr audio-processing automatic-dubbing diarization document-translator dubbing speech-to-text stt subtitle-to-speech text-to-speech translate-audio translate-video translation tts video-dubbing
Last synced: 12 Oct 2025
https://github.com/transcriptionstream/transcriptionstream
turnkey self-hosted offline transcription and diarization service with llm summary
automation diarization llm mistral-7b ollama speaker-diarization speech-recognition transcription whisper whisperx
Last synced: 07 Apr 2025
https://github.com/microsoft/unispeech
UniSpeech - Large Scale Self-Supervised Learning for Speech
diarization pytorch speaker-verification speech speech-diarization speech-processing speech-recognition speech-separation
Last synced: 04 Apr 2025
https://github.com/revdotcom/reverb
Open source inference code for Rev's model
asr asr-model canary deeplearning diarization docker huggingface neural-network open-source opensource pyannote rev revai speaker-diarization speech-recognition speech-to-text speechrecognition wenet whisper
Last synced: 15 May 2025
https://github.com/homelab-00/transcriptionsuite
A fully local and private Speech-To-Text app with cross-platform support, speaker diarization, Audio Notebook mode, LM Studio integration, and both longform and live transcription.
diarization dictation docker faster-whisper linux local macos mlx nemo notebook open-source parakeet realtime speech-to-text tailscale transcription vibevoice whisper whisperx windows
Last synced: 27 Apr 2026
https://github.com/thewh1teagle/sherpa-rs
Rust bindings to https://github.com/k2-fsa/sherpa-onnx
audio diarization embeddings rust sherpa speech-recognition
Last synced: 08 Apr 2025
https://github.com/suyashmore/mevonai-speech-emotion-recognition
Identify the emotion of multiple speakers in an Audio Segment
artificial-intelligence colab-notebook convolutional-neural-networks deep-learning diarization emotion-analysis emotion-recognition keras-tensorflow machine-learning mfcc mfcc-analysis speech-processing uis-rnn
Last synced: 18 Oct 2025
https://github.com/narcotic-sh/senko
Very fast, accurate speaker diarization
audio-ai diarization fbank pyannote rapids silero-vad speaker-diarization zanshin
Last synced: 02 Oct 2025
https://github.com/desh2608/dover-lap
Python package for combining diarization system outputs.
diarization dover-lap ensemble-machine-learning
Last synced: 17 Mar 2025
https://github.com/bunyaminergen/callytics
Callytics is an advanced call analytics solution that leverages speech recognition and large language models (LLMs) technologies to analyze phone conversations from customer service and call centers.
denoising diarization forced-alignment llama3 llm openai opensource sentiment-analysis speech-emotion-recognition speech-processing speech-recognition speech-to-text summary topic-modeling transcription voice-activity-detection voice-recognition
Last synced: 03 Apr 2025
https://github.com/wq2012/simpleder
A lightweight library to compute Diarization Error Rate (DER).
diarization machine-learning metrics speaker-diarization speech-processing speech-recognition
Last synced: 30 Aug 2025
https://github.com/jschmie/scraibe
Tool for automatic transcription and speaker diarization based on whisper and pyannote.
diarization speech-to-text transcription
Last synced: 08 Sep 2025
https://github.com/thewh1teagle/pyannote-rs
pyannote audio diarization in rust
asr diarization onnxruntime rust speech-recognition whisper
Last synced: 23 Oct 2025
https://github.com/R3gm/SoniTranslate
Synchronized Translation for Videos
audio-processing diarization translate-audio translate-video translation
Last synced: 11 Apr 2025
https://github.com/picovoice/falcon
On-device speaker diarization powered by deep learning
deep-learning diarization on-device speaker-diarization speaker-recognition
Last synced: 31 Mar 2025
https://github.com/desh2608/spyder
Simple Python package for fast DER computation
Last synced: 02 Aug 2025
https://github.com/namastexlabs/murmurai
🎙️ Drop-in replacement for paid transcription APIs. Self-hosted, GPU-powered, speaker diarization. Free forever: uvx murmurai
ai api asr diarization stt text-to-speech transcription tts whisper-ai whisperx
Last synced: 29 Jan 2026
https://github.com/harmlessman/pafts
PAFTS : Library That Preprocessing Audio For TTS.
asr diarization separator speech-to-text stt tts whisper
Last synced: 30 Apr 2026
https://github.com/cadia-lvl/kaldi-speaker-diarization
This repository creates speaker diarization recipes to be used within the egs folder of kaldi.
ahc audio-files diarization icelandic kaldi mfccs plda speaker-diarization wav
Last synced: 11 Mar 2026
https://github.com/pulijon/sttcast
Transcription from mp3 files to html with or without embedded player
ansible artificial-intelligence automation aws-ec2 aws-s3 diarization g4dn gpu iac puppet python terraform transcription vagrant vosk-engine whisper whisperx
Last synced: 12 Apr 2025
https://github.com/thewh1teagle/loud.cpp
Whisper.cpp with diarization
cpp diarization onnxruntime openai sherpa-onnx transcription whisper
Last synced: 08 Oct 2025
https://github.com/r3dbars/transcripted
Turn meetings and dictation into clean notes. Transcripted keeps it local and turns spoken audio into .md files.
agent-context coreml diarization dictation local-ai local-first macos meeting-recorder privacy speaker-recognition speech-to-text swift transcription
Last synced: 15 May 2026
https://github.com/Gr122lyBr/voicetag
Speaker identification powered by pyannote and resemblyzer
audio-transcription deep-learning deepgram diarization groq machine-learning nlp pyannote python resemblyzer speaker-diarization speaker-identification speaker-recognition speech-processing speech-to-text transcription voice-recognition whisper whisper-ai
Last synced: 03 Apr 2026
https://github.com/elmiraghorbani/gpt-speaker-diarization
Conversational Speaker Diarization using OpenAI AI Language Models(gpt-4) and OpenAI Whisper.
asr diarization gpt-4 openai speaker-diarization speech-recognition speech-to-text voice-activity-detection whisper youtube-dl
Last synced: 10 Oct 2025
https://github.com/bunyaminergen/wavlmmsdd
This repository combines `WavLM`, a powerful speech representation model from Microsoft, with `MSDD` (Multi-Scale Diarization Decoder), a state-of-the-art approach for speaker diarization from Nvidia.
diarization embedding microsoft nvidia-nemo speaker-diarization speech speech-embedding wavlm
Last synced: 15 Aug 2025
https://github.com/jeanjerome/echoinstone
EchoInStone is an audio processing tool that transcribes, diarizes, and aligns speaker segments from audio files, prioritizing accuracy and reliability.
alignment diarization localhost pyannote python transcribe whisper
Last synced: 10 Apr 2025
https://github.com/amatofrancesco99/speech_to_dialogue
This streamlit web-app has been developed in order to obtain starting from a recorded audio-track the correspondent dialogue.
diarization python speech-to-text streamlit-application
Last synced: 16 Mar 2025
https://github.com/adamelkholyy/whisper-yt
Toolkit for using Whisper to transcribe YouTube videos. Includes Whisper transcription of YouTube videos, conversion of YouTube video into HuggingFace dataset (using audio and subtitles) and evaluation of Whisper transcription against YouTube subtitles
asr diarization huggingface-datasets pyannote transcription whisper word-error-rate youtube
Last synced: 03 Feb 2026
https://github.com/schemalabz/opencouncil-tasks
Backend task processing service for the OpenCouncil platform. This service handles various audio, video, and content processing tasks through both a REST API and CLI interface.
ai-summarizer civic-tech diarization hacktoberfest transcription
Last synced: 25 Apr 2026
https://github.com/edoardopona/ara
Ara (think parrot :parrot: ) is a script / api to transcribe and diarise audio. It uses Whisper and Pyannote
audio-processing deep-learning diarization language transcription whisper
Last synced: 25 Apr 2026
https://github.com/bigyaa/transcription-system
This versatile tool is designed for anyone in need of a robust solution for transcribing and diarizing large volumes of audio files. Whether you are dealing with terabytes or even larger quantities, our tool ensures efficient and accurate processing. Ideal for researchers, content creators, and businesses.
accessibility diarization speech-to-text storytelling-with-data transcription whisper
Last synced: 20 Jan 2026
https://github.com/eddiegulay/rtrimmer
Python package to trim RTTM diarization files and optionally audio files to a user-specified time range.
audio diarization pyannote rttm tanzania trimming
Last synced: 02 Apr 2026
https://github.com/theseraphim/scribe-forge-ai
🎵 Complete offline audio transcription system with speaker diarization using OpenAI Whisper and PyAnnote. Features automatic audio cleaning, precise timestamps, multiple output formats (JSON/TXT/Markdown), and support for 20+ audio formats. No external APIs required - works entirely offline.
audio-analysis audio-cleaning audio-processing audio-transcription diarization ffmpeg huggingface machine-learning multi-speaker nlp offline-transcription openai-whisper pyannote python speaker-diarization speech-recognition speech-to-text timestamps transcription-tool whisper
Last synced: 05 May 2026
https://github.com/fgonzalesc/transcripcion_ai
TranscripciĂłn de audios con Azure Speech y extracciĂłn de insights con Open AI
ai azure dataprocessing diarization openai-api python speechtotext
Last synced: 18 May 2026
https://github.com/nicknaskida/insanely-fast-whisper
Incredibly fast Whisper-large-v3 with speaker diarization
diarization speaker-diarization transfromers whisper whisper-ai whisper-faster whisper-large
Last synced: 29 Sep 2025
https://github.com/mtwn105/audio-intel
AudioIntel - Audio/Video Intelligence, Transcripts, Summary, and much more
ai assemblyai audio audio-processing diarization lemur sonet speaker-diarization speaker-recognition speech-recognition speech-to-text transcript
Last synced: 04 Apr 2025
https://github.com/smwlms/transcriberapp
Local app for private transcription & analysis of audio with Whisper, Pyannote & Ollama.
audio diarization flask local-llm ollama pyannote svelte transcription whisper
Last synced: 16 Apr 2026
https://github.com/connortbot/podcast-diarizer
pipeline for speaker diarization using various clustering methods
Last synced: 20 Apr 2026
https://github.com/donbraulio/speechembeddings
Research on speech processing, speaker identification and audio diarization
diarization speaker-identification speech-processing speechbrain
Last synced: 21 Apr 2026
https://github.com/shahadathhs/voice-to-text
A standalone, fully local voice-to-text system using OpenAI's Whisper (open-source). All models run on your machine—no API keys, no cloud calls; audio never leaves your device. This version supports Dual Output (Original + Translation) and Local-First Speaker Diarization (no tokens required).
diarization python translation
Last synced: 26 Apr 2026
https://github.com/redflag-bugs/trannote
trannote is a baby project for getting transcription and diarization of speaker.
diarization transcription whisper
Last synced: 12 Mar 2025
https://github.com/cadia-lvl/diar-az
Diarization A to Z - Kaldi to Gecko to Kaldi and corpus and back
corpus-processing diarization parsing rttm
Last synced: 11 Mar 2026
https://github.com/ekhodzitsky/polyvoice
Speaker diarization for Rust — who spoke when, without Python. Silero VAD + WeSpeaker + AHC in a single Pipeline::run() call.
audio diarization machine-learning onnx python-bindings rust speaker-diarization speech vad voice
Last synced: 16 May 2026
https://github.com/aathifzahir/whisprsplit
A powerful, local speech-to-text transcription system that combines OpenAI's Whisper for accurate transcription with pyannote.audio for speaker diarization (identifying who spoke when). Perfect for meetings, interviews, podcasts, and any audio/video content that needs accurate transcription with speaker identification.
diarization speaker-recognition speech speech-diarization speech-recognition speech-to-text transcribe transcript transcription
Last synced: 19 Aug 2025
https://github.com/flaviodelgrosso/whisper-transcriber
Use OpenAI's Whisper to transcribe audio files and diariaze speakers of the transcribed text
ai audio-to-text diarization openai torch whisper
Last synced: 28 Jan 2026
https://github.com/shivxmr/speech-diarization
Speech Diarization
diarization python speech-recognition whisper
Last synced: 18 Apr 2026
https://github.com/nicknaskida/cog-whisper-diarization
Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote
diarization openai-whisper pyannote replicate speaker-diarization whisper whisper-faster whisperx
Last synced: 01 Oct 2025
https://github.com/lfenzo/poc-meeting-summarization
Proof of concept implementing multi-speaker recording transcription summarization
diarization llms summarization transcription
Last synced: 24 Jul 2025
https://github.com/mssoftjp/ai-transcriber-cli
CLI for transcribing audio and video files via the OpenAI speech-to-text API
4o-transcribe audio-transcription cli cli-tool diarization speech-to-text stt transcription video-transcription whisper
Last synced: 14 Apr 2026