An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with diarization

A curated list of projects in awesome lists tagged with diarization .

https://github.com/transcriptionstream/transcriptionstream

turnkey self-hosted offline transcription and diarization service with llm summary

automation diarization llm mistral-7b ollama speaker-diarization speech-recognition transcription whisper whisperx

Last synced: 07 Apr 2025

https://github.com/homelab-00/transcriptionsuite

A fully local and private Speech-To-Text app with cross-platform support, speaker diarization, Audio Notebook mode, LM Studio integration, and both longform and live transcription.

diarization dictation docker faster-whisper linux local macos mlx nemo notebook open-source parakeet realtime speech-to-text tailscale transcription vibevoice whisper whisperx windows

Last synced: 27 Apr 2026

https://github.com/thewh1teagle/sherpa-rs

Rust bindings to https://github.com/k2-fsa/sherpa-onnx

audio diarization embeddings rust sherpa speech-recognition

Last synced: 08 Apr 2025

https://github.com/narcotic-sh/senko

Very fast, accurate speaker diarization

audio-ai diarization fbank pyannote rapids silero-vad speaker-diarization zanshin

Last synced: 02 Oct 2025

https://github.com/desh2608/dover-lap

Python package for combining diarization system outputs.

diarization dover-lap ensemble-machine-learning

Last synced: 17 Mar 2025

https://github.com/bunyaminergen/callytics

Callytics is an advanced call analytics solution that leverages speech recognition and large language models (LLMs) technologies to analyze phone conversations from customer service and call centers.

denoising diarization forced-alignment llama3 llm openai opensource sentiment-analysis speech-emotion-recognition speech-processing speech-recognition speech-to-text summary topic-modeling transcription voice-activity-detection voice-recognition

Last synced: 03 Apr 2025

https://github.com/wq2012/simpleder

A lightweight library to compute Diarization Error Rate (DER).

diarization machine-learning metrics speaker-diarization speech-processing speech-recognition

Last synced: 30 Aug 2025

https://github.com/jschmie/scraibe

Tool for automatic transcription and speaker diarization based on whisper and pyannote.

diarization speech-to-text transcription

Last synced: 08 Sep 2025

https://github.com/thewh1teagle/pyannote-rs

pyannote audio diarization in rust

asr diarization onnxruntime rust speech-recognition whisper

Last synced: 23 Oct 2025

https://github.com/picovoice/falcon

On-device speaker diarization powered by deep learning

deep-learning diarization on-device speaker-diarization speaker-recognition

Last synced: 31 Mar 2025

https://github.com/desh2608/spyder

Simple Python package for fast DER computation

der diarization

Last synced: 02 Aug 2025

https://github.com/namastexlabs/murmurai

🎙️ Drop-in replacement for paid transcription APIs. Self-hosted, GPU-powered, speaker diarization. Free forever: uvx murmurai

ai api asr diarization stt text-to-speech transcription tts whisper-ai whisperx

Last synced: 29 Jan 2026

https://github.com/harmlessman/pafts

PAFTS : Library That Preprocessing Audio For TTS.

asr diarization separator speech-to-text stt tts whisper

Last synced: 30 Apr 2026

https://github.com/cadia-lvl/kaldi-speaker-diarization

This repository creates speaker diarization recipes to be used within the egs folder of kaldi.

ahc audio-files diarization icelandic kaldi mfccs plda speaker-diarization wav

Last synced: 11 Mar 2026

https://github.com/r3dbars/transcripted

Turn meetings and dictation into clean notes. Transcripted keeps it local and turns spoken audio into .md files.

agent-context coreml diarization dictation local-ai local-first macos meeting-recorder privacy speaker-recognition speech-to-text swift transcription

Last synced: 15 May 2026

https://github.com/elmiraghorbani/gpt-speaker-diarization

Conversational Speaker Diarization using OpenAI AI Language Models(gpt-4) and OpenAI Whisper.

asr diarization gpt-4 openai speaker-diarization speech-recognition speech-to-text voice-activity-detection whisper youtube-dl

Last synced: 10 Oct 2025

https://github.com/bunyaminergen/wavlmmsdd

This repository combines `WavLM`, a powerful speech representation model from Microsoft, with `MSDD` (Multi-Scale Diarization Decoder), a state-of-the-art approach for speaker diarization from Nvidia.

diarization embedding microsoft nvidia-nemo speaker-diarization speech speech-embedding wavlm

Last synced: 15 Aug 2025

https://github.com/jeanjerome/echoinstone

EchoInStone is an audio processing tool that transcribes, diarizes, and aligns speaker segments from audio files, prioritizing accuracy and reliability.

alignment diarization localhost pyannote python transcribe whisper

Last synced: 10 Apr 2025

https://github.com/amatofrancesco99/speech_to_dialogue

This streamlit web-app has been developed in order to obtain starting from a recorded audio-track the correspondent dialogue.

diarization python speech-to-text streamlit-application

Last synced: 16 Mar 2025

https://github.com/adamelkholyy/whisper-yt

Toolkit for using Whisper to transcribe YouTube videos. Includes Whisper transcription of YouTube videos, conversion of YouTube video into HuggingFace dataset (using audio and subtitles) and evaluation of Whisper transcription against YouTube subtitles

asr diarization huggingface-datasets pyannote transcription whisper word-error-rate youtube

Last synced: 03 Feb 2026

https://github.com/schemalabz/opencouncil-tasks

Backend task processing service for the OpenCouncil platform. This service handles various audio, video, and content processing tasks through both a REST API and CLI interface.

ai-summarizer civic-tech diarization hacktoberfest transcription

Last synced: 25 Apr 2026

https://github.com/edoardopona/ara

Ara (think parrot :parrot: ) is a script / api to transcribe and diarise audio. It uses Whisper and Pyannote

audio-processing deep-learning diarization language transcription whisper

Last synced: 25 Apr 2026

https://github.com/bigyaa/transcription-system

This versatile tool is designed for anyone in need of a robust solution for transcribing and diarizing large volumes of audio files. Whether you are dealing with terabytes or even larger quantities, our tool ensures efficient and accurate processing. Ideal for researchers, content creators, and businesses.

accessibility diarization speech-to-text storytelling-with-data transcription whisper

Last synced: 20 Jan 2026

https://github.com/eddiegulay/rtrimmer

Python package to trim RTTM diarization files and optionally audio files to a user-specified time range.

audio diarization pyannote rttm tanzania trimming

Last synced: 02 Apr 2026

https://github.com/theseraphim/scribe-forge-ai

🎵 Complete offline audio transcription system with speaker diarization using OpenAI Whisper and PyAnnote. Features automatic audio cleaning, precise timestamps, multiple output formats (JSON/TXT/Markdown), and support for 20+ audio formats. No external APIs required - works entirely offline.

audio-analysis audio-cleaning audio-processing audio-transcription diarization ffmpeg huggingface machine-learning multi-speaker nlp offline-transcription openai-whisper pyannote python speaker-diarization speech-recognition speech-to-text timestamps transcription-tool whisper

Last synced: 05 May 2026

https://github.com/fgonzalesc/transcripcion_ai

TranscripciĂłn de audios con Azure Speech y extracciĂłn de insights con Open AI

ai azure dataprocessing diarization openai-api python speechtotext

Last synced: 18 May 2026

https://github.com/smwlms/transcriberapp

Local app for private transcription & analysis of audio with Whisper, Pyannote & Ollama.

audio diarization flask local-llm ollama pyannote svelte transcription whisper

Last synced: 16 Apr 2026

https://github.com/connortbot/podcast-diarizer

pipeline for speaker diarization using various clustering methods

clustering diarization

Last synced: 20 Apr 2026

https://github.com/donbraulio/speechembeddings

Research on speech processing, speaker identification and audio diarization

diarization speaker-identification speech-processing speechbrain

Last synced: 21 Apr 2026

https://github.com/shahadathhs/voice-to-text

A standalone, fully local voice-to-text system using OpenAI's Whisper (open-source). All models run on your machine—no API keys, no cloud calls; audio never leaves your device. This version supports Dual Output (Original + Translation) and Local-First Speaker Diarization (no tokens required).

diarization python translation

Last synced: 26 Apr 2026

https://github.com/redflag-bugs/trannote

trannote is a baby project for getting transcription and diarization of speaker.

diarization transcription whisper

Last synced: 12 Mar 2025

https://github.com/cadia-lvl/diar-az

Diarization A to Z - Kaldi to Gecko to Kaldi and corpus and back

corpus-processing diarization parsing rttm

Last synced: 11 Mar 2026

https://github.com/ekhodzitsky/polyvoice

Speaker diarization for Rust — who spoke when, without Python. Silero VAD + WeSpeaker + AHC in a single Pipeline::run() call.

audio diarization machine-learning onnx python-bindings rust speaker-diarization speech vad voice

Last synced: 16 May 2026

https://github.com/aathifzahir/whisprsplit

A powerful, local speech-to-text transcription system that combines OpenAI's Whisper for accurate transcription with pyannote.audio for speaker diarization (identifying who spoke when). Perfect for meetings, interviews, podcasts, and any audio/video content that needs accurate transcription with speaker identification.

diarization speaker-recognition speech speech-diarization speech-recognition speech-to-text transcribe transcript transcription

Last synced: 19 Aug 2025

https://github.com/flaviodelgrosso/whisper-transcriber

Use OpenAI's Whisper to transcribe audio files and diariaze speakers of the transcribed text

ai audio-to-text diarization openai torch whisper

Last synced: 28 Jan 2026

https://github.com/nicknaskida/cog-whisper-diarization

Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote

diarization openai-whisper pyannote replicate speaker-diarization whisper whisper-faster whisperx

Last synced: 01 Oct 2025

https://github.com/lfenzo/poc-meeting-summarization

Proof of concept implementing multi-speaker recording transcription summarization

diarization llms summarization transcription

Last synced: 24 Jul 2025

https://github.com/mssoftjp/ai-transcriber-cli

CLI for transcribing audio and video files via the OpenAI speech-to-text API

4o-transcribe audio-transcription cli cli-tool diarization speech-to-text stt transcription video-transcription whisper

Last synced: 14 Apr 2026