An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with diarization

A curated list of projects in awesome lists tagged with diarization .

https://github.com/transcriptionstream/transcriptionstream

turnkey self-hosted offline transcription and diarization service with llm summary

automation diarization llm mistral-7b ollama speaker-diarization speech-recognition transcription whisper whisperx

Last synced: 07 Apr 2025

https://github.com/thewh1teagle/sherpa-rs

Rust bindings to https://github.com/k2-fsa/sherpa-onnx

audio diarization embeddings rust sherpa speech-recognition

Last synced: 08 Apr 2025

https://github.com/desh2608/dover-lap

Python package for combining diarization system outputs.

diarization dover-lap ensemble-machine-learning

Last synced: 17 Mar 2025

https://github.com/bunyaminergen/callytics

Callytics is an advanced call analytics solution that leverages speech recognition and large language models (LLMs) technologies to analyze phone conversations from customer service and call centers.

denoising diarization forced-alignment llama3 llm openai opensource sentiment-analysis speech-emotion-recognition speech-processing speech-recognition speech-to-text summary topic-modeling transcription voice-activity-detection voice-recognition

Last synced: 03 Apr 2025

https://github.com/wq2012/simpleder

A lightweight library to compute Diarization Error Rate (DER).

diarization machine-learning metrics speaker-diarization speech-processing speech-recognition

Last synced: 12 Apr 2025

https://github.com/thewh1teagle/pyannote-rs

pyannote audio diarization in rust

asr diarization onnxruntime rust speech-recognition whisper

Last synced: 08 Feb 2025

https://github.com/jschmie/scraibe

Tool for automatic transcription and speaker diarization based on whisper and pyannote.

diarization speech-to-text transcription

Last synced: 06 Apr 2025

https://github.com/picovoice/falcon

On-device speaker diarization powered by deep learning

deep-learning diarization on-device speaker-diarization speaker-recognition

Last synced: 31 Mar 2025

https://github.com/desh2608/spyder

Simple Python package for fast DER computation

der diarization

Last synced: 17 Mar 2025

https://github.com/elmiraghorbani/gpt-speaker-diarization

Conversational Speaker Diarization using OpenAI AI Language Models(gpt-4) and OpenAI Whisper.

asr diarization gpt-4 openai speaker-diarization speech-recognition speech-to-text voice-activity-detection whisper youtube-dl

Last synced: 28 Jan 2025

https://github.com/jeanjerome/echoinstone

EchoInStone is an audio processing tool that transcribes, diarizes, and aligns speaker segments from audio files, prioritizing accuracy and reliability.

alignment diarization localhost pyannote python transcribe whisper

Last synced: 10 Apr 2025

https://github.com/adamelkholyy/whisper-yt

Toolkit for using Whisper to transcribe YouTube videos. Includes Whisper transcription of YouTube videos, conversion of YouTube video into HuggingFace dataset (using audio and subtitles) and evaluation of Whisper transcription against YouTube subtitles

asr diarization huggingface-datasets pyannote transcription whisper word-error-rate youtube

Last synced: 12 Apr 2025

https://github.com/amatofrancesco99/speech_to_dialogue

This streamlit web-app has been developed in order to obtain starting from a recorded audio-track the correspondent dialogue.

diarization python speech-to-text streamlit-application

Last synced: 16 Mar 2025

https://github.com/fgonzalesc/transcripcion_ai

Transcripción de audios con Azure Speech y extracción de insights con Open AI

ai azure dataprocessing diarization openai-api python speechtotext

Last synced: 31 Mar 2025

https://github.com/bigyaa/transcription-system

This versatile tool is designed for anyone in need of a robust solution for transcribing and diarizing large volumes of audio files. Whether you are dealing with terabytes or even larger quantities, our tool ensures efficient and accurate processing. Ideal for researchers, content creators, and businesses.

accessibility diarization speech-to-text storytelling-with-data transcription whisper

Last synced: 06 Apr 2025

https://github.com/donbraulio/speechembeddings

Research on speech processing, speaker identification and audio diarization

diarization speaker-identification speech-processing speechbrain

Last synced: 13 Mar 2025

https://github.com/lfenzo/poc-meeting-summarization

Proof of concept implementing multi-speaker recording transcription summarization

diarization llms summarization transcription

Last synced: 11 Mar 2025

https://github.com/redflag-bugs/trannote

trannote is a baby project for getting transcription and diarization of speaker.

diarization transcription whisper

Last synced: 12 Mar 2025

https://github.com/connortbot/podcast-diarizer

pipeline for speaker diarization using various clustering methods

clustering diarization

Last synced: 12 Mar 2025

https://github.com/flaviodelgrosso/whisper-transcriber

Use OpenAI's Whisper to transcribe audio files and diariaze speakers of the transcribed text

ai audio-to-text diarization openai torch whisper

Last synced: 06 Apr 2025

https://github.com/bunyaminergen/wavlmmsdd

This repository combines `WavLM`, a powerful speech representation model from Microsoft, with `MSDD` (Multi-Scale Diarization Decoder), a state-of-the-art approach for speaker diarization from Nvidia.

diarization embedding microsoft nvidia-nemo speaker-diarization speech speech-embedding wavlm

Last synced: 15 Feb 2025

https://github.com/nicknaskida/cog-whisper-diarization

Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote

diarization openai-whisper pyannote replicate speaker-diarization whisper whisper-faster whisperx

Last synced: 20 Jan 2025

https://github.com/smwlms/transcriberapp

Local app for private transcription & analysis of audio with Whisper, Pyannote & Ollama.

audio diarization flask local-llm ollama pyannote svelte transcription whisper

Last synced: 14 Apr 2025