Projects in Awesome Lists tagged with diarization

https://github.com/purfview/whisper-standalone-win

Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.

asr ctranslate2 diarization faster-whisper openai speaker-diarization speech-recognition speech-to-text subtitles transcriber uvr vocal-extractor whisper whisper-faster whisperx

Last synced: 14 May 2025

https://github.com/Purfview/whisper-standalone-win

Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.

asr ctranslate2 diarization faster-whisper openai speaker-diarization speech-recognition speech-to-text subtitles transcriber uvr vocal-extractor whisper whisper-faster whisperx

Last synced: 28 Mar 2025

https://github.com/transcriptionstream/transcriptionstream

turnkey self-hosted offline transcription and diarization service with llm summary

automation diarization llm mistral-7b ollama speaker-diarization speech-recognition transcription whisper whisperx

Last synced: 07 Apr 2025

https://github.com/microsoft/unispeech

UniSpeech - Large Scale Self-Supervised Learning for Speech

diarization pytorch speaker-verification speech speech-diarization speech-processing speech-recognition speech-separation

Last synced: 04 Apr 2025

https://github.com/revdotcom/reverb

Open source inference code for Rev's model

asr asr-model canary deeplearning diarization docker huggingface neural-network open-source opensource pyannote rev revai speaker-diarization speech-recognition speech-to-text speechrecognition wenet whisper

Last synced: 15 May 2025

https://github.com/thewh1teagle/sherpa-rs

Rust bindings to https://github.com/k2-fsa/sherpa-onnx

audio diarization embeddings rust sherpa speech-recognition

Last synced: 08 Apr 2025

https://github.com/suyashmore/mevonai-speech-emotion-recognition

Identify the emotion of multiple speakers in an Audio Segment

artificial-intelligence colab-notebook convolutional-neural-networks deep-learning diarization emotion-analysis emotion-recognition keras-tensorflow machine-learning mfcc mfcc-analysis speech-processing uis-rnn

Last synced: 29 Apr 2025

https://github.com/desh2608/dover-lap

Python package for combining diarization system outputs.

diarization dover-lap ensemble-machine-learning

Last synced: 17 Mar 2025

https://github.com/bunyaminergen/callytics

Callytics is an advanced call analytics solution that leverages speech recognition and large language models (LLMs) technologies to analyze phone conversations from customer service and call centers.

denoising diarization forced-alignment llama3 llm openai opensource sentiment-analysis speech-emotion-recognition speech-processing speech-recognition speech-to-text summary topic-modeling transcription voice-activity-detection voice-recognition

Last synced: 03 Apr 2025

https://github.com/wq2012/simpleder

A lightweight library to compute Diarization Error Rate (DER).

diarization machine-learning metrics speaker-diarization speech-processing speech-recognition

Last synced: 12 Apr 2025

https://github.com/thewh1teagle/pyannote-rs

pyannote audio diarization in rust

asr diarization onnxruntime rust speech-recognition whisper

Last synced: 08 Feb 2025

https://github.com/R3gm/SoniTranslate

Synchronized Translation for Videos

audio-processing diarization translate-audio translate-video translation

Last synced: 11 Apr 2025

https://github.com/jschmie/scraibe

Tool for automatic transcription and speaker diarization based on whisper and pyannote.

diarization speech-to-text transcription

Last synced: 06 Apr 2025

https://github.com/picovoice/falcon

On-device speaker diarization powered by deep learning

deep-learning diarization on-device speaker-diarization speaker-recognition

Last synced: 31 Mar 2025

https://github.com/desh2608/spyder

Simple Python package for fast DER computation

der diarization

Last synced: 17 Mar 2025

https://github.com/pulijon/sttcast

Transcription from mp3 files to html with or without embedded player

ansible artificial-intelligence automation aws-ec2 aws-s3 diarization g4dn gpu iac puppet python terraform transcription vagrant vosk-engine whisper whisperx

Last synced: 12 Apr 2025

https://github.com/thewh1teagle/loud.cpp

Whisper.cpp with diarization

cpp diarization onnxruntime openai sherpa-onnx transcription whisper

Last synced: 13 Jan 2025

https://github.com/elmiraghorbani/gpt-speaker-diarization

Conversational Speaker Diarization using OpenAI AI Language Models(gpt-4) and OpenAI Whisper.

asr diarization gpt-4 openai speaker-diarization speech-recognition speech-to-text voice-activity-detection whisper youtube-dl

Last synced: 28 Jan 2025

https://github.com/jeanjerome/echoinstone

EchoInStone is an audio processing tool that transcribes, diarizes, and aligns speaker segments from audio files, prioritizing accuracy and reliability.

alignment diarization localhost pyannote python transcribe whisper

Last synced: 10 Apr 2025

https://github.com/adamelkholyy/whisper-yt

Toolkit for using Whisper to transcribe YouTube videos. Includes Whisper transcription of YouTube videos, conversion of YouTube video into HuggingFace dataset (using audio and subtitles) and evaluation of Whisper transcription against YouTube subtitles

asr diarization huggingface-datasets pyannote transcription whisper word-error-rate youtube

Last synced: 12 Apr 2025

https://github.com/amatofrancesco99/speech_to_dialogue

This streamlit web-app has been developed in order to obtain starting from a recorded audio-track the correspondent dialogue.

diarization python speech-to-text streamlit-application

Last synced: 16 Mar 2025

https://github.com/nicknaskida/insanely-fast-whisper

Incredibly fast Whisper-large-v3 with speaker diarization

diarization speaker-diarization transfromers whisper whisper-ai whisper-faster whisper-large

Last synced: 19 Jan 2025

https://github.com/fgonzalesc/transcripcion_ai

Transcripción de audios con Azure Speech y extracción de insights con Open AI

ai azure dataprocessing diarization openai-api python speechtotext

Last synced: 31 Mar 2025

https://github.com/bigyaa/transcription-system

This versatile tool is designed for anyone in need of a robust solution for transcribing and diarizing large volumes of audio files. Whether you are dealing with terabytes or even larger quantities, our tool ensures efficient and accurate processing. Ideal for researchers, content creators, and businesses.

accessibility diarization speech-to-text storytelling-with-data transcription whisper

Last synced: 06 Apr 2025

https://github.com/donbraulio/speechembeddings

Research on speech processing, speaker identification and audio diarization

diarization speaker-identification speech-processing speechbrain

Last synced: 13 Mar 2025

https://github.com/lfenzo/poc-meeting-summarization

Proof of concept implementing multi-speaker recording transcription summarization

diarization llms summarization transcription

Last synced: 11 Mar 2025

https://github.com/mtwn105/audio-intel

AudioIntel - Audio/Video Intelligence, Transcripts, Summary, and much more

ai assemblyai audio audio-processing diarization lemur sonet speaker-diarization speaker-recognition speech-recognition speech-to-text transcript

Last synced: 04 Apr 2025

https://github.com/redflag-bugs/trannote

trannote is a baby project for getting transcription and diarization of speaker.

diarization transcription whisper

Last synced: 12 Mar 2025

https://github.com/connortbot/podcast-diarizer

pipeline for speaker diarization using various clustering methods

clustering diarization

Last synced: 12 Mar 2025

https://github.com/flaviodelgrosso/whisper-transcriber

Use OpenAI's Whisper to transcribe audio files and diariaze speakers of the transcribed text

ai audio-to-text diarization openai torch whisper

Last synced: 06 Apr 2025

https://github.com/bunyaminergen/wavlmmsdd

This repository combines `WavLM`, a powerful speech representation model from Microsoft, with `MSDD` (Multi-Scale Diarization Decoder), a state-of-the-art approach for speaker diarization from Nvidia.

diarization embedding microsoft nvidia-nemo speaker-diarization speech speech-embedding wavlm

Last synced: 15 Feb 2025

https://github.com/nicknaskida/cog-whisper-diarization

Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote

diarization openai-whisper pyannote replicate speaker-diarization whisper whisper-faster whisperx

Last synced: 20 Jan 2025

https://github.com/smwlms/transcriberapp

Local app for private transcription & analysis of audio with Whisper, Pyannote & Ollama.

audio diarization flask local-llm ollama pyannote svelte transcription whisper

Last synced: 14 Apr 2025