Projects in Awesome Lists tagged with diarization
A curated list of projects in awesome lists tagged with diarization.
https://github.com/purfview/whisper-standalone-win
Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
asr ctranslate2 diarization faster-whisper openai speaker-diarization speech-recognition speech-to-text subtitles transcriber uvr vocal-extractor whisper whisper-faster whisperx
Last synced: 14 May 2025
https://github.com/transcriptionstream/transcriptionstream
turnkey self-hosted offline transcription and diarization service with llm summary
automation diarization llm mistral-7b ollama speaker-diarization speech-recognition transcription whisper whisperx
Last synced: 07 Apr 2025
https://github.com/microsoft/unispeech
UniSpeech - Large Scale Self-Supervised Learning for Speech
diarization pytorch speaker-verification speech speech-diarization speech-processing speech-recognition speech-separation
Last synced: 04 Apr 2025
https://github.com/revdotcom/reverb
Open source inference code for Rev's model
asr asr-model canary deeplearning diarization docker huggingface neural-network open-source opensource pyannote rev revai speaker-diarization speech-recognition speech-to-text speechrecognition wenet whisper
Last synced: 15 May 2025
https://github.com/thewh1teagle/sherpa-rs
Rust bindings to https://github.com/k2-fsa/sherpa-onnx
audio diarization embeddings rust sherpa speech-recognition
Last synced: 08 Apr 2025
https://github.com/suyashmore/mevonai-speech-emotion-recognition
Identify the emotion of multiple speakers in an Audio Segment
artificial-intelligence colab-notebook convolutional-neural-networks deep-learning diarization emotion-analysis emotion-recognition keras-tensorflow machine-learning mfcc mfcc-analysis speech-processing uis-rnn
Last synced: 29 Apr 2025
https://github.com/desh2608/dover-lap
Python package for combining diarization system outputs.
diarization dover-lap ensemble-machine-learning
Last synced: 17 Mar 2025
https://github.com/bunyaminergen/callytics
Callytics is an advanced call analytics solution that uses speech recognition and large language model (LLM) technologies to analyze phone conversations from customer service and call centers.
denoising diarization forced-alignment llama3 llm openai opensource sentiment-analysis speech-emotion-recognition speech-processing speech-recognition speech-to-text summary topic-modeling transcription voice-activity-detection voice-recognition
Last synced: 03 Apr 2025
https://github.com/wq2012/simpleder
A lightweight library to compute Diarization Error Rate (DER).
diarization machine-learning metrics speaker-diarization speech-processing speech-recognition
Last synced: 12 Apr 2025
https://github.com/thewh1teagle/pyannote-rs
pyannote audio diarization in Rust
asr diarization onnxruntime rust speech-recognition whisper
Last synced: 08 Feb 2025
https://github.com/R3gm/SoniTranslate
Synchronized Translation for Videos
audio-processing diarization translate-audio translate-video translation
Last synced: 11 Apr 2025
https://github.com/jschmie/scraibe
Tool for automatic transcription and speaker diarization based on whisper and pyannote.
diarization speech-to-text transcription
Last synced: 06 Apr 2025
https://github.com/picovoice/falcon
On-device speaker diarization powered by deep learning
deep-learning diarization on-device speaker-diarization speaker-recognition
Last synced: 31 Mar 2025
https://github.com/desh2608/spyder
Simple Python package for fast DER computation
Last synced: 17 Mar 2025
https://github.com/pulijon/sttcast
Transcription from mp3 files to html with or without embedded player
ansible artificial-intelligence automation aws-ec2 aws-s3 diarization g4dn gpu iac puppet python terraform transcription vagrant vosk-engine whisper whisperx
Last synced: 12 Apr 2025
https://github.com/thewh1teagle/loud.cpp
Whisper.cpp with diarization
cpp diarization onnxruntime openai sherpa-onnx transcription whisper
Last synced: 13 Jan 2025
https://github.com/elmiraghorbani/gpt-speaker-diarization
Conversational speaker diarization using OpenAI language models (GPT-4) and OpenAI Whisper.
asr diarization gpt-4 openai speaker-diarization speech-recognition speech-to-text voice-activity-detection whisper youtube-dl
Last synced: 28 Jan 2025
https://github.com/jeanjerome/echoinstone
EchoInStone is an audio processing tool that transcribes, diarizes, and aligns speaker segments from audio files, prioritizing accuracy and reliability.
alignment diarization localhost pyannote python transcribe whisper
Last synced: 10 Apr 2025
https://github.com/adamelkholyy/whisper-yt
Toolkit for using Whisper to transcribe YouTube videos. Includes Whisper transcription of YouTube videos, conversion of YouTube video into HuggingFace dataset (using audio and subtitles) and evaluation of Whisper transcription against YouTube subtitles
asr diarization huggingface-datasets pyannote transcription whisper word-error-rate youtube
Last synced: 12 Apr 2025
https://github.com/amatofrancesco99/speech_to_dialogue
This Streamlit web app takes a recorded audio track and produces the corresponding dialogue.
diarization python speech-to-text streamlit-application
Last synced: 16 Mar 2025
https://github.com/nicknaskida/insanely-fast-whisper
Incredibly fast Whisper-large-v3 with speaker diarization
diarization speaker-diarization transfromers whisper whisper-ai whisper-faster whisper-large
Last synced: 19 Jan 2025
https://github.com/fgonzalesc/transcripcion_ai
Audio transcription with Azure Speech and insight extraction with OpenAI
ai azure dataprocessing diarization openai-api python speechtotext
Last synced: 31 Mar 2025
https://github.com/bigyaa/transcription-system
This versatile tool is designed for anyone in need of a robust solution for transcribing and diarizing large volumes of audio files. Whether you are dealing with terabytes or even larger quantities, our tool ensures efficient and accurate processing. Ideal for researchers, content creators, and businesses.
accessibility diarization speech-to-text storytelling-with-data transcription whisper
Last synced: 06 Apr 2025
https://github.com/donbraulio/speechembeddings
Research on speech processing, speaker identification and audio diarization
diarization speaker-identification speech-processing speechbrain
Last synced: 13 Mar 2025
https://github.com/lfenzo/poc-meeting-summarization
Proof of concept implementing multi-speaker recording transcription summarization
diarization llms summarization transcription
Last synced: 11 Mar 2025
https://github.com/mtwn105/audio-intel
AudioIntel - Audio/Video Intelligence, Transcripts, Summary, and much more
ai assemblyai audio audio-processing diarization lemur sonet speaker-diarization speaker-recognition speech-recognition speech-to-text transcript
Last synced: 04 Apr 2025
https://github.com/redflag-bugs/trannote
trannote is a small project for transcribing audio and diarizing its speakers.
diarization transcription whisper
Last synced: 12 Mar 2025
https://github.com/connortbot/podcast-diarizer
pipeline for speaker diarization using various clustering methods
Last synced: 12 Mar 2025
https://github.com/flaviodelgrosso/whisper-transcriber
Use OpenAI's Whisper to transcribe audio files and diarize the speakers in the transcribed text
ai audio-to-text diarization openai torch whisper
Last synced: 06 Apr 2025
https://github.com/bunyaminergen/wavlmmsdd
This repository combines `WavLM`, a powerful speech representation model from Microsoft, with `MSDD` (Multi-Scale Diarization Decoder), a state-of-the-art approach for speaker diarization from Nvidia.
diarization embedding microsoft nvidia-nemo speaker-diarization speech speech-embedding wavlm
Last synced: 15 Feb 2025
https://github.com/nicknaskida/cog-whisper-diarization
Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote
diarization openai-whisper pyannote replicate speaker-diarization whisper whisper-faster whisperx
Last synced: 20 Jan 2025
https://github.com/smwlms/transcriberapp
Local app for private transcription & analysis of audio with Whisper, Pyannote & Ollama.
audio diarization flask local-llm ollama pyannote svelte transcription whisper
Last synced: 14 Apr 2025