https://github.com/bluebirdback/arena_speaker_diarization

This repository reviews different AI models for "Speaker Diarization" to help users find the best fit for their needs.

Host: GitHub
URL: https://github.com/bluebirdback/arena_speaker_diarization
Owner: BlueBirdBack
License: MIT
Created: 2024-04-30T09:35:57.000Z
Default Branch: main
Last Pushed: 2024-05-01T12:55:09.000Z
Last Synced: 2025-06-05T04:43:38.170Z
Language: Python
Size: 130 KB
Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

README

# ⚔ Arena Speaker Diarization 🗣️

This repository compares the performance of various AI models in solving the "Speaker Diarization" task. Speaker Diarization is the process of identifying different speakers in an audio recording and labeling which parts of the audio were spoken by each person.

For example, given a recording of a meeting with several participants, Speaker Diarization can automatically split the audio into segments and tag each segment with a speaker label (such as "Speaker A" or "Speaker B"). This makes transcripts and analysis of multi-speaker recordings much easier to follow.
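To make the idea concrete, here is a minimal sketch of what diarized output might look like once it is loaded into Python. The `Segment` class, the helper functions, and the meeting data are all made up for illustration; they are not the output format of any particular model reviewed here.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass
class Segment:
    """One diarized stretch of audio (hypothetical structure, times in seconds)."""
    start: float
    end: float
    speaker: str
    text: str


def format_transcript(segments):
    """Render segments in time order as 'Speaker X: text' lines."""
    ordered = sorted(segments, key=lambda s: s.start)
    return [f"Speaker {s.speaker}: {s.text}" for s in ordered]


def talk_time(segments):
    """Total speaking time per speaker label, in seconds."""
    totals = defaultdict(float)
    for s in segments:
        totals[s.speaker] += s.end - s.start
    return dict(totals)


# Made-up example data for a two-person meeting.
meeting = [
    Segment(0.0, 4.0, "A", "Shall we start with the budget?"),
    Segment(4.0, 9.5, "B", "Sure, I have the numbers here."),
    Segment(9.5, 12.0, "A", "Great, go ahead."),
]

for line in format_transcript(meeting):
    print(line)
print(talk_time(meeting))
```

Once every segment carries a speaker label, follow-up analysis such as per-speaker talk time becomes a simple aggregation.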

The goal is to help users choose the best available Speaker Diarization model for their specific use case and requirements.

## AssemblyAI

AssemblyAI is a cloud-based API that allows you to transcribe audio files. It's a good option for small to medium-sized files.

### Getting Started

https://www.assemblyai.com/app/

- [x] [Transcribe your first audio file](./assemblyai/transcribe_audio.py)
- [x] [Identify speakers in your audio](./assemblyai/identify_speakers.py)
- [x] [Identify highlights in your audio](./assemblyai/identify_highlights.py)
- [x] [Summarize your audio file with an LLM](./assemblyai/summarize_audio.py)
- [x] [Transcribe live audio streams](./assemblyai/transcribe_live.py)
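The speaker identification step can be sketched roughly as follows. This is not the content of `identify_speakers.py` (which is not shown here), only a guess at its general shape: it assumes the `assemblyai` Python package is installed and that you supply your own API key. The function names `format_utterances` and `identify_speakers` are invented for this example, and the stand-in objects at the bottom exist only so the formatting helper can be tried offline.

```python
from types import SimpleNamespace


def format_utterances(utterances):
    """Turn utterance objects (each with .speaker and .text) into printable lines."""
    return [f"Speaker {u.speaker}: {u.text}" for u in utterances]


def identify_speakers(audio, api_key):
    """Transcribe `audio` (a local path or URL) with speaker labels enabled.

    Assumes the `assemblyai` package is installed and `api_key` is valid.
    """
    import assemblyai as aai  # imported lazily so the helper above works without it

    aai.settings.api_key = api_key
    config = aai.TranscriptionConfig(speaker_labels=True)
    transcript = aai.Transcriber().transcribe(audio, config=config)
    if transcript.status == aai.TranscriptStatus.error:
        raise RuntimeError(transcript.error)
    return format_utterances(transcript.utterances)


# Offline demonstration with stand-in objects instead of a real API response.
fake = [
    SimpleNamespace(speaker="A", text="Hello."),
    SimpleNamespace(speaker="B", text="Hi there."),
]
print("\n".join(format_utterances(fake)))
```

Keeping the formatting separate from the API call makes it possible to check the output format without network access or credentials.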

#### Known Issues

🚨 In my test, AssemblyAI's live transcription feature was unreliable.

I said: "The key differences and improvements between CUDA 11.8 and CUDA 12.1 primarily revolve around compatibility, performance enhancements, and support for new GPU architectures. Here's a detailed comparison based on the provided sources"

The live transcript repeated and jumbled my words:

![Screenshot of the jumbled live transcript](./assets/240501_Cursor.png)

For further information or to share your thoughts, please visit https://twitter.com/BlueBirdBack/status/1785587598293094833.
