Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
awesome-speaker-embedding
A curated list of speaker-embedding speaker-verification, speaker-identification resources.
https://github.com/ranchlai/awesome-speaker-embedding
Last synced: 5 days ago
JSON representation
-
Challenges
- Short-duration Speaker Verification (SdSV) Challenge 2020
- VoxCeleb Speaker Recognition Challenge (VoxSRC 2019)
- VoxCeleb Speaker Recognition Challenge (VoxSRC 2020)
- VoxCeleb Speaker Recognition Challenge (VoxSRC 2021)
- Short-duration Speaker Verification (SdSV) Challenge 2021
- CTS Speaker Recognition Challenge 2020
- Far-Field Speaker Verification Challenge (FFSVC 2020)
-
Code/Tools/Frameworks/Libraries
-
- VGGVox
- SincNet
- 3D CNN
- GE2E
- asv-subtools
- Resemblyzer - level representation of a voice through a deep learning model (referred to as the voice encoder).
- Res2Net
- voxceleb_trainer
- pytorch_xvectors - vectors.
- Speechbrain
- DeepSpeaker - to-End Neural Speaker Embedding System.
- voxceleb - visual dataset consisting of short clips of human speech, extracted from interview videos uploaded to YouTube
- Triplet-loss
- kaldi
-
Wining solutions of Challenges
-
More-recent papers
- Attention Back-end - end, model: TDNN, Resnet, data: cn-celeb
-
-
Must-read papers
- 02\
- 03\
- 04\
- 05\
- 06\
- 07\ - based/time-delay/multi-class, softmax + cross-entropy loss
- 08\ - vector</b> paper Johns Hopkins, based on TDNN, improved by adding Noise and reverberation for augmentation
- 09\ - vector</b>' paper from Johns Hopkins
- 10\
- 11\ - vector</b>' paper from Johns Hopkins
- 12\ - norm paper, useful for score normalization
- 01\
-
Benchmarks (not very accurate)
-
Must-read technical reports
-
Datasets
- TIMIT - free
- NIST SRE - free
- AIShell-1
- AIShell-2 - free for commercial
- AIShell-3
- AIShell-4
- HI-MIA - field text-dependent speaker verification and keyword spotting
- SITW
- Voxceleb 1&2
- Cn-Celeb 1&2 - genres speaker dataset in the wild, utterances are from chinese celebrities.
-
Great Talks / Tutorials
Categories
Sub Categories
Keywords
speaker-verification
4
speaker-recognition
4
pytorch
3
convolutional-neural-networks
2
deep-learning
2
speaker-identification
2
artificial-intelligence
1
asr
1
audio
1
audio-processing
1
cnn
1
digital-signal-processing
1
filtering
1
neural-networks
1
python
1
signal-processing
1
speech-processing
1
speech-recognition
1
timit
1
waveform
1
3d
1
backbone
1
jittor
1
multi-scale
1
res2net
1
metric-learning
1
voxceleb
1
speaker-diarization
1
speaker-embeddings
1