Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists tagged with speaker-recognition
A curated list of projects in awesome lists tagged with speaker-recognition.
https://github.com/nvidia/nemo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
asr deeplearning generative-ai large-language-models machine-translation multimodal neural-networks speaker-diariazation speaker-recognition speech-synthesis speech-translation tts
Last synced: 29 Sep 2024
https://github.com/speechbrain/speechbrain
A PyTorch-based Speech Toolkit
asr audio audio-processing deep-learning huggingface language-model pytorch speaker-diarization speaker-recognition speaker-verification speech-enhancement speech-processing speech-recognition speech-separation speech-to-text speech-toolkit speechrecognition spoken-language-understanding transformers voice-recognition
Last synced: 29 Sep 2024
https://github.com/pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
overlapped-speech-detection pretrained-models pytorch speaker-change-detection speaker-diarization speaker-embedding speaker-recognition speaker-verification speech-activity-detection speech-processing voice-activity-detection
Last synced: 29 Sep 2024
https://github.com/google/uis-rnn
This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.
clustering machine-learning speaker-diarization speaker-recognition supervised-clustering supervised-learning uis-rnn
Last synced: 30 Sep 2024
https://github.com/mravanelli/sincnet
SincNet is a neural architecture for efficiently processing raw audio samples.
artificial-intelligence asr audio audio-processing cnn convolutional-neural-networks deep-learning digital-signal-processing filtering neural-networks python pytorch signal-processing speaker-identification speaker-recognition speaker-verification speech-processing speech-recognition timit waveform
Last synced: 30 Sep 2024
https://github.com/clovaai/voxceleb_trainer
In defence of metric learning for speaker recognition
metric-learning speaker-recognition speaker-verification voxceleb
Last synced: 30 Sep 2024
https://github.com/athena-team/athena
An open-source implementation of a sequence-to-sequence based speech processing engine
asr ctc deployment sequence-to-sequence speaker-recognition speech-recognition speech-synthesis tensorflow transformer tts unsupervised-learning wfst
Last synced: 08 Aug 2024
https://github.com/astorfi/3d-convolutional-speaker-recognition
Deep Learning & 3D Convolutional Neural Networks for Speaker Verification
3d convolutional-neural-networks deep-learning speaker-recognition
Last synced: 03 Oct 2024
https://github.com/yeyupiaoling/voiceprintrecognition-pytorch
This project uses a variety of advanced voiceprint recognition models, such as EcapaTdnn, ResNetSE, ERes2Net, and CAM++; more models may be supported in the future. It also supports the MelSpectrogram and Spectrogram data preprocessing methods.
arcface ecapa-tdnn pytorch speaker-recognition voice-recognition
Last synced: 03 Oct 2024
https://github.com/wenet-e2e/wespeaker
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
asv campplus cnceleb dino ecapa-tdnn eres2net nist-sre plda production-ready pytorch repvgg resnet self-supervised-learning speaker-diarization speaker-recognition speaker-verification ssl tdnn voxceleb xvector
Last synced: 08 Aug 2024
https://github.com/cvqluu/Angular-Penalty-Softmax-Losses-Pytorch
Angular penalty loss functions in PyTorch (ArcFace, SphereFace, Additive Margin, CosFace)
am-softmax arcface embedding face-recognition face-verification fashion-mnist fmnist-dataset loss-function loss-functions metric-learning normface pytorch speaker-recognition sphereface
Last synced: 01 Aug 2024
https://github.com/speechbrain/speechbrain.github.io
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
beamforming deep-learning deeplearning librispeech neural-network neural-networks speaker-identification speaker-recognition speaker-verification speech speech-analysis speech-api speech-emotion-recognition speech-processing speech-recognition speech-recognizer speech-separation speech-to-text speechrecognition timit
Last synced: 02 Aug 2024
https://github.com/manojpamk/pytorch_xvectors
Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196
speaker-diarization speaker-embeddings speaker-recognition speaker-verification
Last synced: 02 Aug 2024
https://github.com/yeyupiaoling/voiceprintrecognition-tensorflow
Voiceprint recognition implemented with TensorFlow
arcface speaker-recognition tensorflow voice-recognition
Last synced: 03 Oct 2024
https://github.com/yeyupiaoling/voiceprintrecognition-paddlepaddle
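This project uses a variety of advanced voiceprint recognition models, such as EcapaTdnn, ResNetSE, ERes2Net, and CAM++, and also supports several data preprocessing methods, including MelSpectrogram, Spectrogram, MFCC, and Fbank.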
arcface ecapa-tdnn paddlepaddle speaker-recognition voice-recognition
Last synced: 03 Oct 2024
https://github.com/atul-anand-jha/speaker-identification-python
Speaker Identification System (up to 100% accuracy), built using Python 2.7 and the python_speech_features library
python-2 speaker-identification speaker-recognition
Last synced: 01 Oct 2024
https://github.com/IBM-Cloud/chatbot-watson-android
An Android ChatBot powered by Watson Services - Assistant, Speech-to-Text and Text-to-Speech on IBM Cloud.
android android-studio chatbot conversation conversation-service dialog entity ibm-cloud ibm-cloud-solutions ibm-watson ibm-watson-services intent java speaker-diarization speaker-recognition speech watson watson-services workspace
Last synced: 04 Aug 2024
https://github.com/jefflai108/pytorch-kaldi-neural-speaker-embeddings
Lightweight neural speaker embedding extraction based on Kaldi and PyTorch.
kaldi learnable-dictionary-encoding pytorch speaker-identification speaker-recognition speaker-verification speech-processing
Last synced: 07 Aug 2024
https://github.com/Anwarvic/Speaker-Recognition
This repo contains my attempt to create a Speaker Recognition and Verification system using SideKit-1.3.1
gmm gmm-ubm i-vector identity-vector identity-verification sidekit speaker-identification speaker-recognition speaker-verification ubm
Last synced: 07 Aug 2024
https://github.com/seongmin-kye/meta-SR
PyTorch implementation of Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length Pairs (Interspeech, 2020)
meta-learning short-utterances speaker-recognition speaker-verification
Last synced: 03 Aug 2024
https://github.com/cyrta/voxceleb
Mirror of the VoxCeleb dataset, a large-scale speaker identification dataset
corpus dataset speaker speaker-identification speaker-recognition speaker-verification speech
Last synced: 03 Aug 2024
https://github.com/andi611/Mockingjay-Speech-Representation
Official implementation of Mockingjay in PyTorch
apc feature-extraction mockingjay phone-classification phoneme-prediction pytorch pytorch-implementation representation-learning sentiment-classification speaker-classification speaker-recognition speech speech-representation
Last synced: 07 Aug 2024
https://github.com/maxhollmann/voxceleb-luigi
Luigi pipeline to download VoxCeleb(2) audio from YouTube and extract speaker segments
luigi speaker-embedding speaker-recognition speaker-verification voxceleb
Last synced: 05 Aug 2024
https://github.com/mycrazycracy/tf-kaldi-speaker
Neural speaker recognition/verification system based on Kaldi and TensorFlow
kaldi kaldi-asr machine-learning neural-network speaker-identification speaker-recognition speaker-verification speech-processing tensorflow
Last synced: 02 Aug 2024
https://github.com/limdongjin/ignkafasr
Real-time, in-memory speaker verification and speech recognition project using Apache Ignite, Apache Kafka, SpeechBrain, Whisper, STOMP, Spring WebFlux, and Kubernetes (k8s)
apache-ignite apache-kafka asr audio-recorder google-kubernetes-engine k8s kubernetes speaker-recognition speaker-verification speech-recognition speechbrain springframework stomp stompwebsocket webflux whisper
Last synced: 29 Jul 2024