Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Projects in Awesome Lists tagged with speaker-recognition

A curated list of projects in awesome lists tagged with speaker-recognition .

https://github.com/nvidia/nemo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

asr deeplearning generative-ai large-language-models machine-translation multimodal neural-networks speaker-diariazation speaker-recognition speech-synthesis speech-translation tts

Last synced: 16 Dec 2024

https://github.com/NVIDIA/NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

asr deeplearning generative-ai large-language-models machine-translation multimodal neural-networks speaker-diariazation speaker-recognition speech-synthesis speech-translation tts

Last synced: 25 Oct 2024

https://github.com/pyannote/pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

overlapped-speech-detection pretrained-models pytorch speaker-change-detection speaker-diarization speaker-embedding speaker-recognition speaker-verification speech-activity-detection speech-processing voice-activity-detection

Last synced: 17 Dec 2024

https://github.com/google/uis-rnn

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

clustering machine-learning speaker-diarization speaker-recognition supervised-clustering supervised-learning uis-rnn

Last synced: 18 Dec 2024

https://github.com/clovaai/voxceleb_trainer

In defence of metric learning for speaker recognition

metric-learning speaker-recognition speaker-verification voxceleb

Last synced: 21 Dec 2024

https://github.com/athena-team/athena

an open-source implementation of sequence-to-sequence based speech processing engine

asr ctc deployment sequence-to-sequence speaker-recognition speech-recognition speech-synthesis tensorflow transformer tts unsupervised-learning wfst

Last synced: 28 Nov 2024

https://github.com/yeyupiaoling/voiceprintrecognition-pytorch

This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net, CAM++, etc. It is not excluded that more models will be supported in the future. At the same time, this project also supports MelSpectrogram, Spectrogram data preprocessing methods

arcface ecapa-tdnn pytorch speaker-recognition voice-recognition

Last synced: 20 Dec 2024

https://github.com/astorfi/3d-convolutional-speaker-recognition

:speaker: Deep Learning & 3D Convolutional Neural Networks for Speaker Verification

3d convolutional-neural-networks deep-learning speaker-recognition

Last synced: 20 Dec 2024

https://github.com/astorfi/3D-convolutional-speaker-recognition

:speaker: Deep Learning & 3D Convolutional Neural Networks for Speaker Verification

3d convolutional-neural-networks deep-learning speaker-recognition

Last synced: 28 Oct 2024

https://github.com/speechbrain/speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

beamforming deep-learning deeplearning librispeech neural-network neural-networks speaker-identification speaker-recognition speaker-verification speech speech-analysis speech-api speech-emotion-recognition speech-processing speech-recognition speech-recognizer speech-separation speech-to-text speechrecognition timit

Last synced: 13 Nov 2024

https://github.com/manojpamk/pytorch_xvectors

Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196

speaker-diarization speaker-embeddings speaker-recognition speaker-verification

Last synced: 11 Nov 2024

https://github.com/yeyupiaoling/voiceprintrecognition-paddlepaddle

本项目使用了EcapaTdnn、ResNetSE、ERes2Net、CAM++等多种先进的声纹识别模型,同时本项目也支持了MelSpectrogram、Spectrogram、MFCC、Fbank等多种数据预处理方法

arcface ecapa-tdnn paddlepaddle speaker-recognition voice-recognition

Last synced: 18 Dec 2024

https://github.com/vita-group/autospeech

[InterSpeech 2020] "AutoSpeech: Neural Architecture Search for Speaker Recognition" by Shaojin Ding*, Tianlong Chen*, Xinyu Gong, Weiwei Zha, Zhangyang Wang

automl autospeech neural-architecture-search pytorch speaker-recognition

Last synced: 19 Dec 2024

https://github.com/atul-anand-jha/speaker-identification-python

Speaker Identification System (upto 100% accuracy); built using Python 2.7 and python_speech_features library

python-2 speaker-identification speaker-recognition

Last synced: 18 Dec 2024

https://github.com/lihanghang/casr-demo

基于Flask Web的中文自动语音识别演示系统,包含语音识别、语音合成、声纹识别之说话人识别。

baidu-aip casr-demo ctc flask-application gmm pyaudio speaker-recognition speech-to-text

Last synced: 19 Dec 2024

https://github.com/Anwarvic/Speaker-Recognition

This repo contains my attempt to create a Speaker Recognition and Verification system using SideKit-1.3.1

gmm gmm-ubm i-vector identity-vector identity-verification sidekit speaker-identification speaker-recognition speaker-verification ubm

Last synced: 27 Nov 2024

https://github.com/vidyasagarmsc/watbot

An Android ChatBot powered by IBM Watson Services (Assistant V1, Text-to-Speech, and Speech-to-Text with Speaker Recognition) on IBM Cloud.

android android-studio assistant chatbot cognitive-services conversation conversation-service dialog entity ibm-cloud intent speaker-diarization speaker-labels speaker-recognition speech speech-to-text text-to-speech watson watson-assistant-service workspace

Last synced: 17 Nov 2024

https://github.com/seongmin-kye/meta-SR

Pytorch implementation of Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length Pairs (Interspeech, 2020)

meta-learning short-utterances speaker-recognition speaker-verification

Last synced: 17 Nov 2024

https://github.com/cyrta/voxceleb

mirror of VoxCeleb dataset - a large-scale speaker identification dataset

corpus dataset speaker speaker-identification speaker-recognition speaker-verification speech

Last synced: 17 Nov 2024

https://github.com/wadaboa/titanet

Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO

d-vectors ml4cv nvidia speaker-embeddings speaker-identification speaker-recognition speaker-verification titanet unibo

Last synced: 28 Oct 2024

https://github.com/maxhollmann/voxceleb-luigi

Luigi pipeline to download VoxCeleb(2) audio from YouTube and extract speaker segments

luigi speaker-embedding speaker-recognition speaker-verification voxceleb

Last synced: 02 Dec 2024

https://github.com/wq2012/speakerrecognitionfromscratch

Final project for the Speaker Recognition course on Udemy, 机器之心, 深蓝学院 and 语音之家

attention-mechanism deep-learning librispeech lstm neural-network pytorch speaker-recognition speaker-recognition-systems transformer transformer-models

Last synced: 07 Nov 2024

https://github.com/zycv/openspeaker

OpenSpeaker is a completely independent and open source speaker recognition project. It provides the entire process of speaker recognition including multi-platform deployment and model optimization.

speaker-recognition speaker-verification voiceprint-recognition

Last synced: 30 Nov 2024

https://github.com/picovoice/falcon

On-device speaker diarization powered by deep learning

deep-learning diarization on-device speaker-diarization speaker-recognition

Last synced: 19 Dec 2024

https://github.com/zycv/speaker-recognition-based-on-deep-learning-an-overview

This repo is to list the references papers of 《Speaker Recognition Based on Deep Learning: An Overview》

deep-learning speaker-recognition speaker-verification

Last synced: 30 Nov 2024

https://github.com/alifarazz/september

:microphone: An offline text-independent speaker recognition system

ansi-c cross-platform gtk3 openal speaker-recognition sptk

Last synced: 06 Nov 2024

https://github.com/picovoice/eagle

On-device speaker recognition engine powered by deep learning

speaker-embedding speaker-identification speaker-recognition

Last synced: 22 Nov 2024

https://github.com/zabir-nabil/audioperm

A python library for generating different permutations of audible segments from audio files.

audio-augmentation audio-classification audio-processing augmentation speaker-recognition speech-augmentation

Last synced: 02 Dec 2024

https://github.com/prmelehan/speaker-recognition

Recognizing a speaker using Deep Learning

cnn deep-learning keras python scipy speaker-recognition

Last synced: 09 Nov 2024

https://github.com/fabioravila/recognito-csharp

Text Independent Speaker Recognition in CSharp, based on recognito in Java

csharp offline recognito-csharp speaker-identification speaker-recognition

Last synced: 13 Nov 2024

https://github.com/nicolay-r/book-persona-retriever

Workflow for literature character personality profiling 📚 which is solely relies on book content 📖

books dialogs dialogue-systems human-level-attributes parlai personality-traits pipeline project-gutenberg ranking-system response-generation sampling speaker-recognition

Last synced: 19 Dec 2024

https://github.com/smorodov/kaldi_vosk_win_cmake

cmake based kaldi + vosk + microphone speech recognition example

kaldi speaker-recognition speech-recognition speech-to-text voice-recognition vosk

Last synced: 06 Nov 2024

https://github.com/limdongjin/ignkafasr

Real-Time In-memory Speaker Verification and Speech Recognition Project using apache ignite, apache kafka, speechbrain, whisper, stomp, spring webflux, kubernetes(k8s)

apache-ignite apache-kafka asr audio-recorder google-kubernetes-engine k8s kubernetes speaker-recognition speaker-verification speech-recognition speechbrain springframework stomp stompwebsocket webflux whisper

Last synced: 24 Oct 2024

https://github.com/manohara-ai/speaker_recognition

Recognizing speakers using fully connected CNN with help of spectrograms

cnn machine-learning machine-learning-algorithms pytorch speaker-recognition spectrogram

Last synced: 13 Dec 2024

https://github.com/anurag1101/robot_speaker

A simple, interactive Python-based text-to-speech application that converts user input into spoken words using pyttsx3. Type any text, hear it spoken aloud, and enjoy a friendly farewell when you’re done!

libraries modules python python3 speaker-recognition speech

Last synced: 21 Dec 2024

https://github.com/slinusc/speaker_recognition_evaluation

Investigating Layer-Specific Performance in Speaker Recognition with XLS-R Architecture

hidden-states speaker-recognition xls-r

Last synced: 10 Nov 2024

https://github.com/mathusanm6/amaze-voice-lab

The goal of this research project is to be able to control the movements of characters in a Maze game using real-time voice commands such as saying out loud Up, Down, Left or Right.

asr automatic-speech-recognition game java maze research speaker-diarization speaker-recognition voice-recognition

Last synced: 13 Dec 2024