Projects in Awesome Lists tagged with kaldi
A curated list of projects in awesome lists tagged with kaldi.
https://github.com/kaldi-asr/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
c-plus-plus cuda kaldi shell speaker-id speaker-verification speech speech-recognition speech-to-text
Last synced: 12 May 2025
https://github.com/espnet/espnet
End-to-End Speech Processing Toolkit
chainer deep-learning end-to-end kaldi machine-translation pytorch singing-voice-synthesis speaker-diarization speech-enhancement speech-recognition speech-separation speech-synthesis speech-translation spoken-language-understanding text-to-speech voice-conversion
Last synced: 08 Apr 2026
https://github.com/alphacep/vosk-api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
android asr deep-learning deep-neural-networks deepspeech google-speech-to-text ios kaldi offline privacy python raspberry-pi speaker-identification speaker-verification speech-recognition speech-to-text speech-to-text-android stt voice-recognition vosk
Last synced: 12 May 2025
https://github.com/mravanelli/pytorch-kaldi
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by PyTorch, while feature extraction, label computation, and decoding are performed with the Kaldi toolkit.
asr deep-learning deep-neural-networks dnn dnn-hmm gru kaldi lstm lstm-neural-networks multilayer-perceptron-network pytorch recurrent-neural-networks rnn rnn-model speech speech-recognition timit
Last synced: 15 May 2025
https://github.com/montrealcorpustools/montreal-forced-aligner
Command line utility for forced alignment using Kaldi
acoustic-model forced-alignment grapheme-to-phone kaldi pronunciation-dictionary python
Last synced: 02 Feb 2026
https://github.com/DragonComputer/Dragonfire
The open-source virtual assistant for Ubuntu-based Linux distributions
artificial-intelligence chatbot kaldi linux machine-learning nlp personal-assistant spacy speech-recognition speech-to-text text-to-speech ubuntu virtual-assistant
Last synced: 12 Apr 2025
https://github.com/lhotse-speech/lhotse
Tools for handling multimodal data in machine learning projects.
ai audio data deep-learning kaldi machine-learning python pytorch speech speech-recognition
Last synced: 20 Apr 2026
https://github.com/pykaldi/pykaldi
A Python wrapper for Kaldi
asr clif feature-extraction kaldi language-model numpy openfst python speech speech-recognition wrapper
Last synced: 13 Apr 2025
https://github.com/freewym/espresso
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
asr end-to-end fairseq kaldi python pytorch speech-recognition
Last synced: 05 Apr 2025
https://github.com/alphacep/vosk-server
WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
asr grpc kaldi python saas speech-recognition vosk webrtc websocket
Last synced: 11 Jun 2025
https://github.com/srvk/eesen
The official repository of the Eesen project
asr ctc ctc-loss kaldi speech-recognition speech-to-text tensorflow
Last synced: 02 May 2025
https://github.com/bbc/react-transcript-editor
A React component to make correcting automated transcriptions of audio and video easier and faster. By BBC News Labs. - Work in progress
bbc-news-labs kaldi news-labs react stt textav transcript transcript-editor transcription
Last synced: 15 May 2025
https://github.com/gooofy/zamia-speech
Open tools and data for cloudless automatic speech recognition
asr cmu-sphinx kaldi language-model lexicon sequitur speech-corpora speech-recognition voxforge
Last synced: 05 Apr 2025
https://github.com/ccoreilly/vosk-browser
A speech recognition library running in the browser thanks to a WebAssembly build of Vosk
asr kaldi speech-recognition speech-to-text stt typescript vosk wasm webassembly
Last synced: 16 May 2025
https://github.com/funcwj/setk
Tools for Speech Enhancement integrated with Kaldi
beamforming kaldi rir-generator speech speech-enhancement speech-separation time-frequency-masking
Last synced: 01 Apr 2025
https://github.com/daanzu/kaldi-active-grammar
Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
coding command-and-control dictation grammars kaldi kaldi-asr kaldi-grammar python speech-recognition speech-to-text voice voice-coding voice-commands voice-control
Last synced: 04 Apr 2025
https://github.com/sergeyshk/speech-to-text-russian
A project for Russian-language speech recognition based on pykaldi.
asr kaldi pykaldi russian-specific speech-recognition speech-to-text
Last synced: 17 Jun 2025
https://github.com/jcsilva/docker-kaldi-gstreamer-server
Dockerfile for kaldi-gstreamer-server.
asr docker kaldi kaldi-gstreamer-server worker-server
Last synced: 04 May 2025
https://github.com/nttcslab-sp/kaldiio
A pure-Python module for reading and writing Kaldi ark files
file-formats fileio kaldi pure-python python python2 python3 speech-recognition
Last synced: 15 May 2025
https://github.com/XiaoMi/kaldi-onnx
Kaldi model converter to ONNX
android ios kaldi mace onnx speech-recognition
Last synced: 03 May 2025
https://github.com/csukuangfj/kaldifeat
Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - Provide C++ & Python API
cpp fbank features-extraction kaldi mfcc online-feature-extractor plp python pytorch streaming-feature-extractor
Last synced: 08 Apr 2025
https://github.com/istupakov/onnx-asr
Automatic Speech Recognition in Python using ONNX models
asr conformer kaldi onnx python speech-recognition speech-to-text stt whisper
Last synced: 26 Dec 2025
https://github.com/jzlianglu/pykaldi2
Yet another speech toolkit based on Kaldi and PyTorch
horovod kaldi pykaldi pytorch speech-toolkit
Last synced: 06 Aug 2025
https://github.com/gooofy/py-kaldi-asr
Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.
asr kaldi kaldi-asr python python-2 speech-recognition wrapper
Last synced: 17 Nov 2025
https://github.com/CoEDL/elpis
🙊 software for creating speech recognition models.
automatic-speech-recognition computational-linguistics docker kaldi linguistics python transcription
Last synced: 08 May 2025
https://github.com/jimbozhang/kaldi-gop
Kaldi-based goodness of pronunciation (GOP)
Last synced: 24 Oct 2025
https://github.com/jinserk/pytorch-asr
ASR with PyTorch
asr capsule-network ctc decoder deepspeech densenet dictation kaldi kaldi-decoder lattice lvcsr pyro python pytorch pytorch-binding resnet speech speech-recognition ss-vae transcription
Last synced: 19 Jul 2025
https://github.com/funcwj/aps
A personal toolkit for single/multi-channel speech recognition & enhancement & separation.
end-to-end kaldi multi-channel speech speech-enhancement speech-recognition speech-separation
Last synced: 01 Apr 2025
https://github.com/jefflai108/pytorch-kaldi-neural-speaker-embeddings
Lightweight neural speaker embedding extraction based on Kaldi and PyTorch.
kaldi learnable-dictionary-encoding pytorch speaker-identification speaker-recognition speaker-verification speech-processing
Last synced: 19 Jul 2025
https://github.com/RicherMans/PLDA
An LDA/PLDA estimator using KALDI in python for speaker verification tasks
kaldi plda speaker-verification
Last synced: 11 May 2025
https://github.com/skit-ai/kaldi-serve
Server framework for Kaldi ASR Toolkit
asr grpc grpc-server kaldi kaldi-asr kaldi-server speech-recognition speech-to-text
Last synced: 11 Apr 2025
https://github.com/matteo-convertino/vosk-build-model
How to create your own model for Vosk
deep-learning deep-neural-networks guide kaldi speech-recognition tutorial voice-recognition vosk walkthrough
Last synced: 10 Apr 2025
https://github.com/yh1008/speech-to-text
Mixed-lingual speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras
cnn dnn kaldi speech-recognition speech-to-text
Last synced: 05 May 2025
https://github.com/opensource-spraakherkenning-nl/Kaldi_NL
Code related to the Dutch instance and user groups of the KALDI speech recognition toolkit
dutch kaldi speech-recognition speech-recognition-model
Last synced: 29 Jun 2026
https://github.com/syntithenai/opensnips
Open source projects related to Snips https://snips.ai/.
asr audio-server dialog docker hark hotwords kaldi nlu porcupine rasa snips snips-skills snowboy speech speechrecognition
Last synced: 21 Feb 2026
https://github.com/grib0ed0v/kaldi-for-russian
kaldi kaldi-asr machine-learning-algorithms speech-recognition
Last synced: 05 May 2025
https://github.com/mravanelli/pytorch_mlp_for_asr
This code implements a basic MLP for speech recognition. The MLP is trained with PyTorch, while feature extraction, alignments, and decoding are performed with Kaldi. The current implementation supports dropout and batch normalization. An example for phoneme recognition using the standard TIMIT dataset is provided.
asr cuda deep-learning deep-neural-networks feedforward-neural-network kaldi kaldi-asr mlp multilayer-perceptron neural-networks python pytorch speech-recognition timit
Last synced: 27 Jul 2025
https://github.com/mravanelli/theano-kaldi-rnn
THEANO-KALDI-RNNs is a project implementing various Recurrent Neural Networks (RNNs) for RNN-HMM speech recognition. The Theano code is coupled with the Kaldi decoder.
deep-learning deep-neural-networks gated-recurrent-units gru kaldi recurrent-neural-networks rnn theano theano-kaldi-rnns timit
Last synced: 07 Oct 2025
https://github.com/mycrazycracy/tf-kaldi-speaker
Neural speaker recognition/verification system based on Kaldi and Tensorflow
kaldi kaldi-asr machine-learning neural-network speaker-identification speaker-recognition speaker-verification speech-processing tensorflow
Last synced: 03 May 2025
https://github.com/aalto-speech/subword-kaldi
Properly handle position-dependent phones in a subword lexicon FST
Last synced: 25 Dec 2025
https://github.com/uiuc-sst/asr24
24-hour Automatic Speech Recognition
asr g2p kaldi language-model transcription
Last synced: 04 May 2025
https://github.com/proger/uk
Phonograms and syntagms: processing tools
dataset-generation forced-alignment hmm kaldi speech-recognition ukrainian ukrainian-language
Last synced: 20 Jan 2026
https://github.com/nttcslab-sp/torchain
WIP: pytorch FFI wrapper for Kaldi chain loss (a.k.a. Lattice Free MMI)
Last synced: 21 Aug 2025
https://github.com/daanzu/kaldi_ag_training
Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.
custom fine-tuning kaldi kaldi-asr personal speech speech-recognition speech-to-text training
Last synced: 19 Apr 2025
https://github.com/AASHISHAG/asr-german
Automatic Speech Recognition (ASR) - German
asr gsoc-2019 kaldi mozilla-deepspeech red-hen-labs speech speech-recognition
Last synced: 06 Apr 2025
https://github.com/cadia-lvl/kaldi-speaker-diarization
This repository creates speaker diarization recipes to be used within the egs folder of kaldi.
ahc audio-files diarization icelandic kaldi mfccs plda speaker-diarization wav
Last synced: 11 Mar 2026
https://github.com/luan78zaoha/kaldi-timit-sre-ivector
Develop speaker recognition model based on i-vector using TIMIT database
chinese i-vector kaldi speaker-recognition speaker-verification sre
Last synced: 11 Mar 2025
https://github.com/openvoiceos/ovos-stt-plugin-vosk
Vosk STT plugin for Mycroft
asr automatic-speech-recognition hacktoberfest kaldi speech-recognition speech-to-text stt vosk
Last synced: 16 May 2025
https://github.com/agrover112/goodness-of-pronunciation-pipelines-for-oov-problem
Goodness of Pronunciation Pipelines for OOV Removal
asr hidden-markov-model kaldi kaldi-asr lexicon-based oov speech speech-recognition
Last synced: 23 Apr 2025
https://github.com/smorodov/kaldi_vosk_win_cmake
CMake-based Kaldi + Vosk + microphone speech recognition example
kaldi speaker-recognition speech-recognition speech-to-text voice-recognition vosk
Last synced: 10 Apr 2025
https://github.com/dfordivam/hskaldi
Kaldi-ASR haskell binding experiments
haskell haskell-bindings kaldi
Last synced: 12 Apr 2025
https://github.com/agrover112/kaldi-notes
Resources helpful for Kaldi
asr hacktoberfest kaldi kaldi-asr kaldi-librispeech speech speech-recognition
Last synced: 28 Feb 2026
https://github.com/linagora-labs/ssak
SSAK contains helpers and tools to process data and train/infer ASR models.
asr data-processing kaldi machine-learning nemo speech-recognition speech-to-text toolkit whisper
Last synced: 05 Oct 2025
https://github.com/msqr1/kaldi-wasm2
New compilation guide for Kaldi to WASM
Last synced: 03 Feb 2026
https://github.com/alx741/kaldi_spanish_dimex100
Kaldi ASR Spanish example using the DIMEx100 corpus
Last synced: 04 Nov 2025
https://github.com/alx741/kaldi-gstreamer-server-haskell-client
kaldi-gstreamer-server haskell client
client gstreamer haskell kaldi
Last synced: 04 Nov 2025
https://github.com/xx205/switchboard_training_in_minutes
PyTorch with horovod setup for distributed training of Switchboard-1 Phase 1 training data in minutes, without hurting the accuracy.
kaldi pytorch speech-recognition switchboard
Last synced: 17 Jun 2026
https://github.com/jailuthra/asr
Kaldi ASR wrapper scripts
asr kaldi praat speech speech-recognition
Last synced: 28 Jul 2025
https://github.com/sidgupta234/indian_english_asr
An Indian English ASR system based on Hidden Markov Models (HMM), designed using Kaldi (Povey et al., 2011).
asr indian-english-speech-data kaldi kaldi-asr
Last synced: 23 Mar 2025
https://github.com/aalto-speech/conversation-assistant
Conversation Assistant iOS-app and Kaldi ASR server for real-time automatic speech recognition in conversational situations.
augmented-reality-application ios kaldi server
Last synced: 23 May 2026
https://github.com/andi611/kaldi-librispeech-fmllr
This repository contains Kaldi recipes on the LibriSpeech corpora to extract fMLLR features
fmllr kaldi kaldi-librispeech librispeech librispeech-fmllr
Last synced: 21 Jul 2025
https://github.com/jrmeyer/interspeech-2018
I submitted this paper to Interspeech 2018. The paper was not accepted. The reviewer comments are included in the repo.
interspeech2018 kaldi multi-task-learning rejection
Last synced: 17 Jan 2026
https://github.com/techiaith/kaldi-cy
Welsh language speech recognition using Kaldi ASR
kaldi speech speech-recognition welsh
Last synced: 17 Jan 2026
https://github.com/mthrok/tkaldi
Kaldi-ASR powered by PyTorch C++ API (Experimental)
Last synced: 18 May 2026
https://github.com/techiaith/docker-kaldi-cy
An environment to facilitate training Welsh Kaldi speech recognition
cymraeg docker kaldi speech speech-recognition welsh
Last synced: 17 Jan 2026
https://github.com/simzhou/vscode-kaldi-reader
SCP & ARK Viewer for VS Code
Last synced: 11 Jun 2026
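Several entries above (for example kaldiio and vscode-kaldi-reader) work with Kaldi's scp and ark files. As a rough illustration only, the sketch below parses scp lines in the common `key path[:offset]` form, where the optional number is a byte offset into an ark file. The names `ScpEntry`, `parse_scp_line`, and `read_scp` are hypothetical and are not taken from any project listed here. Piped commands and range specifiers are assumed not to occur and are not handled.

```python
from typing import Iterable, Iterator, NamedTuple, Optional


class ScpEntry(NamedTuple):
    key: str               # utterance ID
    path: str              # file the entry points at (ark, wav, ...)
    offset: Optional[int]  # byte offset into the ark, if one was given


def parse_scp_line(line: str) -> ScpEntry:
    """Split one scp line into key, path, and optional byte offset."""
    key, rxfilename = line.strip().split(None, 1)
    path, sep, tail = rxfilename.rpartition(":")
    # Treat the suffix as an offset only when it is purely numeric,
    # so entries without an offset pass through unchanged.
    if sep and tail.isdigit():
        return ScpEntry(key, path, int(tail))
    return ScpEntry(key, rxfilename, None)


def read_scp(lines: Iterable[str]) -> Iterator[ScpEntry]:
    """Yield one ScpEntry per non-blank line."""
    for line in lines:
        if line.strip():
            yield parse_scp_line(line)


# Hypothetical sample lines, not taken from a real dataset.
example = ["utt1 feats.ark:5", "utt2 feats.ark:1042", "", "utt3 audio/utt3.wav"]
entries = list(read_scp(example))
```

Keeping the offset separate from the path lets a reader open the ark file once and seek to each record, which is the usual reason an scp index carries offsets at all.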