An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with kaldi

A curated list of projects in awesome lists tagged with kaldi.

https://github.com/kaldi-asr/kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

c-plus-plus cuda kaldi shell speaker-id speaker-verification speech speech-recognition speech-to-text

Last synced: 12 May 2025

https://github.com/mravanelli/pytorch-kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

asr deep-learning deep-neural-networks dnn dnn-hmm gru kaldi lstm lstm-neural-networks multilayer-perceptron-network pytorch recurrent-neural-networks rnn rnn-model speech speech-recognition timit

Last synced: 15 May 2025

https://github.com/lhotse-speech/lhotse

Tools for handling multimodal data in machine learning projects.

ai audio data deep-learning kaldi machine-learning python pytorch speech speech-recognition

Last synced: 20 Apr 2026

https://github.com/freewym/espresso

Espresso: A Fast End-to-End Neural Speech Recognition Toolkit

asr end-to-end fairseq kaldi python pytorch speech-recognition

Last synced: 05 Apr 2025

https://github.com/alphacep/vosk-server

WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries

asr grpc kaldi python saas speech-recognition vosk webrtc websocket

Last synced: 11 Jun 2025

https://github.com/srvk/eesen

The official repository of the Eesen project

asr ctc ctc-loss kaldi speech-recognition speech-to-text tensorflow

Last synced: 02 May 2025

https://github.com/bbc/react-transcript-editor

A React component to make correcting automated transcriptions of audio and video easier and faster. By BBC News Labs. Work in progress.

bbc-news-labs kaldi news-labs react stt textav transcript transcript-editor transcription

Last synced: 15 May 2025

https://github.com/gooofy/zamia-speech

Open tools and data for cloudless automatic speech recognition

asr cmu-sphinx kaldi language-model lexicon sequitur speech-corpora speech-recognition voxforge

Last synced: 05 Apr 2025

https://github.com/ccoreilly/vosk-browser

A speech recognition library running in the browser thanks to a WebAssembly build of Vosk

asr kaldi speech-recognition speech-to-text stt typescript vosk wasm webassembly

Last synced: 16 May 2025

https://github.com/funcwj/setk

Tools for Speech Enhancement integrated with Kaldi

beamforming kaldi rir-generator speech speech-enhancement speech-separation time-frequency-masking

Last synced: 01 Apr 2025

https://github.com/daanzu/kaldi-active-grammar

Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time

coding command-and-control dictation grammars kaldi kaldi-asr kaldi-grammar python speech-recognition speech-to-text voice voice-coding voice-commands voice-control

Last synced: 04 Apr 2025

https://github.com/sergeyshk/speech-to-text-russian

A Russian-language speech recognition project based on pykaldi.

asr kaldi pykaldi russian-specific speech-recognition speech-to-text

Last synced: 17 Jun 2025

https://github.com/SergeyShk/Speech-to-Text-Russian

A Russian-language speech recognition project based on pykaldi.

asr kaldi pykaldi russian-specific speech-recognition speech-to-text

Last synced: 19 Jul 2025

https://github.com/nttcslab-sp/kaldiio

A pure python module for reading and writing kaldi ark files

file-formats fileio kaldi pure-python python python2 python3 speech-recognition

Last synced: 15 May 2025

https://github.com/XiaoMi/kaldi-onnx

Kaldi model converter to ONNX

android ios kaldi mace onnx speech-recognition

Last synced: 03 May 2025

https://github.com/xiaomi/kaldi-onnx

Kaldi model converter to ONNX

android ios kaldi mace onnx speech-recognition

Last synced: 09 Apr 2025

https://github.com/Diamondfan/CTC_pytorch

CTC end-to-end ASR for the TIMIT and 863 corpora.

ctc decoder kaldi pytorch timit

Last synced: 19 Jul 2025

https://github.com/diamondfan/ctc_pytorch

CTC end-to-end ASR for the TIMIT and 863 corpora.

ctc decoder kaldi pytorch timit

Last synced: 07 May 2025

https://github.com/csukuangfj/kaldifeat

Kaldi-compatible online and offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd. Provides C++ and Python APIs.

cpp fbank features-extraction kaldi mfcc online-feature-extractor plp python pytorch streaming-feature-extractor

Last synced: 08 Apr 2025

https://github.com/istupakov/onnx-asr

Automatic Speech Recognition in Python using ONNX models

asr conformer kaldi onnx python speech-recognition speech-to-text stt whisper

Last synced: 26 Dec 2025

https://github.com/jzlianglu/pykaldi2

Yet another speech toolkit based on Kaldi and PyTorch

horovod kaldi pykaldi pytorch speech-toolkit

Last synced: 06 Aug 2025

https://github.com/gooofy/py-kaldi-asr

Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.

asr kaldi kaldi-asr python python-2 speech-recognition wrapper

Last synced: 17 Nov 2025

https://github.com/CoEDL/elpis

🙊 software for creating speech recognition models.

automatic-speech-recognition computational-linguistics docker kaldi linguistics python transcription

Last synced: 08 May 2025

https://github.com/jimbozhang/kaldi-gop

Kaldi-based goodness of pronunciation (GOP)

kaldi speech-recognition

Last synced: 24 Oct 2025

https://github.com/funcwj/aps

A personal toolkit for single/multi-channel speech recognition & enhancement & separation.

end-to-end kaldi multi-channel speech speech-enhancement speech-recognition speech-separation

Last synced: 01 Apr 2025

https://github.com/RicherMans/PLDA

An LDA/PLDA estimator using KALDI in python for speaker verification tasks

kaldi plda speaker-verification

Last synced: 11 May 2025

https://github.com/yh1008/speech-to-text

Mixed-language speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras

cnn dnn kaldi speech-recognition speech-to-text

Last synced: 05 May 2025

https://github.com/opensource-spraakherkenning-nl/Kaldi_NL

Code related to the Dutch instance and user groups of the KALDI speech recognition toolkit

dutch kaldi speech-recognition speech-recognition-model

Last synced: 29 Jun 2026

https://github.com/mravanelli/pytorch_mlp_for_asr

This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and decoding are performed with Kaldi. The current implementation supports dropout and batch normalization. An example for phoneme recognition using the standard TIMIT dataset is provided.

asr cuda deep-learning deep-neural-networks feedforward-neural-network kaldi kaldi-asr mlp multilayer-perceptron neural-networks python pytorch speech-recognition timit

Last synced: 27 Jul 2025

https://github.com/mravanelli/pytorch_MLP_for_ASR

This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and decoding are performed with Kaldi. The current implementation supports dropout and batch normalization. An example for phoneme recognition using the standard TIMIT dataset is provided.

asr cuda deep-learning deep-neural-networks feedforward-neural-network kaldi kaldi-asr mlp multilayer-perceptron neural-networks python pytorch speech-recognition timit

Last synced: 19 Jul 2025

https://github.com/mravanelli/theano-kaldi-rnn

THEANO-KALDI-RNNs is a project implementing various Recurrent Neural Networks (RNNs) for RNN-HMM speech recognition. The Theano Code is coupled with the Kaldi decoder.

deep-learning deep-neural-networks gated-recurrent-units gru kaldi recurrent-neural-networks rnn theano theano-kaldi-rnns timit

Last synced: 07 Oct 2025

https://github.com/aalto-speech/subword-kaldi

Properly handle position-dependent phones in a subword lexicon FST

kaldi subword-units

Last synced: 25 Dec 2025

https://github.com/uiuc-sst/asr24

24-hour Automatic Speech Recognition

asr g2p kaldi language-model transcription

Last synced: 04 May 2025

https://github.com/proger/uk

Phonograms and syntagmas: processing tools

dataset-generation forced-alignment hmm kaldi speech-recognition ukrainian ukrainian-language

Last synced: 20 Jan 2026

https://github.com/nttcslab-sp/torchain

WIP: pytorch FFI wrapper for Kaldi chain loss (a.k.a. Lattice Free MMI)

asr kaldi pytorch

Last synced: 21 Aug 2025

https://github.com/daanzu/kaldi_ag_training

Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.

custom fine-tuning kaldi kaldi-asr personal speech speech-recognition speech-to-text training

Last synced: 19 Apr 2025

https://github.com/AASHISHAG/asr-german

Automatic Speech Recognition (ASR) - German

asr gsoc-2019 kaldi mozilla-deepspeech red-hen-labs speech speech-recognition

Last synced: 06 Apr 2025

https://github.com/cadia-lvl/kaldi-speaker-diarization

This repository creates speaker diarization recipes to be used within the egs folder of kaldi.

ahc audio-files diarization icelandic kaldi mfccs plda speaker-diarization wav

Last synced: 11 Mar 2026

https://github.com/luan78zaoha/kaldi-timit-sre-ivector

Develop speaker recognition model based on i-vector using TIMIT database

chinese i-vector kaldi speaker-recognition speaker-verification sre

Last synced: 11 Mar 2025

https://github.com/mathquis/node-kaldi-online-nnet3-decoder

ASR online decoding using Kaldi NNet3 GrammarFST

asr decoder kaldi nnet3 stt

Last synced: 05 May 2025

https://github.com/smorodov/kaldi_vosk_win_cmake

cmake based kaldi + vosk + microphone speech recognition example

kaldi speaker-recognition speech-recognition speech-to-text voice-recognition vosk

Last synced: 10 Apr 2025

https://github.com/dfordivam/hskaldi

Kaldi-ASR haskell binding experiments

haskell haskell-bindings kaldi

Last synced: 12 Apr 2025

https://github.com/linagora-labs/ssak

SSAK contains helpers and tools to process data and train/infer ASR models.

asr data-processing kaldi machine-learning nemo speech-recognition speech-to-text toolkit whisper

Last synced: 05 Oct 2025

https://github.com/msqr1/kaldi-wasm2

New compilation guide for Kaldi to WASM

kaldi webassembly

Last synced: 03 Feb 2026

https://github.com/alx741/kaldi_spanish_dimex100

Kaldi ASR Spanish example using the DIMEx100 corpus

asr dimex100 kaldi spanish

Last synced: 04 Nov 2025

https://github.com/alx741/kaldi-gstreamer-server-haskell-client

kaldi-gstreamer-server haskell client

client gstreamer haskell kaldi

Last synced: 04 Nov 2025

https://github.com/xx205/switchboard_training_in_minutes

PyTorch with horovod setup for distributed training of Switchboard-1 Phase 1 training data in minutes, without hurting the accuracy.

kaldi pytorch speech-recognition switchboard

Last synced: 17 Jun 2026

https://github.com/jailuthra/asr

Kaldi ASR wrapper scripts

asr kaldi praat speech speech-recognition

Last synced: 28 Jul 2025

https://github.com/sidgupta234/indian_english_asr

An Indian English ASR system based on Hidden Markov Models (HMMs), designed using Kaldi (Povey et al., 2011).

asr indian-english-speech-data kaldi kaldi-asr

Last synced: 23 Mar 2025

https://github.com/aalto-speech/conversation-assistant

Conversation Assistant iOS-app and Kaldi ASR server for real-time automatic speech recognition in conversational situations.

augmented-reality-application ios kaldi server

Last synced: 23 May 2026

https://github.com/andi611/kaldi-librispeech-fmllr

This repository contains Kaldi recipes on the LibriSpeech corpora to extract fMLLR features

fmllr kaldi kaldi-librispeech librispeech librispeech-fmllr

Last synced: 21 Jul 2025

https://github.com/jrmeyer/interspeech-2018

I submitted this paper to Interspeech 2018. The paper was not accepted. The reviewer comments are included in the repo.

interspeech2018 kaldi multi-task-learning rejection

Last synced: 17 Jan 2026

https://github.com/techiaith/kaldi-cy

Welsh language speech recognition using Kaldi ASR

kaldi speech speech-recognition welsh

Last synced: 17 Jan 2026

https://github.com/mthrok/tkaldi

Kaldi-ASR powered by PyTorch C++ API (Experimental)

asr kaldi pytorch

Last synced: 18 May 2026

https://github.com/techiaith/docker-kaldi-cy

An environment to facilitate training Welsh speech recognition with Kaldi

cymraeg docker kaldi speech speech-recognition welsh

Last synced: 17 Jan 2026

https://github.com/simzhou/vscode-kaldi-reader

SCP & ARK Viewer for VS Code

kaldi vscode-extension

Last synced: 11 Jun 2026

https://github.com/aalto-speech/kaldi-utensils

Kaldi utility scripts

kaldi utility

Last synced: 25 May 2026