Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists tagged with kaldi
A curated list of projects in awesome lists tagged with kaldi .
https://github.com/kaldi-asr/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
c-plus-plus cuda kaldi shell speaker-id speaker-verification speech speech-recognition speech-to-text
Last synced: 16 Dec 2024
https://github.com/espnet/espnet
End-to-End Speech Processing Toolkit
chainer deep-learning end-to-end kaldi machine-translation pytorch singing-voice-synthesis speaker-diarization speech-enhancement speech-recognition speech-separation speech-synthesis speech-translation spoken-language-understanding text-to-speech voice-conversion
Last synced: 16 Dec 2024
https://github.com/alphacep/vosk-api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
android asr deep-learning deep-neural-networks deepspeech google-speech-to-text ios kaldi offline privacy python raspberry-pi speaker-identification speaker-verification speech-recognition speech-to-text speech-to-text-android stt voice-recognition vosk
Last synced: 16 Dec 2024
https://github.com/mravanelli/pytorch-kaldi
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
asr deep-learning deep-neural-networks dnn dnn-hmm gru kaldi lstm lstm-neural-networks multilayer-perceptron-network pytorch recurrent-neural-networks rnn rnn-model speech speech-recognition timit
Last synced: 21 Dec 2024
https://github.com/DragonComputer/Dragonfire
the open-source virtual assistant for Ubuntu based Linux distributions
artificial-intelligence chatbot kaldi linux machine-learning nlp personal-assistant spacy speech-recognition speech-to-text text-to-speech ubuntu virtual-assistant
Last synced: 07 Nov 2024
https://github.com/dragoncomputer/dragonfire
the open-source virtual assistant for Ubuntu based Linux distributions
artificial-intelligence chatbot kaldi linux machine-learning nlp personal-assistant spacy speech-recognition speech-to-text text-to-speech ubuntu virtual-assistant
Last synced: 21 Dec 2024
https://github.com/montrealcorpustools/montreal-forced-aligner
Command line utility for forced alignment using Kaldi
acoustic-model forced-alignment grapheme-to-phone kaldi pronunciation-dictionary python
Last synced: 17 Dec 2024
https://github.com/MontrealCorpusTools/Montreal-Forced-Aligner
Command line utility for forced alignment using Kaldi
acoustic-model forced-alignment grapheme-to-phone kaldi pronunciation-dictionary python
Last synced: 15 Nov 2024
https://github.com/pykaldi/pykaldi
A Python wrapper for Kaldi
asr clif feature-extraction kaldi language-model numpy openfst python speech speech-recognition wrapper
Last synced: 20 Dec 2024
https://github.com/lhotse-speech/lhotse
Tools for handling speech data in machine learning projects.
ai audio data deep-learning kaldi machine-learning python pytorch speech speech-recognition
Last synced: 28 Nov 2024
https://github.com/freewym/espresso
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
asr end-to-end fairseq kaldi python pytorch speech-recognition
Last synced: 05 Nov 2024
https://github.com/alphacep/vosk-server
WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
asr grpc kaldi python saas speech-recognition vosk webrtc websocket
Last synced: 27 Nov 2024
https://github.com/srvk/eesen
The official repository of the Eesen project
asr ctc ctc-loss kaldi speech-recognition speech-to-text tensorflow
Last synced: 12 Nov 2024
https://github.com/bbc/react-transcript-editor
A React component to make correcting automated transcriptions of audio and video easier and faster. By BBC News Labs. - Work in progress
bbc-news-labs kaldi news-labs react stt textav transcript transcript-editor transcription
Last synced: 21 Dec 2024
https://github.com/gooofy/zamia-speech
Open tools and data for cloudless automatic speech recognition
asr cmu-sphinx kaldi language-model lexicon sequitur speech-corpora speech-recognition voxforge
Last synced: 16 Dec 2024
https://github.com/funcwj/setk
Tools for Speech Enhancement integrated with Kaldi
beamforming kaldi rir-generator speech speech-enhancement speech-separation time-frequency-masking
Last synced: 02 Nov 2024
https://github.com/ccoreilly/vosk-browser
A speech recognition library running in the browser thanks to a WebAssembly build of Vosk
asr kaldi speech-recognition speech-to-text stt typescript vosk wasm webassembly
Last synced: 15 Dec 2024
https://github.com/daanzu/kaldi-active-grammar
Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
coding command-and-control dictation grammars kaldi kaldi-asr kaldi-grammar python speech-recognition speech-to-text voice voice-coding voice-commands voice-control
Last synced: 18 Dec 2024
https://github.com/SergeyShk/Speech-to-Text-Russian
Проект для распознавания речи на русском языке на основе pykaldi.
asr kaldi pykaldi russian-specific speech-recognition speech-to-text
Last synced: 27 Nov 2024
https://github.com/jcsilva/docker-kaldi-gstreamer-server
Dockerfile for kaldi-gstreamer-server.
asr docker kaldi kaldi-gstreamer-server worker-server
Last synced: 13 Nov 2024
https://github.com/nttcslab-sp/kaldiio
A pure python module for reading and writing kaldi ark files
file-formats fileio kaldi pure-python python python2 python3 speech-recognition
Last synced: 15 Dec 2024
https://github.com/XiaoMi/kaldi-onnx
Kaldi model converter to ONNX
android ios kaldi mace onnx speech-recognition
Last synced: 13 Nov 2024
https://github.com/xiaomi/kaldi-onnx
Kaldi model converter to ONNX
android ios kaldi mace onnx speech-recognition
Last synced: 18 Dec 2024
https://github.com/csukuangfj/kaldifeat
Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - Provide C++ & Python API
cpp fbank features-extraction kaldi mfcc online-feature-extractor plp python pytorch streaming-feature-extractor
Last synced: 22 Dec 2024
https://github.com/jzlianglu/pykaldi2
Yet another speech toolkit based on Kaldi and PyTorch
horovod kaldi pykaldi pytorch speech-toolkit
Last synced: 09 Dec 2024
https://github.com/gooofy/py-kaldi-asr
Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.
asr kaldi kaldi-asr python python-2 speech-recognition wrapper
Last synced: 16 Dec 2024
https://github.com/CoEDL/elpis
🙊 software for creating speech recognition models.
automatic-speech-recognition computational-linguistics docker kaldi linguistics python transcription
Last synced: 15 Nov 2024
https://github.com/jinserk/pytorch-asr
ASR with PyTorch
asr capsule-network ctc decoder deepspeech densenet dictation kaldi kaldi-decoder lattice lvcsr pyro python pytorch pytorch-binding resnet speech speech-recognition ss-vae transcription
Last synced: 27 Nov 2024
https://github.com/funcwj/aps
A personal toolkit for single/multi-channel speech recognition & enhancement & separation.
end-to-end kaldi multi-channel speech speech-enhancement speech-recognition speech-separation
Last synced: 02 Nov 2024
https://github.com/jefflai108/pytorch-kaldi-neural-speaker-embeddings
A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.
kaldi learnable-dictionary-encoding pytorch speaker-identification speaker-recognition speaker-verification speech-processing
Last synced: 27 Nov 2024
https://github.com/RicherMans/PLDA
An LDA/PLDA estimator using KALDI in python for speaker verification tasks
kaldi plda speaker-verification
Last synced: 17 Nov 2024
https://github.com/yh1008/speech-to-text
mixlingual speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras
cnn dnn kaldi speech-recognition speech-to-text
Last synced: 13 Nov 2024
https://github.com/grib0ed0v/kaldi-for-russian
kaldi kaldi-asr machine-learning-algorithms speech-recognition
Last synced: 13 Nov 2024
https://github.com/mravanelli/pytorch_mlp_for_asr
This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and decoding are performed with Kaldi. The current implementation supports dropout and batch normalization. An example for phoneme recognition using the standard TIMIT dataset is provided.
asr cuda deep-learning deep-neural-networks feedforward-neural-network kaldi kaldi-asr mlp multilayer-perceptron neural-networks python pytorch speech-recognition timit
Last synced: 02 Dec 2024
https://github.com/mravanelli/pytorch_MLP_for_ASR
This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and decoding are performed with Kaldi. The current implementation supports dropout and batch normalization. An example for phoneme recognition using the standard TIMIT dataset is provided.
asr cuda deep-learning deep-neural-networks feedforward-neural-network kaldi kaldi-asr mlp multilayer-perceptron neural-networks python pytorch speech-recognition timit
Last synced: 27 Nov 2024
https://github.com/mravanelli/theano-kaldi-rnn
THEANO-KALDI-RNNs is a project implementing various Recurrent Neural Networks (RNNs) for RNN-HMM speech recognition. The Theano Code is coupled with the Kaldi decoder.
deep-learning deep-neural-networks gated-recurrent-units gru kaldi recurrent-neural-networks rnn theano theano-kaldi-rnns timit
Last synced: 02 Dec 2024
https://github.com/mycrazycracy/tf-kaldi-speaker
Neural speaker recognition/verification system based on Kaldi and Tensorflow
kaldi kaldi-asr machine-learning neural-network speaker-identification speaker-recognition speaker-verification speech-processing tensorflow
Last synced: 13 Nov 2024
https://github.com/uiuc-sst/asr24
24-hour Automatic Speech Recognition
asr g2p kaldi language-model transcription
Last synced: 13 Nov 2024
https://github.com/nttcslab-sp/torchain
WIP: pytorch FFI wrapper for Kaldi chain loss (a.k.a. Lattice Free MMI)
Last synced: 13 Nov 2024
https://github.com/daanzu/kaldi_ag_training
Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.
custom fine-tuning kaldi kaldi-asr personal speech speech-recognition speech-to-text training
Last synced: 08 Nov 2024
https://github.com/AASHISHAG/asr-german
Automatic Speech Recognition (ASR) - German
asr gsoc-2019 kaldi mozilla-deepspeech red-hen-labs speech speech-recognition
Last synced: 05 Nov 2024
https://github.com/openvoiceos/ovos-stt-plugin-vosk
vosk STT plugin for mycroft
asr automatic-speech-recognition hacktoberfest kaldi speech-recognition speech-to-text stt vosk
Last synced: 19 Nov 2024
https://github.com/OpenVoiceOS/ovos-stt-plugin-vosk
vosk STT plugin for mycroft
asr automatic-speech-recognition hacktoberfest kaldi speech-recognition speech-to-text stt vosk
Last synced: 16 Nov 2024
https://github.com/agrover112/goodness-of-pronunciation-pipelines-for-oov-problem
Goodness of Pronunciation Pipelines for OOV Removal
asr hidden-markov-model kaldi kaldi-asr lexicon-based oov speech speech-recognition
Last synced: 10 Nov 2024
https://github.com/dfordivam/hskaldi
Kaldi-ASR haskell binding experiments
haskell haskell-bindings kaldi
Last synced: 07 Nov 2024
https://github.com/agrover112/kaldi-notes
Resources helpful for Kaldi
asr hacktoberfest kaldi kaldi-asr kaldi-librispeech speech speech-recognition
Last synced: 10 Nov 2024
https://github.com/smorodov/kaldi_vosk_win_cmake
cmake based kaldi + vosk + microphone speech recognition example
kaldi speaker-recognition speech-recognition speech-to-text voice-recognition vosk
Last synced: 06 Nov 2024
https://github.com/jailuthra/asr
Kaldi ASR wrapper scripts
asr kaldi praat speech speech-recognition
Last synced: 19 Nov 2024
https://github.com/alx741/kaldi-gstreamer-server-haskell-client
kaldi-gstreamer-server haskell client
client gstreamer haskell kaldi
Last synced: 07 Nov 2024
https://github.com/alx741/kaldi_spanish_dimex100
Kaldi ASR Spanish example using the DIMEx100 corpus
Last synced: 07 Nov 2024
https://github.com/xx205/switchboard_training_in_minutes
PyTorch with horovod setup for distributed training of Switchboard-1 Phase 1 training data in minutes, without hurting the accuracy.
kaldi pytorch speech-recognition switchboard
Last synced: 11 Nov 2024
https://github.com/mthrok/tkaldi
Kaldi-ASR powered by PyTorch C++ API (Experimental)
Last synced: 15 Dec 2024
https://github.com/sidgupta234/indian_english_asr
An Indian English ASR system based on Hidden Markov Models (HMM) has been designed using Kaldi(Povey et al., 2011).
asr indian-english-speech-data kaldi kaldi-asr
Last synced: 30 Nov 2024
https://github.com/jrmeyer/interspeech-2018
I submitted this paper to Interspeech 2018. The paper was not accepted. The reviewer comments are included in the repo.
interspeech2018 kaldi multi-task-learning rejection
Last synced: 15 Dec 2024
https://github.com/andi611/kaldi-librispeech-fmllr
This repository contains Kaldi recipes on the LibriSpeech corpora to extract fMLLR features
fmllr kaldi kaldi-librispeech librispeech librispeech-fmllr
Last synced: 02 Dec 2024
https://github.com/harisbinzia/guftaarshanaas
Urdu Speech Recognition using Hidden Markov Models and Deep Neural Networks
arl kaldi language-model pronunciation-dictionary pronunciation-lexicon speech-recognition urdu
Last synced: 07 Nov 2024