Projects in Awesome Lists tagged with kaldi

https://github.com/kaldi-asr/kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

c-plus-plus cuda kaldi shell speaker-id speaker-verification speech speech-recognition speech-to-text

Last synced: 16 Dec 2024

https://github.com/espnet/espnet

End-to-End Speech Processing Toolkit

chainer deep-learning end-to-end kaldi machine-translation pytorch singing-voice-synthesis speaker-diarization speech-enhancement speech-recognition speech-separation speech-synthesis speech-translation spoken-language-understanding text-to-speech voice-conversion

Last synced: 16 Dec 2024

https://github.com/alphacep/vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

android asr deep-learning deep-neural-networks deepspeech google-speech-to-text ios kaldi offline privacy python raspberry-pi speaker-identification speaker-verification speech-recognition speech-to-text speech-to-text-android stt voice-recognition vosk

Last synced: 16 Dec 2024

https://github.com/mravanelli/pytorch-kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

asr deep-learning deep-neural-networks dnn dnn-hmm gru kaldi lstm lstm-neural-networks multilayer-perceptron-network pytorch recurrent-neural-networks rnn rnn-model speech speech-recognition timit

Last synced: 21 Dec 2024

https://github.com/DragonComputer/Dragonfire

the open-source virtual assistant for Ubuntu based Linux distributions

artificial-intelligence chatbot kaldi linux machine-learning nlp personal-assistant spacy speech-recognition speech-to-text text-to-speech ubuntu virtual-assistant

Last synced: 07 Nov 2024

https://github.com/dragoncomputer/dragonfire

the open-source virtual assistant for Ubuntu based Linux distributions

artificial-intelligence chatbot kaldi linux machine-learning nlp personal-assistant spacy speech-recognition speech-to-text text-to-speech ubuntu virtual-assistant

Last synced: 21 Dec 2024

https://github.com/montrealcorpustools/montreal-forced-aligner

Command line utility for forced alignment using Kaldi

acoustic-model forced-alignment grapheme-to-phone kaldi pronunciation-dictionary python

Last synced: 17 Dec 2024

https://github.com/MontrealCorpusTools/Montreal-Forced-Aligner

Command line utility for forced alignment using Kaldi

acoustic-model forced-alignment grapheme-to-phone kaldi pronunciation-dictionary python

Last synced: 15 Nov 2024

https://github.com/pykaldi/pykaldi

A Python wrapper for Kaldi

asr clif feature-extraction kaldi language-model numpy openfst python speech speech-recognition wrapper

Last synced: 20 Dec 2024

https://github.com/lhotse-speech/lhotse

Tools for handling speech data in machine learning projects.

ai audio data deep-learning kaldi machine-learning python pytorch speech speech-recognition

Last synced: 28 Nov 2024

https://github.com/freewym/espresso

Espresso: A Fast End-to-End Neural Speech Recognition Toolkit

asr end-to-end fairseq kaldi python pytorch speech-recognition

Last synced: 05 Nov 2024

https://github.com/alphacep/vosk-server

WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries

asr grpc kaldi python saas speech-recognition vosk webrtc websocket

Last synced: 27 Nov 2024

https://github.com/srvk/eesen

The official repository of the Eesen project

asr ctc ctc-loss kaldi speech-recognition speech-to-text tensorflow

Last synced: 12 Nov 2024

https://github.com/bbc/react-transcript-editor

A React component to make correcting automated transcriptions of audio and video easier and faster. By BBC News Labs. - Work in progress

bbc-news-labs kaldi news-labs react stt textav transcript transcript-editor transcription

Last synced: 21 Dec 2024

https://github.com/gooofy/zamia-speech

Open tools and data for cloudless automatic speech recognition

asr cmu-sphinx kaldi language-model lexicon sequitur speech-corpora speech-recognition voxforge

Last synced: 16 Dec 2024

https://github.com/funcwj/setk

Tools for Speech Enhancement integrated with Kaldi

beamforming kaldi rir-generator speech speech-enhancement speech-separation time-frequency-masking

Last synced: 02 Nov 2024

https://github.com/ccoreilly/vosk-browser

A speech recognition library running in the browser thanks to a WebAssembly build of Vosk

asr kaldi speech-recognition speech-to-text stt typescript vosk wasm webassembly

Last synced: 15 Dec 2024

https://github.com/daanzu/kaldi-active-grammar

Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time

coding command-and-control dictation grammars kaldi kaldi-asr kaldi-grammar python speech-recognition speech-to-text voice voice-coding voice-commands voice-control

Last synced: 18 Dec 2024

https://github.com/SergeyShk/Speech-to-Text-Russian

Проект для распознавания речи на русском языке на основе pykaldi.

asr kaldi pykaldi russian-specific speech-recognition speech-to-text

Last synced: 27 Nov 2024

https://github.com/jcsilva/docker-kaldi-gstreamer-server

Dockerfile for kaldi-gstreamer-server.

asr docker kaldi kaldi-gstreamer-server worker-server

Last synced: 13 Nov 2024

https://github.com/nttcslab-sp/kaldiio

A pure python module for reading and writing kaldi ark files

file-formats fileio kaldi pure-python python python2 python3 speech-recognition

Last synced: 15 Dec 2024

https://github.com/XiaoMi/kaldi-onnx

Kaldi model converter to ONNX

android ios kaldi mace onnx speech-recognition

Last synced: 13 Nov 2024

https://github.com/xiaomi/kaldi-onnx

Kaldi model converter to ONNX

android ios kaldi mace onnx speech-recognition

Last synced: 18 Dec 2024

https://github.com/Diamondfan/CTC_pytorch

CTC end -to-end ASR for timit and 863 corpus.

ctc decoder kaldi pytorch timit

Last synced: 27 Nov 2024

https://github.com/diamondfan/ctc_pytorch

CTC end -to-end ASR for timit and 863 corpus.

ctc decoder kaldi pytorch timit

Last synced: 19 Dec 2024

https://github.com/csukuangfj/kaldifeat

Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - Provide C++ & Python API

cpp fbank features-extraction kaldi mfcc online-feature-extractor plp python pytorch streaming-feature-extractor

Last synced: 22 Dec 2024

https://github.com/jzlianglu/pykaldi2

Yet another speech toolkit based on Kaldi and PyTorch

horovod kaldi pykaldi pytorch speech-toolkit

Last synced: 09 Dec 2024

https://github.com/gooofy/py-kaldi-asr

Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.

asr kaldi kaldi-asr python python-2 speech-recognition wrapper

Last synced: 16 Dec 2024

https://github.com/CoEDL/elpis

🙊 software for creating speech recognition models.

automatic-speech-recognition computational-linguistics docker kaldi linguistics python transcription

Last synced: 15 Nov 2024

https://github.com/jinserk/pytorch-asr

ASR with PyTorch

asr capsule-network ctc decoder deepspeech densenet dictation kaldi kaldi-decoder lattice lvcsr pyro python pytorch pytorch-binding resnet speech speech-recognition ss-vae transcription

Last synced: 27 Nov 2024

https://github.com/funcwj/aps

A personal toolkit for single/multi-channel speech recognition & enhancement & separation.

end-to-end kaldi multi-channel speech speech-enhancement speech-recognition speech-separation

Last synced: 02 Nov 2024

https://github.com/jefflai108/pytorch-kaldi-neural-speaker-embeddings

A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.

kaldi learnable-dictionary-encoding pytorch speaker-identification speaker-recognition speaker-verification speech-processing

Last synced: 27 Nov 2024

https://github.com/RicherMans/PLDA

An LDA/PLDA estimator using KALDI in python for speaker verification tasks

kaldi plda speaker-verification

Last synced: 17 Nov 2024

https://github.com/yh1008/speech-to-text

mixlingual speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras

cnn dnn kaldi speech-recognition speech-to-text

Last synced: 13 Nov 2024

https://github.com/grib0ed0v/kaldi-for-russian

kaldi kaldi-asr machine-learning-algorithms speech-recognition

Last synced: 13 Nov 2024

https://github.com/mravanelli/pytorch_mlp_for_asr

This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and decoding are performed with Kaldi. The current implementation supports dropout and batch normalization. An example for phoneme recognition using the standard TIMIT dataset is provided.

asr cuda deep-learning deep-neural-networks feedforward-neural-network kaldi kaldi-asr mlp multilayer-perceptron neural-networks python pytorch speech-recognition timit

Last synced: 02 Dec 2024

https://github.com/mravanelli/pytorch_MLP_for_ASR

This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and decoding are performed with Kaldi. The current implementation supports dropout and batch normalization. An example for phoneme recognition using the standard TIMIT dataset is provided.

asr cuda deep-learning deep-neural-networks feedforward-neural-network kaldi kaldi-asr mlp multilayer-perceptron neural-networks python pytorch speech-recognition timit

Last synced: 27 Nov 2024

https://github.com/mravanelli/theano-kaldi-rnn

THEANO-KALDI-RNNs is a project implementing various Recurrent Neural Networks (RNNs) for RNN-HMM speech recognition. The Theano Code is coupled with the Kaldi decoder.

deep-learning deep-neural-networks gated-recurrent-units gru kaldi recurrent-neural-networks rnn theano theano-kaldi-rnns timit