Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Projects in Awesome Lists tagged with kaldi

A curated list of projects in awesome lists tagged with kaldi .

https://github.com/kaldi-asr/kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

c-plus-plus cuda kaldi shell speaker-id speaker-verification speech speech-recognition speech-to-text

Last synced: 16 Dec 2024

https://github.com/mravanelli/pytorch-kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

asr deep-learning deep-neural-networks dnn dnn-hmm gru kaldi lstm lstm-neural-networks multilayer-perceptron-network pytorch recurrent-neural-networks rnn rnn-model speech speech-recognition timit

Last synced: 21 Dec 2024

https://github.com/lhotse-speech/lhotse

Tools for handling speech data in machine learning projects.

ai audio data deep-learning kaldi machine-learning python pytorch speech speech-recognition

Last synced: 28 Nov 2024

https://github.com/freewym/espresso

Espresso: A Fast End-to-End Neural Speech Recognition Toolkit

asr end-to-end fairseq kaldi python pytorch speech-recognition

Last synced: 05 Nov 2024

https://github.com/alphacep/vosk-server

WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries

asr grpc kaldi python saas speech-recognition vosk webrtc websocket

Last synced: 27 Nov 2024

https://github.com/srvk/eesen

The official repository of the Eesen project

asr ctc ctc-loss kaldi speech-recognition speech-to-text tensorflow

Last synced: 12 Nov 2024

https://github.com/bbc/react-transcript-editor

A React component to make correcting automated transcriptions of audio and video easier and faster. By BBC News Labs. - Work in progress

bbc-news-labs kaldi news-labs react stt textav transcript transcript-editor transcription

Last synced: 21 Dec 2024

https://github.com/gooofy/zamia-speech

Open tools and data for cloudless automatic speech recognition

asr cmu-sphinx kaldi language-model lexicon sequitur speech-corpora speech-recognition voxforge

Last synced: 16 Dec 2024

https://github.com/funcwj/setk

Tools for Speech Enhancement integrated with Kaldi

beamforming kaldi rir-generator speech speech-enhancement speech-separation time-frequency-masking

Last synced: 02 Nov 2024

https://github.com/ccoreilly/vosk-browser

A speech recognition library running in the browser thanks to a WebAssembly build of Vosk

asr kaldi speech-recognition speech-to-text stt typescript vosk wasm webassembly

Last synced: 15 Dec 2024

https://github.com/daanzu/kaldi-active-grammar

Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time

coding command-and-control dictation grammars kaldi kaldi-asr kaldi-grammar python speech-recognition speech-to-text voice voice-coding voice-commands voice-control

Last synced: 18 Dec 2024

https://github.com/SergeyShk/Speech-to-Text-Russian

Проект для распознавания речи на русском языке на основе pykaldi.

asr kaldi pykaldi russian-specific speech-recognition speech-to-text

Last synced: 27 Nov 2024

https://github.com/nttcslab-sp/kaldiio

A pure python module for reading and writing kaldi ark files

file-formats fileio kaldi pure-python python python2 python3 speech-recognition

Last synced: 15 Dec 2024

https://github.com/XiaoMi/kaldi-onnx

Kaldi model converter to ONNX

android ios kaldi mace onnx speech-recognition

Last synced: 13 Nov 2024

https://github.com/xiaomi/kaldi-onnx

Kaldi model converter to ONNX

android ios kaldi mace onnx speech-recognition

Last synced: 18 Dec 2024

https://github.com/Diamondfan/CTC_pytorch

CTC end -to-end ASR for timit and 863 corpus.

ctc decoder kaldi pytorch timit

Last synced: 27 Nov 2024

https://github.com/diamondfan/ctc_pytorch

CTC end -to-end ASR for timit and 863 corpus.

ctc decoder kaldi pytorch timit

Last synced: 19 Dec 2024

https://github.com/csukuangfj/kaldifeat

Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - Provide C++ & Python API

cpp fbank features-extraction kaldi mfcc online-feature-extractor plp python pytorch streaming-feature-extractor

Last synced: 22 Dec 2024

https://github.com/jzlianglu/pykaldi2

Yet another speech toolkit based on Kaldi and PyTorch

horovod kaldi pykaldi pytorch speech-toolkit

Last synced: 09 Dec 2024

https://github.com/gooofy/py-kaldi-asr

Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.

asr kaldi kaldi-asr python python-2 speech-recognition wrapper

Last synced: 16 Dec 2024

https://github.com/CoEDL/elpis

🙊 software for creating speech recognition models.

automatic-speech-recognition computational-linguistics docker kaldi linguistics python transcription

Last synced: 15 Nov 2024

https://github.com/funcwj/aps

A personal toolkit for single/multi-channel speech recognition & enhancement & separation.

end-to-end kaldi multi-channel speech speech-enhancement speech-recognition speech-separation

Last synced: 02 Nov 2024

https://github.com/RicherMans/PLDA

An LDA/PLDA estimator using KALDI in python for speaker verification tasks

kaldi plda speaker-verification

Last synced: 17 Nov 2024

https://github.com/yh1008/speech-to-text

mixlingual speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras

cnn dnn kaldi speech-recognition speech-to-text

Last synced: 13 Nov 2024

https://github.com/mravanelli/pytorch_mlp_for_asr

This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and decoding are performed with Kaldi. The current implementation supports dropout and batch normalization. An example for phoneme recognition using the standard TIMIT dataset is provided.

asr cuda deep-learning deep-neural-networks feedforward-neural-network kaldi kaldi-asr mlp multilayer-perceptron neural-networks python pytorch speech-recognition timit

Last synced: 02 Dec 2024

https://github.com/mravanelli/pytorch_MLP_for_ASR

This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and decoding are performed with Kaldi. The current implementation supports dropout and batch normalization. An example for phoneme recognition using the standard TIMIT dataset is provided.

asr cuda deep-learning deep-neural-networks feedforward-neural-network kaldi kaldi-asr mlp multilayer-perceptron neural-networks python pytorch speech-recognition timit

Last synced: 27 Nov 2024

https://github.com/mravanelli/theano-kaldi-rnn

THEANO-KALDI-RNNs is a project implementing various Recurrent Neural Networks (RNNs) for RNN-HMM speech recognition. The Theano Code is coupled with the Kaldi decoder.

deep-learning deep-neural-networks gated-recurrent-units gru kaldi recurrent-neural-networks rnn theano theano-kaldi-rnns timit

Last synced: 02 Dec 2024

https://github.com/uiuc-sst/asr24

24-hour Automatic Speech Recognition

asr g2p kaldi language-model transcription

Last synced: 13 Nov 2024

https://github.com/nttcslab-sp/torchain

WIP: pytorch FFI wrapper for Kaldi chain loss (a.k.a. Lattice Free MMI)

asr kaldi pytorch

Last synced: 13 Nov 2024

https://github.com/daanzu/kaldi_ag_training

Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.

custom fine-tuning kaldi kaldi-asr personal speech speech-recognition speech-to-text training

Last synced: 08 Nov 2024

https://github.com/AASHISHAG/asr-german

Automatic Speech Recognition (ASR) - German

asr gsoc-2019 kaldi mozilla-deepspeech red-hen-labs speech speech-recognition

Last synced: 05 Nov 2024

https://github.com/mathquis/node-kaldi-online-nnet3-decoder

ASR online decoding using Kaldi NNet3 GrammarFST

asr decoder kaldi nnet3 stt

Last synced: 13 Nov 2024

https://github.com/dfordivam/hskaldi

Kaldi-ASR haskell binding experiments

haskell haskell-bindings kaldi

Last synced: 07 Nov 2024

https://github.com/smorodov/kaldi_vosk_win_cmake

cmake based kaldi + vosk + microphone speech recognition example

kaldi speaker-recognition speech-recognition speech-to-text voice-recognition vosk

Last synced: 06 Nov 2024

https://github.com/jailuthra/asr

Kaldi ASR wrapper scripts

asr kaldi praat speech speech-recognition

Last synced: 19 Nov 2024

https://github.com/alx741/kaldi-gstreamer-server-haskell-client

kaldi-gstreamer-server haskell client

client gstreamer haskell kaldi

Last synced: 07 Nov 2024

https://github.com/alx741/kaldi_spanish_dimex100

Kaldi ASR Spanish example using the DIMEx100 corpus

asr dimex100 kaldi spanish

Last synced: 07 Nov 2024

https://github.com/xx205/switchboard_training_in_minutes

PyTorch with horovod setup for distributed training of Switchboard-1 Phase 1 training data in minutes, without hurting the accuracy.

kaldi pytorch speech-recognition switchboard

Last synced: 11 Nov 2024

https://github.com/mthrok/tkaldi

Kaldi-ASR powered by PyTorch C++ API (Experimental)

asr kaldi pytorch

Last synced: 15 Dec 2024

https://github.com/sidgupta234/indian_english_asr

An Indian English ASR system based on Hidden Markov Models (HMM) has been designed using Kaldi(Povey et al., 2011).

asr indian-english-speech-data kaldi kaldi-asr

Last synced: 30 Nov 2024

https://github.com/jrmeyer/interspeech-2018

I submitted this paper to Interspeech 2018. The paper was not accepted. The reviewer comments are included in the repo.

interspeech2018 kaldi multi-task-learning rejection

Last synced: 15 Dec 2024

https://github.com/andi611/kaldi-librispeech-fmllr

This repository contains Kaldi recipes on the LibriSpeech corpora to extract fMLLR features

fmllr kaldi kaldi-librispeech librispeech librispeech-fmllr

Last synced: 02 Dec 2024

https://github.com/harisbinzia/guftaarshanaas

Urdu Speech Recognition using Hidden Markov Models and Deep Neural Networks

arl kaldi language-model pronunciation-dictionary pronunciation-lexicon speech-recognition urdu

Last synced: 07 Nov 2024