Projects in Awesome Lists tagged with deepspeech

https://github.com/mozilla/deepspeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

deep-learning deepspeech embedded machine-learning neural-networks offline on-device speech-recognition speech-to-text tensorflow

Last synced: 30 Dec 2024

https://github.com/mozilla/DeepSpeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

deep-learning deepspeech embedded machine-learning neural-networks offline on-device speech-recognition speech-to-text tensorflow

Last synced: 25 Oct 2024

https://github.com/mozilla/stt

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

deep-learning deepspeech embedded machine-learning neural-networks offline on-device speech-recognition speech-to-text tensorflow

Last synced: 13 Oct 2024

https://github.com/alphacep/vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

android asr deep-learning deep-neural-networks deepspeech google-speech-to-text ios kaldi offline privacy python raspberry-pi speaker-identification speaker-verification speech-recognition speech-to-text speech-to-text-android stt voice-recognition vosk

Last synced: 30 Dec 2024

https://github.com/mozilla/deepspeech-examples

Examples of how to use or integrate DeepSpeech

deepspeech dotnet examples machine-learning nodejs python speech-recognition

Last synced: 29 Dec 2024

https://github.com/mozilla/DeepSpeech-examples

Examples of how to use or integrate DeepSpeech

deepspeech dotnet examples machine-learning nodejs python speech-recognition

Last synced: 06 Nov 2024

https://github.com/yeyupiaoling/paddlepaddle-deepspeech

基于PaddlePaddle实现的语音识别，中文语音识别。项目完善，识别效果好。支持Windows，Linux下训练和预测，支持Nvidia Jetson开发板预测。

asr chinese deep-learning deepspeech deepspeech2 docker nvidia-docker paddlepaddle speech-recognition speech-to-text

Last synced: 27 Dec 2024

https://github.com/yeyupiaoling/masr

Pytorch实现的流式与非流式的自动语音识别框架，同时兼容在线和离线识别，目前支持Conformer、Squeezeformer、DeepSpeech2模型，支持多种数据增强方法。

asr conformer deep-learning deepspeech pytorch speech speech-recognition speech-to-text squeezeformer

Last synced: 26 Dec 2024

https://github.com/abhirooptalasila/autosub

A CLI script to generate subtitle files (SRT/VTT/TXT) for any video using either DeepSpeech or Coqui

asr autosub coqui-ai deepspeech ffmpeg mozilla-deepspeech python sox speech-to-text srt subtitle video

Last synced: 28 Dec 2024

https://github.com/abhirooptalasila/AutoSub

A CLI script to generate subtitle files (SRT/VTT/TXT) for any video using either DeepSpeech or Coqui

asr autosub coqui-ai deepspeech ffmpeg mozilla-deepspeech python sox speech-to-text srt subtitle video

Last synced: 07 Nov 2024

https://github.com/Picovoice/speech-to-text-benchmark

speech to text benchmark framework

aws-transcribe cheetah deep-learning deep-neural-networks deepspeech edge-ai google-speech-to-text mozilla-deepspeech offline picovoice pocketsphinx privacy speech-recognition speech-to-text voice-recognition

Last synced: 21 Nov 2024

https://github.com/picovoice/speech-to-text-benchmark

speech to text benchmark framework

aws-transcribe cheetah deep-learning deep-neural-networks deepspeech edge-ai google-speech-to-text mozilla-deepspeech offline picovoice pocketsphinx privacy speech-recognition speech-to-text voice-recognition

Last synced: 28 Dec 2024

https://github.com/robmsmt/KerasDeepSpeech

A Keras CTC implementation of Baidu's DeepSpeech for model experimentation

asr baidu coreml ctc deep-learning deeplearning deepspeech keras machine-learning neural-network neural-networks nn speech speech-to-text speechrecognition

Last synced: 27 Nov 2024

https://github.com/mozilla/dsalign

DeepSpeech based forced alignment tool

deepspeech forced-alignment

Last synced: 26 Dec 2024

https://github.com/mozilla/DSAlign

DeepSpeech based forced alignment tool

deepspeech forced-alignment

Last synced: 22 Nov 2024

https://github.com/rolczynski/automatic-speech-recognition

🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)

automatic-speech-recognition deep-learning deepspeech distill keras language-model machine-learning neural-networks speech-recognition speech-to-text tensorflow tensorflow-models

Last synced: 26 Sep 2024

https://github.com/mainro/deepspeech-server

A testing server for a speech to text service based on coqui.ai

coqui-ai deepspeech reactive-extensions reactivex rxpy speech-recognition speech-to-text

Last synced: 29 Dec 2024

https://github.com/jinserk/pytorch-asr

ASR with PyTorch

asr capsule-network ctc decoder deepspeech densenet dictation kaldi kaldi-decoder lattice lvcsr pyro python pytorch pytorch-binding resnet speech speech-recognition ss-vae transcription

Last synced: 27 Nov 2024

https://github.com/daanzu/deepspeech-websocket-server

Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments

deepspeech deepspeech-server speech-recognition speech-to-text websocket

Last synced: 01 Nov 2024

https://github.com/ccoreilly/localstt

Android Speech Recognition Service using Vosk/Kaldi and Mozilla DeepSpeech

android deepspeech speech-recognition vosk

Last synced: 01 Nov 2024

https://github.com/smlum/scription

An editor for speech-to-text transcripts such as AWS Transcribe and Mozilla DeepSpeech

aws-transcribe deepspeech google-speech-to-text mozilla-deepspeech scription speech-to-text transcription yandex-speech-kit

Last synced: 01 Nov 2024

https://github.com/ai-adv-lab/deepspeech.mxnet

A MXNet implementation of Baidu's DeepSpeech architecture

arch baidu deepspeech mxnet speech speech-recognition speech-to-text stt warp-ctc

Last synced: 08 Nov 2024

https://github.com/t-vk/termux-deepspeech

Open source offline speech recognition for Android using Mozilla's DeepSpeech in Termux

android bash deepspeech mozilla offline open-source speech-recognition termux

Last synced: 14 Nov 2024

https://github.com/gdsports/nsgadget_pi

Raspberry Pi impersonates Nintendo Switch controller

adafruit arduino deepspeech nintendo-switch nintendo-switch-gamepad pinball raspberry-pi trinket-m0 usb-controller voice-control

Last synced: 19 Dec 2024

https://github.com/gdsports/NSGadget_Pi

Raspberry Pi impersonates Nintendo Switch controller

adafruit arduino deepspeech nintendo-switch nintendo-switch-gamepad pinball raspberry-pi trinket-m0 usb-controller voice-control

Last synced: 29 Oct 2024

https://github.com/thecodrr/vspeech

📢 Complete V bindings for Mozilla's DeepSpeech TensorFlow based Speech-to-Text library. 📜

deepspeech machine-learning mozilla speech-to-text tensorflow v

Last synced: 11 Nov 2024

https://github.com/T-vK/Termux-DeepSpeech

Open source offline speech recognition for Android using Mozilla's DeepSpeech in Termux

android bash deepspeech mozilla offline open-source speech-recognition termux

Last synced: 23 Oct 2024

https://github.com/thecodrr/vave

🌊 A crazy simple library for reading/writing WAV files in V. Zero dependencies, 100% cross-platform.

audio deepspeech sample-rate v vave vlang wavefile

Last synced: 11 Nov 2024

https://github.com/mozilla/deepspeech-playbook

A crash course for training speech recognition models using DeepSpeech.

acoustic-model common-voice deepspeech language-model speech-recognition

Last synced: 07 Oct 2024

https://github.com/bjornbytes/lua-deepspeech

Lua Library for Speech Recognition

deepspeech speech speech-recognition speech-to-text

Last synced: 20 Nov 2024

https://github.com/rashadgarayev/trspeech-to-text

deepspeech mozilla speech-recognition tensorflow tensorflowdeepspeech turkish-language

Last synced: 07 Nov 2024

https://github.com/RashadGarayev/TRSpeech-to-text

deepspeech mozilla speech-recognition tensorflow tensorflowdeepspeech turkish-language

Last synced: 29 Nov 2024

https://github.com/ccoreilly/deepspeech-catala

Deepspeech ASR Model for the Catalan Language

asr asr-model catalan catalan-language deepspeech

Last synced: 23 Oct 2024

https://github.com/milahu/autosub-by-abhirooptalasila

A CLI script to generate subtitle files (SRT/VTT/TXT) for any video using either DeepSpeech or Coqui

autosub deepspeech offline speech-recognition srt subtitle-generator subtitles-generator tensorflow vtt

Last synced: 20 Nov 2024

https://github.com/ccoreilly/catalan-speech-recognition-benchmark

A benchmark of speech recognition solutions for the Catalan language

asr asr-model catala catalan catalan-language deepspeech speech-recognition speech-to-text vosk

Last synced: 11 Dec 2024

https://github.com/ocatias/AutoMash

Automatically create YouTube mashups. Given videos and a text, AutoMash will cut the videos together so the speakers in the video appears to says the given text.

creative-coding deep-speech deepspeech ibm-watson-speech meme-generator memes speech-recognition speech-to-text vosk youtube