Projects in Awesome Lists tagged with deepspeech
A curated list of projects in awesome lists tagged with deepspeech.
https://github.com/mozilla/deepspeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
deep-learning deepspeech embedded machine-learning neural-networks offline on-device speech-recognition speech-to-text tensorflow
Last synced: 12 May 2025
https://github.com/alphacep/vosk-api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
android asr deep-learning deep-neural-networks deepspeech google-speech-to-text ios kaldi offline privacy python raspberry-pi speaker-identification speaker-verification speech-recognition speech-to-text speech-to-text-android stt voice-recognition vosk
Last synced: 12 May 2025
https://github.com/mozilla/deepspeech-examples
Examples of how to use or integrate DeepSpeech
deepspeech dotnet examples machine-learning nodejs python speech-recognition
Last synced: 23 Oct 2025
https://github.com/yeyupiaoling/paddlepaddle-deepspeech
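Speech recognition implemented with PaddlePaddle, targeting Chinese speech recognition. The project is complete and recognition quality is good. Supports training and inference on Windows and Linux, and inference on Nvidia Jetson development boards.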
asr chinese deep-learning deepspeech deepspeech2 docker nvidia-docker paddlepaddle speech-recognition speech-to-text
Last synced: 15 May 2025
https://github.com/yeyupiaoling/masr
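A streaming and non-streaming automatic speech recognition framework implemented in PyTorch, compatible with both online and offline recognition. Currently supports the Conformer, Squeezeformer, and DeepSpeech2 models, along with multiple data augmentation methods.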
asr conformer deep-learning deepspeech pytorch speech speech-recognition speech-to-text squeezeformer
Last synced: 14 May 2025
https://github.com/abhirooptalasila/AutoSub
A CLI script to generate subtitle files (SRT/VTT/TXT) for any video using either DeepSpeech or Coqui
asr autosub coqui-ai deepspeech ffmpeg mozilla-deepspeech python sox speech-to-text srt subtitle video
Last synced: 13 Apr 2025
https://github.com/picovoice/speech-to-text-benchmark
Speech-to-text benchmark framework
aws-transcribe cheetah deep-learning deep-neural-networks deepspeech edge-ai google-speech-to-text mozilla-deepspeech offline picovoice pocketsphinx privacy speech-recognition speech-to-text voice-recognition
Last synced: 04 Apr 2025
https://github.com/robmsmt/KerasDeepSpeech
A Keras CTC implementation of Baidu's DeepSpeech for model experimentation
asr baidu coreml ctc deep-learning deeplearning deepspeech keras machine-learning neural-network neural-networks nn speech speech-to-text speechrecognition
Last synced: 19 Jul 2025
https://github.com/rolczynski/automatic-speech-recognition
🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)
automatic-speech-recognition deep-learning deepspeech distill keras language-model machine-learning neural-networks speech-recognition speech-to-text tensorflow tensorflow-models
Last synced: 30 Sep 2025
https://github.com/mainro/deepspeech-server
A testing server for a speech to text service based on coqui.ai
coqui-ai deepspeech reactive-extensions reactivex rxpy speech-recognition speech-to-text
Last synced: 05 Apr 2025
https://github.com/jinserk/pytorch-asr
ASR with PyTorch
asr capsule-network ctc decoder deepspeech densenet dictation kaldi kaldi-decoder lattice lvcsr pyro python pytorch pytorch-binding resnet speech speech-recognition ss-vae transcription
Last synced: 19 Jul 2025
https://github.com/daanzu/deepspeech-websocket-server
Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments
deepspeech deepspeech-server speech-recognition speech-to-text websocket
Last synced: 06 Sep 2025
https://github.com/ccoreilly/localstt
Android Speech Recognition Service using Vosk/Kaldi and Mozilla DeepSpeech
android deepspeech speech-recognition vosk
Last synced: 15 Apr 2025
https://github.com/smlum/scription
An editor for speech-to-text transcripts such as AWS Transcribe and Mozilla DeepSpeech
aws-transcribe deepspeech google-speech-to-text mozilla-deepspeech scription speech-to-text transcription yandex-speech-kit
Last synced: 31 Mar 2025
https://github.com/ai-adv-lab/deepspeech.mxnet
An MXNet implementation of Baidu's DeepSpeech architecture
arch baidu deepspeech mxnet speech speech-recognition speech-to-text stt warp-ctc
Last synced: 17 Apr 2025
https://github.com/t-vk/termux-deepspeech
Open source offline speech recognition for Android using Mozilla's DeepSpeech in Termux
android bash deepspeech mozilla offline open-source speech-recognition termux
Last synced: 08 Oct 2025
https://github.com/gdsports/NSGadget_Pi
Raspberry Pi impersonates Nintendo Switch controller
adafruit arduino deepspeech nintendo-switch nintendo-switch-gamepad pinball raspberry-pi trinket-m0 usb-controller voice-control
Last synced: 26 Mar 2025
https://github.com/thecodrr/vspeech
📢 Complete V bindings for Mozilla's DeepSpeech TensorFlow based Speech-to-Text library. 📜
deepspeech machine-learning mozilla speech-to-text tensorflow v
Last synced: 06 Mar 2026
https://github.com/thecodrr/vave
🌊 A crazy simple library for reading/writing WAV files in V. Zero dependencies, 100% cross-platform.
audio deepspeech sample-rate v vave vlang wavefile
Last synced: 05 Mar 2026
https://github.com/mozilla/deepspeech-playbook
A crash course for training speech recognition models using DeepSpeech.
acoustic-model common-voice deepspeech language-model speech-recognition
Last synced: 11 Jul 2025
https://github.com/bjornbytes/lua-deepspeech
Lua Library for Speech Recognition
deepspeech speech speech-recognition speech-to-text
Last synced: 09 Jul 2025
https://github.com/ccoreilly/deepspeech-catala
Deepspeech ASR Model for the Catalan Language
asr asr-model catalan catalan-language deepspeech
Last synced: 06 May 2025
https://github.com/milahu/autosub-by-abhirooptalasila
A CLI script to generate subtitle files (SRT/VTT/TXT) for any video using either DeepSpeech or Coqui
autosub deepspeech offline speech-recognition srt subtitle-generator subtitles-generator tensorflow vtt
Last synced: 09 Jul 2025
https://github.com/ocatias/AutoMash
Automatically create YouTube mashups. Given videos and a text, AutoMash will cut the videos together so the speakers in the videos appear to say the given text.
creative-coding deep-speech deepspeech ibm-watson-speech meme-generator memes speech-recognition speech-to-text vosk youtube
Last synced: 09 Jul 2025
https://github.com/ccoreilly/catalan-speech-recognition-benchmark
A benchmark of speech recognition solutions for the Catalan language
asr asr-model catala catalan catalan-language deepspeech speech-recognition speech-to-text vosk
Last synced: 11 Jan 2026
https://github.com/cosmoquester/speech-recognition
Develop speech recognition models with TensorFlow 2
deepspeech listen-attend-and-spell speech-recognition tensorflow tensorflow2
Last synced: 22 Jan 2026
https://github.com/waikato-datamining/tensorflow
Various applications using tensorflow.
deep-learning deepspeech efficientdet image-classification image-segmentation model-maker object-detection python tensorflow tflite
Last synced: 30 Apr 2025
https://github.com/khaykingleb/automatic-speech-recognition
QuartzNet and Deepspeech Implementation for ASR
automatic-speech-recognition deep-learning deepspeech pytorch quartz-net speech-recognition
Last synced: 27 Apr 2026
https://github.com/waikato-ufdl/wai-annotations
Python library for converting annotated datasets into various formats (e.g., image classification, object detection and speech datasets).
common-voice conversion deepspeech festvox image-annotation mscoco python3 tfrecords vgg
Last synced: 29 Jul 2025
https://github.com/kathyreid/cpug-2021-deepspeech
Lightning talk on DeepSpeech to Canberra Python Users' Group 4th March 2021
canberra deepspeech lightning-talk
Last synced: 07 Jul 2025
https://github.com/gulabpatel/speech-to-text
deepspeech gtts speech-to-text text-to-speech wav2vec2
Last synced: 21 Jul 2025
https://github.com/raci0n/deepspeech-arm
A DeepSpeech Docker container for ARM.
arm armv7 container deepspeech docker mozilla-deepspeech raspberry raspberry-pi raspberry-pi-3 raspberrypi raspbian s2t speech-to-text
Last synced: 16 Dec 2025
https://github.com/ccoreilly/telegram-deepspeech-bot
A Telegram bot that infers text from voice notes using DeepSpeech
catalan deepspeech telegram telegram-bot
Last synced: 04 Apr 2025
https://github.com/en10/deepspeech
A Simplified Example of Speech Recognition
deepspeech speech-recognition speech-to-text tensorflow
Last synced: 27 Mar 2025
https://github.com/zee-bit/team-apocalypse
This is the official repo of Team Apocalypse, BIT Mesra, for IEEE Mega Project'20.
audio deepspeech electron flask python3 speech-to-text team-apocalypse tensorflow
Last synced: 14 Apr 2026
https://github.com/studiowebux/deepspeech
Ansible scripts to install and configure DeepSpeech and TensorFlow on Ubuntu 18.04
ansible deepspeech tensorflow ubuntu ubuntu1804
Last synced: 21 Apr 2025
https://github.com/andrew-chen-wang/your-speech-recognition
UI for Making DeepSpeech Voice Recognition Fine-Tuned to Your Voice Easier
deepspeech fine-tuning voice-assistant voice-recognition
Last synced: 05 Oct 2025
https://github.com/abdur75648/dinet-inference
Create high-resolution visually dubbed videos with DINet
avatar-generation deepspeech dinet dubbing lipgan lipsync openface video-generation wav2l wav2vec
Last synced: 31 Mar 2025
https://github.com/yui-mhcp/speech_to_text
Speech-To-Text (STT) project
audio-transcription deepspeech jasper speech-to-text stt stt-api tensorflow2 video-transcription whisper
Last synced: 11 Mar 2025