An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with deepspeech

A curated list of projects in awesome lists tagged with deepspeech .

https://github.com/mozilla/deepspeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

deep-learning deepspeech embedded machine-learning neural-networks offline on-device speech-recognition speech-to-text tensorflow

Last synced: 12 May 2025

https://github.com/mozilla/DeepSpeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

deep-learning deepspeech embedded machine-learning neural-networks offline on-device speech-recognition speech-to-text tensorflow

Last synced: 14 Mar 2025

https://github.com/mozilla/deepspeech-examples

Examples of how to use or integrate DeepSpeech

deepspeech dotnet examples machine-learning nodejs python speech-recognition

Last synced: 23 Oct 2025

https://github.com/mozilla/DeepSpeech-examples

Examples of how to use or integrate DeepSpeech

deepspeech dotnet examples machine-learning nodejs python speech-recognition

Last synced: 07 Apr 2025

https://github.com/yeyupiaoling/paddlepaddle-deepspeech

基于PaddlePaddle实现的语音识别,中文语音识别。项目完善,识别效果好。支持Windows,Linux下训练和预测,支持Nvidia Jetson开发板预测。

asr chinese deep-learning deepspeech deepspeech2 docker nvidia-docker paddlepaddle speech-recognition speech-to-text

Last synced: 15 May 2025

https://github.com/yeyupiaoling/masr

Pytorch实现的流式与非流式的自动语音识别框架,同时兼容在线和离线识别,目前支持Conformer、Squeezeformer、DeepSpeech2模型,支持多种数据增强方法。

asr conformer deep-learning deepspeech pytorch speech speech-recognition speech-to-text squeezeformer

Last synced: 14 May 2025

https://github.com/abhirooptalasila/AutoSub

A CLI script to generate subtitle files (SRT/VTT/TXT) for any video using either DeepSpeech or Coqui

asr autosub coqui-ai deepspeech ffmpeg mozilla-deepspeech python sox speech-to-text srt subtitle video

Last synced: 13 Apr 2025

https://github.com/abhirooptalasila/autosub

A CLI script to generate subtitle files (SRT/VTT/TXT) for any video using either DeepSpeech or Coqui

asr autosub coqui-ai deepspeech ffmpeg mozilla-deepspeech python sox speech-to-text srt subtitle video

Last synced: 04 Apr 2025

https://github.com/mozilla/dsalign

DeepSpeech based forced alignment tool

deepspeech forced-alignment

Last synced: 17 Mar 2025

https://github.com/mozilla/DSAlign

DeepSpeech based forced alignment tool

deepspeech forced-alignment

Last synced: 14 Jul 2025

https://github.com/mainro/deepspeech-server

A testing server for a speech to text service based on coqui.ai

coqui-ai deepspeech reactive-extensions reactivex rxpy speech-recognition speech-to-text

Last synced: 05 Apr 2025

https://github.com/daanzu/deepspeech-websocket-server

Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments

deepspeech deepspeech-server speech-recognition speech-to-text websocket

Last synced: 06 Sep 2025

https://github.com/ccoreilly/localstt

Android Speech Recognition Service using Vosk/Kaldi and Mozilla DeepSpeech

android deepspeech speech-recognition vosk

Last synced: 15 Apr 2025

https://github.com/smlum/scription

An editor for speech-to-text transcripts such as AWS Transcribe and Mozilla DeepSpeech

aws-transcribe deepspeech google-speech-to-text mozilla-deepspeech scription speech-to-text transcription yandex-speech-kit

Last synced: 31 Mar 2025

https://github.com/ai-adv-lab/deepspeech.mxnet

A MXNet implementation of Baidu's DeepSpeech architecture

arch baidu deepspeech mxnet speech speech-recognition speech-to-text stt warp-ctc

Last synced: 17 Apr 2025

https://github.com/t-vk/termux-deepspeech

Open source offline speech recognition for Android using Mozilla's DeepSpeech in Termux

android bash deepspeech mozilla offline open-source speech-recognition termux

Last synced: 08 Oct 2025

https://github.com/thecodrr/vspeech

📢 Complete V bindings for Mozilla's DeepSpeech TensorFlow based Speech-to-Text library. 📜

deepspeech machine-learning mozilla speech-to-text tensorflow v

Last synced: 06 Mar 2026

https://github.com/T-vK/Termux-DeepSpeech

Open source offline speech recognition for Android using Mozilla's DeepSpeech in Termux

android bash deepspeech mozilla offline open-source speech-recognition termux

Last synced: 11 Mar 2025

https://github.com/thecodrr/vave

🌊 A crazy simple library for reading/writing WAV files in V. Zero dependencies, 100% cross-platform.

audio deepspeech sample-rate v vave vlang wavefile

Last synced: 05 Mar 2026

https://github.com/mozilla/deepspeech-playbook

A crash course for training speech recognition models using DeepSpeech.

acoustic-model common-voice deepspeech language-model speech-recognition

Last synced: 11 Jul 2025

https://github.com/bjornbytes/lua-deepspeech

Lua Library for Speech Recognition

deepspeech speech speech-recognition speech-to-text

Last synced: 09 Jul 2025

https://github.com/ccoreilly/deepspeech-catala

Deepspeech ASR Model for the Catalan Language

asr asr-model catalan catalan-language deepspeech

Last synced: 06 May 2025

https://github.com/milahu/autosub-by-abhirooptalasila

A CLI script to generate subtitle files (SRT/VTT/TXT) for any video using either DeepSpeech or Coqui

autosub deepspeech offline speech-recognition srt subtitle-generator subtitles-generator tensorflow vtt

Last synced: 09 Jul 2025

https://github.com/ocatias/AutoMash

Automatically create YouTube mashups. Given videos and a text, AutoMash will cut the videos together so the speakers in the video appears to says the given text.

creative-coding deep-speech deepspeech ibm-watson-speech meme-generator memes speech-recognition speech-to-text vosk youtube

Last synced: 09 Jul 2025

https://github.com/ccoreilly/catalan-speech-recognition-benchmark

A benchmark of speech recognition solutions for the Catalan language

asr asr-model catala catalan catalan-language deepspeech speech-recognition speech-to-text vosk

Last synced: 11 Jan 2026

https://github.com/cosmoquester/speech-recognition

Develop speech recognition models with Tensorflow 2

deepspeech listen-attend-and-spell speech-recognition tensorflow tensorflow2

Last synced: 22 Jan 2026

https://github.com/waikato-ufdl/wai-annotations

Python library for converting annotated datasets into various formats (e.g., image classification, object detection and speech datasets).

common-voice conversion deepspeech festvox image-annotation mscoco python3 tfrecords vgg

Last synced: 29 Jul 2025

https://github.com/kathyreid/cpug-2021-deepspeech

Lightning talk on DeepSpeech to Canberra Python Users' Group 4th March 2021

canberra deepspeech lightning-talk

Last synced: 07 Jul 2025

https://github.com/ccoreilly/telegram-deepspeech-bot

A Telegram bot that infers text from voice notes using DeepSpeech

catalan deepspeech telegram telegram-bot

Last synced: 04 Apr 2025

https://github.com/wookay/sunyata.jl

Speech to Text using DeepSpeech 💋

deepspeech phonetics

Last synced: 01 Mar 2026

https://github.com/en10/deepspeech

A Simplified Example of Speech Recognition

deepspeech speech-recognition speech-to-text tensorflow

Last synced: 27 Mar 2025

https://github.com/zee-bit/team-apocalypse

This is the official repo of Team Apocalypse, BIT Mesra, for IEEE Mega Project'20.

audio deepspeech electron flask python3 speech-to-text team-apocalypse tensorflow

Last synced: 14 Apr 2026

https://github.com/studiowebux/deepspeech

Ansible scripts to install and configure DeepSpeech and tensorflow on ubuntu 18.04

ansible deepspeech tensorflow ubuntu ubuntu1804

Last synced: 21 Apr 2025

https://github.com/andrew-chen-wang/your-speech-recognition

UI for Making DeepSpeech Voice Recognition Fine-Tuned to Your Voice Easier

deepspeech fine-tuning voice-assistant voice-recognition

Last synced: 05 Oct 2025

https://github.com/abdur75648/dinet-inference

Create high-resolution visually dubbed videos with DINet

avatar-generation deepspeech dinet dubbing lipgan lipsync openface video-generation wav2l wav2vec

Last synced: 31 Mar 2025