Projects in Awesome Lists tagged with speechrecognition
A curated list of projects in awesome lists tagged with speechrecognition .
https://github.com/speechbrain/speechbrain
A PyTorch-based Speech Toolkit
asr audio audio-processing deep-learning huggingface language-model pytorch speaker-diarization speaker-recognition speaker-verification speech-enhancement speech-processing speech-recognition speech-separation speech-to-text speech-toolkit speechrecognition spoken-language-understanding transformers voice-recognition
Last synced: 13 May 2025
https://github.com/revdotcom/reverb
Open source inference code for Rev's model
asr asr-model canary deeplearning diarization docker huggingface neural-network open-source opensource pyannote rev revai speaker-diarization speech-recognition speech-to-text speechrecognition wenet whisper
Last synced: 15 May 2025
https://github.com/speechbrain/speechbrain.github.io
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
beamforming deep-learning deeplearning librispeech neural-network neural-networks speaker-identification speaker-recognition speaker-verification speech speech-analysis speech-api speech-emotion-recognition speech-processing speech-recognition speech-recognizer speech-separation speech-to-text speechrecognition timit
Last synced: 29 Jan 2026
https://github.com/samirpaulb/real-time-voice-translator
A desktop application that uses AI to translate voice between languages in real time, while preserving the speaker's tone and emotion.
deep-translator final-year-project googletranslator gtts gui linguasync machine-learning ml playsound python real-time-transcription speaker-recognition speech-to-speech speech-to-text speechrecognition text-to-speech tkinter translates-audio translation voice-translator
Last synced: 13 Oct 2025
https://github.com/robmsmt/KerasDeepSpeech
A Keras CTC implementation of Baidu's DeepSpeech for model experimentation
asr baidu coreml ctc deep-learning deeplearning deepspeech keras machine-learning neural-network neural-networks nn speech speech-to-text speechrecognition
Last synced: 19 Jul 2025
https://github.com/goxr3plus/java-google-speech-api
🙊 Speech Recognition , Text To Speech , Google Translate
google-translate speechrecognition text-to-speech
Last synced: 16 Apr 2025
https://github.com/botbahlul/autosrt
A python script COMMAND LINE utility to AUTO GENERATE SUBTITLE FILE (using free Google Speech Recognition API) and TRANSLATED SUBTITLE FILE (using unofficial online Google Translate API) for any video or audio file
auto-caption auto-subtitle captions ffmpeg google-translate-api python speech-recognition speechrecognition srt-subtitle subriptext subtitle voice-recognition voicerecognition
Last synced: 18 Jun 2025
https://github.com/syntithenai/opensnips
Open source projects related to Snips https://snips.ai/.
asr audio-server dialog docker hark hotwords kaldi nlu porcupine rasa snips snips-skills snowboy speech speechrecognition
Last synced: 21 Feb 2026
https://github.com/botbahlul/pyvosklivesubtitle
PySimpleGUI based DESKTOP APP that can RECOGNIZE any live streaming in 23 languages that supported by VOSK then TRANSLATE (using unofficial online Google Translate API) and display it as LIVE CAPTION / LIVE SUBTITLE
auto-caption caption ffmpeg google-translate-api live-caption live-subtitle pysimplegui python speech-recognition speechrecognition subtitle voice-recognition voicerecognition vosk
Last synced: 27 Jul 2025
https://github.com/botbahlul/whisper_autosrt
A python script COMMAND LINE utility to AUTO GENERATE SUBTITLE FILE (using faster_whisper module which is a reimplementation of OpenAI Whisper module) and TRANSLATED SUBTITLE FILE (using unofficial online Google Translate API) for any video or audio file
auto-caption auto-subtitle caption faster-whisper ffmpeg google-translate-api openai openai-whisper python speech-recognition speechrecognition subtitle voice-recognition voicerecognition whisper
Last synced: 23 Oct 2025
https://github.com/untemps/react-vocal
React component and hook to initiate a SpeechRecognition session
component hook javascript react reactjs speech speech-to-text speechrecognition web-speech-api
Last synced: 03 May 2026
https://github.com/botbahlul/android-autosrt-v2
ANDROID APP to AUTO GENERATE SUBTITLE FILE and TRANSLATED SUBTITLE FILE (using unofficial online Google Translate API) for any audio/video files using 2 ACTIVITIES
android caption chaquopy ffmpeg google-translate-api googletranslate java python speech-recognition speech-to-text speechrecognition subtitle voice-recognition voice-to-text voicerecognition
Last synced: 19 Aug 2025
https://github.com/azu/transcript-audio
Transcript your audio files like Podcast using SpeechRecognition and Virtual Audio Device.
audio blackhole chrome speechrecognition transcript
Last synced: 08 Oct 2025
https://github.com/botbahlul/android-autosrt
ANDROID APP to AUTO GENERATE SUBTITLE FILE and TRANSLATED SUBTITLE FILE (using unofficial online Google Translate API) for any audio/video files
android captions chaquopy ffmpeg google-translate-api java mobile-ffmpeg python speech-recognition speech-to-text speechrecognition srt-subtitle subtitle voice-recognition voice-to-text voicerecognition
Last synced: 11 Apr 2025
https://github.com/tristan296/universal-macassistant
Advanced Personal Assistant created for macOS that utilises AppleScripts, Siri and more.
applescript macos siri speechrecognition text-to-speech
Last synced: 12 Apr 2025
https://github.com/botbahlul/vosk_autosrt
A python script COMMAND LINE utility to AUTO GENERATE SUBTITLE FILE (using free Vosk Speech Recognition API) and TRANSLATED SUBTITLE FILE (using unofficial online Google Translate API) for any video or audio file
auto-caption auto-subtitle caption ffmpeg google-translate-api python speech-recognition speechrecognition subtitle voice-recognition voicerecognition vosk
Last synced: 28 Oct 2025
https://github.com/franchesoni/s2t
:speaking_head: :keyboard: Speech-to-text on key for Linux
linux onkey openai speech speech-recognition speech-to-text speechrecognition utilities whisper
Last synced: 18 Jun 2026
https://github.com/palahsu/textspeech
A python program that helps you to read your text in lady robot voice at your pace. Text Speech!
speech-recognition speech-to-text speechrecognition text-processing text-recognition text-speech text-speeches text-to-speech textspeech
Last synced: 24 Apr 2025
https://github.com/hritikgupta/offline-speech-recognition-app
Performs action based on speech reognised offline
android-application android-library android-studio speechrecognition
Last synced: 19 Jun 2025
https://github.com/lucasrmagalhaes/supermarioenglishchallenge-js
Sistema de reconhecimento de voz em JS para aprender cores em inglês.
Last synced: 24 Jun 2025
https://github.com/pkparthk/buddy-ai
Buddy AI is a full-stack, AI-powered personal assistant that combines voice recognition, natural language processing, and advanced command interpretation. Built with Python (Flask) and React (TypeScript), it features smart web navigation, real-time system monitoring, weather/news APIs, and context-aware responses.
api artificial-intelligence flask gtts machine-learning python react reactjs rest-api speechrecognition tailwindcss typescript
Last synced: 08 Oct 2025
https://github.com/sebastienrousseau/audioanalyser
Audio Analyser, a cutting-edge application designed to transform audio recordings into actionable insights using Microsoft Azure AI. It offers audio recording, speech-to-text conversion, and in-depth text analysis, providing users with comprehensive and insightful reports.
audioanalyser audioprocessing audiorecording azure azure-openai azure-services sentiments-analysis sentiments-classification speech-to-text speechrecognition speechrecognition-python textanalysis translation
Last synced: 21 Mar 2025
https://github.com/ghousetazeem/personal-asisstant
Desktop Personal Assistant like Cortana using python and AI principles
pyautogui-automation pyqt5 python speechrecognition textrecognition tkinter
Last synced: 03 Aug 2025
https://github.com/saqqdy/uni-use
Some useful composition api
speechrecognition speechsynthesis uni-use use-downloads use-recognition use-speak use-textarea vue-hooks
Last synced: 13 Aug 2025
https://github.com/sebastienrousseau/akande
An innovative, open-source voice assistant powered by OpenAI's GPT-3, designed to provide interactive, conversational experiences through both voice and text inputs. 🐍
openai openai-chatgpt pdf-generation smartassistant speechrecognition speechrecognition-python text-to-speech voiceassistant voicecontrol
Last synced: 01 Feb 2026
https://github.com/andrey06mi/context-buddy
🎨 Build effective AI prompts effortlessly with Context Buddy's visual 10-section framework for clear and structured prompt creation.
ai api automation chatgpt claude-ai coding-assistant command-line-tool feature-development flask machine-learning ollama openai perplexity-ai raycast react rest-api speechrecognition typescript
Last synced: 02 Apr 2026
https://github.com/aaaastark/nvidia-speech-artificial-intelligence
NVIDIA Speech Artificial Intelligence. Speech AI Summit 2022.
ai-pipelines alexa common-voice mozilla mozilla-deepspeech nlp-research nvidia python pytorch speech-enhancement speech-recognition speech-synthesis speech-to-text speechrecognition zero-shot-voice-conversion
Last synced: 05 May 2026
https://github.com/arbazkhan4712/speech-to-text
A program that can convert Speech into Text using python
pyaudio python pyttsx3 speech-recognition speech-to-text speechrecognition speechrecognition-python
Last synced: 10 Apr 2025
https://github.com/yangr0/speakify
[ Speech to Text ]
bash pip3 python python3 shell speech-recognition speech-to-text speechrecognition
Last synced: 04 May 2026
https://github.com/polcats/flexiassistant
A fully customizable python-based voice assistant
speechrecognition voice-assistant voice-commands
Last synced: 03 Sep 2025
https://github.com/ashutoshpandeyofficial/jarvis
Jarvis is an AI-powered voice assistant for your laptop that helps you automate tasks, answer queries, and interact with your system using voice commands. Built using Python and various AI models, it aims to provide a seamless and smart experience for users.
openai python speechrecognition
Last synced: 21 May 2026
https://github.com/krishnasism/realtime-analysis
College Major Project. Gather pictures of the thing the speaker is currently talking about.
Last synced: 16 Jan 2026
https://github.com/romeusorionaet/nlw-expert-notes
Converta automaticamente notas de áudio em texto.
nlw note speechrecognition vite
Last synced: 11 May 2026
https://github.com/hrfmmymt/speech-input
A custom element that allows you to easily try a SpeechRecognition API on your site.
custom-elements custom-elements-v1 media-recorder mediarecorder-api speech-recognition speechrecognition web-components webcomponents
Last synced: 18 Apr 2026
https://github.com/belchenkov/speak-number-guess
Number guessing game where you speak your guess into the microphone using the speech recognition API
css3 html5 js6 speechrecognition
Last synced: 23 Jun 2026
https://github.com/leticiabhb/wavparatxt
Transcrição de Áudio WAV para Texto
audiototext python speechrecognition speechrecognition-python wav wavtotext
Last synced: 10 Jun 2025
https://github.com/projects-developer/ieee-java-project-list
IEEE Java projects encompass a wide range of applications, from Artificial Intelligence and Machine Learning to Data Science and Analytics, Networking and Cybersecurity, Internet of Things (IoT), and Includes Source Code, PPT, Synopsis, Report, Documents, Base Research Paper & Video tutorials
artificialintelligence btechprojects computerscienceprojects cybersecurity dataanalytics datascience deeplearning ieeejavaprojects ieeeprojects imageprocessing iot java javabasedprojects machinelearning mtechprojects networking speechrecognition virtualassistant
Last synced: 16 May 2026
https://github.com/hyperbayislive/lucifer-assistant
Lucifer is a powerful, offline voice assistant built specifically for Windows 10. It brings deep system-level automation, real-time voice command processing, and a fully customizable HTML-based clock utility—solving limitations of the native Windows clock.
artificialintelligence automation customvoiceassistant devtools opensource productivitytools python pythonautomation pythonprojects pythonscripts pyttsx3 speechrecognition speechtotext systemcontrol tts voiceassistant windows windowsautomation
Last synced: 23 Jun 2026
https://github.com/vandodev/nlw-expert-react
nlw-expert-react, comverte notas de áudios em testo
ai ia lucide-react radix-ui react reactjs sonner speechrecognition tailwindcss typescript vite
Last synced: 10 Apr 2026
https://github.com/hugo-hattori/tictactoe_voice_controlled
This is a game project that utilizes speech recognition package for voice command feature.
game game-development pygame python speech-recognition speech-to-text speechrecognition voice-commands voice-recognition
Last synced: 12 Nov 2025
https://github.com/moe131/speech-refiner
Speech Refiner web app to help you practice your English speaking skills using GPT-4
english-grammar english-learning english-sp gpt-4 speech-recognition speech-refiner speech-to-text speechrecognition
Last synced: 09 Apr 2025
https://github.com/adityakadam1994/voice-recognition-app
Just a fun with voice recognition app. It accepts certain commands please check read me file for it.
speech-synthesis speech-to-text speechrecognition
Last synced: 09 Oct 2025
https://github.com/detain/magic-spells
Magic Spells is a game to practice Spelling Words using Speech Recognition
game learning school school-education speech speech-recognition speech-to-text speechrecognition spelling spelling-checker spelling-practice study studying
Last synced: 16 Mar 2026
https://github.com/manalisbhavsar/voice-book-finder
A Library System that allows to search, issue, and manage books with voice-based search functionality. Also includes user authentication and tracking book availability using a MySQL database.
mysql-database os pymysql-connection-pool python smtplib speechrecognition tkinter
Last synced: 17 Apr 2026
https://github.com/harshpimpale/sihlegalassistant
A prototype for the SIH 2024 Police Department, where users can speak about a crime scenario and receive relevant IPC sections that apply to the situation.
faiss gemini langchain llm python speechrecognition streamit transformers vector-embedding
Last synced: 02 May 2026
https://github.com/adriwco/nlw-expert-notes
Converte automaticamente notas de áudio em texto | API SpeechRecognition
date-fns lucide-react notes-app react react-dialog sonner speechrecognition speechrecognition-api tailwind typescript vite
Last synced: 03 May 2026
https://github.com/harshpimpale/customised-invitation
This project creates customized invitations by using tools for text and speech processing. Ideal for sending personalized invites, it uses image manipulation and text-to-speech technologies for a complete multimedia experience.
pandas pillow python shutil speechrecognition
Last synced: 05 May 2026
https://github.com/jordane-chaves/nlw-expert-notes
NLW #14 Expert | Notes
radix-ui reactjs speechrecognition tailwindcss
Last synced: 07 May 2026
https://github.com/luizmiguelrosa/karen-virtual-assistant
A simple virtual assistant
nlp python speech-recognition speechrecognition virtual-assistant virtualassistant
Last synced: 02 Apr 2025