An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with speechrecognition

A curated list of projects in awesome lists tagged with speechrecognition .

https://github.com/speechbrain/speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

beamforming deep-learning deeplearning librispeech neural-network neural-networks speaker-identification speaker-recognition speaker-verification speech speech-analysis speech-api speech-emotion-recognition speech-processing speech-recognition speech-recognizer speech-separation speech-to-text speechrecognition timit

Last synced: 29 Jan 2026

https://github.com/goxr3plus/java-google-speech-api

🙊 Speech Recognition , Text To Speech , Google Translate

google-translate speechrecognition text-to-speech

Last synced: 16 Apr 2025

https://github.com/botbahlul/autosrt

A python script COMMAND LINE utility to AUTO GENERATE SUBTITLE FILE (using free Google Speech Recognition API) and TRANSLATED SUBTITLE FILE (using unofficial online Google Translate API) for any video or audio file

auto-caption auto-subtitle captions ffmpeg google-translate-api python speech-recognition speechrecognition srt-subtitle subriptext subtitle voice-recognition voicerecognition

Last synced: 18 Jun 2025

https://github.com/botbahlul/pyvosklivesubtitle

PySimpleGUI based DESKTOP APP that can RECOGNIZE any live streaming in 23 languages that supported by VOSK then TRANSLATE (using unofficial online Google Translate API) and display it as LIVE CAPTION / LIVE SUBTITLE

auto-caption caption ffmpeg google-translate-api live-caption live-subtitle pysimplegui python speech-recognition speechrecognition subtitle voice-recognition voicerecognition vosk

Last synced: 27 Jul 2025

https://github.com/botbahlul/whisper_autosrt

A python script COMMAND LINE utility to AUTO GENERATE SUBTITLE FILE (using faster_whisper module which is a reimplementation of OpenAI Whisper module) and TRANSLATED SUBTITLE FILE (using unofficial online Google Translate API) for any video or audio file

auto-caption auto-subtitle caption faster-whisper ffmpeg google-translate-api openai openai-whisper python speech-recognition speechrecognition subtitle voice-recognition voicerecognition whisper

Last synced: 23 Oct 2025

https://github.com/untemps/react-vocal

React component and hook to initiate a SpeechRecognition session

component hook javascript react reactjs speech speech-to-text speechrecognition web-speech-api

Last synced: 03 May 2026

https://github.com/botbahlul/android-autosrt-v2

ANDROID APP to AUTO GENERATE SUBTITLE FILE and TRANSLATED SUBTITLE FILE (using unofficial online Google Translate API) for any audio/video files using 2 ACTIVITIES

android caption chaquopy ffmpeg google-translate-api googletranslate java python speech-recognition speech-to-text speechrecognition subtitle voice-recognition voice-to-text voicerecognition

Last synced: 19 Aug 2025

https://github.com/azu/transcript-audio

Transcript your audio files like Podcast using SpeechRecognition and Virtual Audio Device.

audio blackhole chrome speechrecognition transcript

Last synced: 08 Oct 2025

https://github.com/botbahlul/android-autosrt

ANDROID APP to AUTO GENERATE SUBTITLE FILE and TRANSLATED SUBTITLE FILE (using unofficial online Google Translate API) for any audio/video files

android captions chaquopy ffmpeg google-translate-api java mobile-ffmpeg python speech-recognition speech-to-text speechrecognition srt-subtitle subtitle voice-recognition voice-to-text voicerecognition

Last synced: 11 Apr 2025

https://github.com/tristan296/universal-macassistant

Advanced Personal Assistant created for macOS that utilises AppleScripts, Siri and more.

applescript macos siri speechrecognition text-to-speech

Last synced: 12 Apr 2025

https://github.com/botbahlul/vosk_autosrt

A python script COMMAND LINE utility to AUTO GENERATE SUBTITLE FILE (using free Vosk Speech Recognition API) and TRANSLATED SUBTITLE FILE (using unofficial online Google Translate API) for any video or audio file

auto-caption auto-subtitle caption ffmpeg google-translate-api python speech-recognition speechrecognition subtitle voice-recognition voicerecognition vosk

Last synced: 28 Oct 2025

https://github.com/franchesoni/s2t

:speaking_head: :keyboard: Speech-to-text on key for Linux

linux onkey openai speech speech-recognition speech-to-text speechrecognition utilities whisper

Last synced: 18 Jun 2026

https://github.com/palahsu/textspeech

A python program that helps you to read your text in lady robot voice at your pace. Text Speech!

speech-recognition speech-to-text speechrecognition text-processing text-recognition text-speech text-speeches text-to-speech textspeech

Last synced: 24 Apr 2025

https://github.com/lucasrmagalhaes/supermarioenglishchallenge-js

Sistema de reconhecimento de voz em JS para aprender cores em inglês.

dio js speechrecognition

Last synced: 24 Jun 2025

https://github.com/pkparthk/buddy-ai

Buddy AI is a full-stack, AI-powered personal assistant that combines voice recognition, natural language processing, and advanced command interpretation. Built with Python (Flask) and React (TypeScript), it features smart web navigation, real-time system monitoring, weather/news APIs, and context-aware responses.

api artificial-intelligence flask gtts machine-learning python react reactjs rest-api speechrecognition tailwindcss typescript

Last synced: 08 Oct 2025

https://github.com/sebastienrousseau/audioanalyser

Audio Analyser, a cutting-edge application designed to transform audio recordings into actionable insights using Microsoft Azure AI. It offers audio recording, speech-to-text conversion, and in-depth text analysis, providing users with comprehensive and insightful reports.

audioanalyser audioprocessing audiorecording azure azure-openai azure-services sentiments-analysis sentiments-classification speech-to-text speechrecognition speechrecognition-python textanalysis translation

Last synced: 21 Mar 2025

https://github.com/ghousetazeem/personal-asisstant

Desktop Personal Assistant like Cortana using python and AI principles

pyautogui-automation pyqt5 python speechrecognition textrecognition tkinter

Last synced: 03 Aug 2025

https://github.com/sebastienrousseau/akande

An innovative, open-source voice assistant powered by OpenAI's GPT-3, designed to provide interactive, conversational experiences through both voice and text inputs. 🐍

openai openai-chatgpt pdf-generation smartassistant speechrecognition speechrecognition-python text-to-speech voiceassistant voicecontrol

Last synced: 01 Feb 2026

https://github.com/andrey06mi/context-buddy

🎨 Build effective AI prompts effortlessly with Context Buddy's visual 10-section framework for clear and structured prompt creation.

ai api automation chatgpt claude-ai coding-assistant command-line-tool feature-development flask machine-learning ollama openai perplexity-ai raycast react rest-api speechrecognition typescript

Last synced: 02 Apr 2026

https://github.com/polcats/flexiassistant

A fully customizable python-based voice assistant

speechrecognition voice-assistant voice-commands

Last synced: 03 Sep 2025

https://github.com/ashutoshpandeyofficial/jarvis

Jarvis is an AI-powered voice assistant for your laptop that helps you automate tasks, answer queries, and interact with your system using voice commands. Built using Python and various AI models, it aims to provide a seamless and smart experience for users.

openai python speechrecognition

Last synced: 21 May 2026

https://github.com/krishnasism/realtime-analysis

College Major Project. Gather pictures of the thing the speaker is currently talking about.

python speechrecognition

Last synced: 16 Jan 2026

https://github.com/romeusorionaet/nlw-expert-notes

Converta automaticamente notas de áudio em texto.

nlw note speechrecognition vite

Last synced: 11 May 2026

https://github.com/hrfmmymt/speech-input

A custom element that allows you to easily try a SpeechRecognition API on your site.

custom-elements custom-elements-v1 media-recorder mediarecorder-api speech-recognition speechrecognition web-components webcomponents

Last synced: 18 Apr 2026

https://github.com/belchenkov/speak-number-guess

Number guessing game where you speak your guess into the microphone using the speech recognition API

css3 html5 js6 speechrecognition

Last synced: 23 Jun 2026

https://github.com/projects-developer/ieee-java-project-list

IEEE Java projects encompass a wide range of applications, from Artificial Intelligence and Machine Learning to Data Science and Analytics, Networking and Cybersecurity, Internet of Things (IoT), and Includes Source Code, PPT, Synopsis, Report, Documents, Base Research Paper & Video tutorials

artificialintelligence btechprojects computerscienceprojects cybersecurity dataanalytics datascience deeplearning ieeejavaprojects ieeeprojects imageprocessing iot java javabasedprojects machinelearning mtechprojects networking speechrecognition virtualassistant

Last synced: 16 May 2026

https://github.com/hyperbayislive/lucifer-assistant

Lucifer is a powerful, offline voice assistant built specifically for Windows 10. It brings deep system-level automation, real-time voice command processing, and a fully customizable HTML-based clock utility—solving limitations of the native Windows clock.

artificialintelligence automation customvoiceassistant devtools opensource productivitytools python pythonautomation pythonprojects pythonscripts pyttsx3 speechrecognition speechtotext systemcontrol tts voiceassistant windows windowsautomation

Last synced: 23 Jun 2026

https://github.com/vandodev/nlw-expert-react

nlw-expert-react, comverte notas de áudios em testo

ai ia lucide-react radix-ui react reactjs sonner speechrecognition tailwindcss typescript vite

Last synced: 10 Apr 2026

https://github.com/hugo-hattori/tictactoe_voice_controlled

This is a game project that utilizes speech recognition package for voice command feature.

game game-development pygame python speech-recognition speech-to-text speechrecognition voice-commands voice-recognition

Last synced: 12 Nov 2025

https://github.com/moe131/speech-refiner

Speech Refiner web app to help you practice your English speaking skills using GPT-4

english-grammar english-learning english-sp gpt-4 speech-recognition speech-refiner speech-to-text speechrecognition

Last synced: 09 Apr 2025

https://github.com/adityakadam1994/voice-recognition-app

Just a fun with voice recognition app. It accepts certain commands please check read me file for it.

speech-synthesis speech-to-text speechrecognition

Last synced: 09 Oct 2025

https://github.com/manalisbhavsar/voice-book-finder

A Library System that allows to search, issue, and manage books with voice-based search functionality. Also includes user authentication and tracking book availability using a MySQL database.

mysql-database os pymysql-connection-pool python smtplib speechrecognition tkinter

Last synced: 17 Apr 2026

https://github.com/harshpimpale/sihlegalassistant

A prototype for the SIH 2024 Police Department, where users can speak about a crime scenario and receive relevant IPC sections that apply to the situation.

faiss gemini langchain llm python speechrecognition streamit transformers vector-embedding

Last synced: 02 May 2026

https://github.com/adriwco/nlw-expert-notes

Converte automaticamente notas de áudio em texto | API SpeechRecognition

date-fns lucide-react notes-app react react-dialog sonner speechrecognition speechrecognition-api tailwind typescript vite

Last synced: 03 May 2026

https://github.com/harshpimpale/customised-invitation

This project creates customized invitations by using tools for text and speech processing. Ideal for sending personalized invites, it uses image manipulation and text-to-speech technologies for a complete multimedia experience.

pandas pillow python shutil speechrecognition

Last synced: 05 May 2026