Projects in Awesome Lists tagged with speechrecognition

https://github.com/speechbrain/speechbrain

A PyTorch-based Speech Toolkit

asr audio audio-processing deep-learning huggingface language-model pytorch speaker-diarization speaker-recognition speaker-verification speech-enhancement speech-processing speech-recognition speech-separation speech-to-text speech-toolkit speechrecognition spoken-language-understanding transformers voice-recognition

Last synced: 30 Dec 2024

https://github.com/speechbrain/speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit based entirely on PyTorch. With SpeechBrain, users can easily create speech processing systems for speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many other tasks.

beamforming deep-learning deeplearning librispeech neural-network neural-networks speaker-identification speaker-recognition speaker-verification speech speech-analysis speech-api speech-emotion-recognition speech-processing speech-recognition speech-recognizer speech-separation speech-to-text speechrecognition timit

Last synced: 13 Nov 2024

https://github.com/revdotcom/reverb

Open source inference code for Rev's model

asr asr-model canary deeplearning diarization docker huggingface neural-network open-source opensource pyannote rev revai speaker-diarization speech-recognition speech-to-text speechrecognition wenet whisper

Last synced: 29 Dec 2024

https://github.com/robmsmt/KerasDeepSpeech

A Keras CTC implementation of Baidu's DeepSpeech for model experimentation

asr baidu coreml ctc deep-learning deeplearning deepspeech keras machine-learning neural-network neural-networks nn speech speech-to-text speechrecognition

Last synced: 27 Nov 2024

https://github.com/goxr3plus/java-google-speech-api

🙊 Speech Recognition, Text To Speech, Google Translate

google-translate speechrecognition text-to-speech

Last synced: 30 Dec 2024

https://github.com/botbahlul/autosrt

A Python command-line utility that automatically generates a subtitle file (using the free Google Speech Recognition API) and a translated subtitle file (using the unofficial online Google Translate API) for any video or audio file

auto-caption auto-subtitle captions ffmpeg google-translate-api python speech-recognition speechrecognition srt-subtitle subriptext subtitle voice-recognition voicerecognition

Last synced: 31 Oct 2024

https://github.com/botbahlul/whisper_autosrt

A Python command-line utility that automatically generates a subtitle file (using the faster_whisper module, a reimplementation of the OpenAI Whisper module) and a translated subtitle file (using the unofficial online Google Translate API) for any video or audio file

auto-caption auto-subtitle caption faster-whisper ffmpeg google-translate-api openai openai-whisper python speech-recognition speechrecognition subtitle voice-recognition voicerecognition whisper

Last synced: 09 Oct 2024

https://github.com/botbahlul/pyvosklivesubtitle

A PySimpleGUI-based desktop app that recognizes speech from any live stream in the 23 languages supported by VOSK, translates it (using the unofficial online Google Translate API), and displays it as a live caption / live subtitle

auto-caption caption ffmpeg google-translate-api live-caption live-subtitle pysimplegui python speech-recognition speechrecognition subtitle voice-recognition voicerecognition vosk

Last synced: 14 Nov 2024

https://github.com/azu/transcript-audio

Transcribe your audio files, such as podcasts, using SpeechRecognition and a virtual audio device.

audio blackhole chrome speechrecognition transcript

Last synced: 23 Oct 2024

https://github.com/botbahlul/android-autosrt

An Android app that automatically generates a subtitle file and a translated subtitle file (using the unofficial online Google Translate API) for any audio or video file

android captions chaquopy ffmpeg google-translate-api java mobile-ffmpeg python speech-recognition speech-to-text speechrecognition srt-subtitle subtitle voice-recognition voice-to-text voicerecognition

Last synced: 14 Nov 2024

https://github.com/botbahlul/android-autosrt-v2

An Android app that automatically generates a subtitle file and a translated subtitle file (using the unofficial online Google Translate API) for any audio or video file, using 2 activities

android caption chaquopy ffmpeg google-translate-api googletranslate java python speech-recognition speech-to-text speechrecognition subtitle voice-recognition voice-to-text voicerecognition

Last synced: 14 Nov 2024

https://github.com/botbahlul/vosk_autosrt

A Python command-line utility that automatically generates a subtitle file (using the free Vosk Speech Recognition API) and a translated subtitle file (using the unofficial online Google Translate API) for any video or audio file

auto-caption auto-subtitle caption ffmpeg google-translate-api python speech-recognition speechrecognition subtitle voice-recognition voicerecognition vosk

Last synced: 14 Nov 2024

https://github.com/palahsu/textspeech

A Python program that reads your text aloud in a female robot voice at your own pace. Text Speech!

speech-recognition speech-to-text speechrecognition text-processing text-recognition text-speech text-speeches text-to-speech textspeech

Last synced: 10 Nov 2024

https://github.com/saqqdy/uni-use

Some useful composition APIs

speechrecognition speechsynthesis uni-use use-downloads use-recognition use-speak use-textarea vue-hooks

Last synced: 18 Nov 2024

https://github.com/lucasrmagalhaes/supermarioenglishchallenge-js

Voice recognition system in JS for learning colors in English.

dio js speechrecognition

Last synced: 12 Nov 2024

https://github.com/sebastienrousseau/audioanalyser

Audio Analyser is an application designed to transform audio recordings into actionable insights using Microsoft Azure AI. It offers audio recording, speech-to-text conversion, and in-depth text analysis, and produces comprehensive reports for users.

audioanalyser audioprocessing audiorecording azure azure-openai azure-services sentiments-analysis sentiments-classification speech-to-text speechrecognition speechrecognition-python textanalysis translation

Last synced: 28 Oct 2024

https://github.com/ghousetazeem/personal-asisstant

Desktop personal assistant like Cortana, built using Python and AI principles

pyautogui-automation pyqt5 python speechrecognition textrecognition tkinter

Last synced: 27 Dec 2024

https://github.com/sebastienrousseau/akande

An innovative, open-source voice assistant powered by OpenAI's GPT-3, designed to provide interactive, conversational experiences through both voice and text inputs. 🐍

openai openai-chatgpt pdf-generation smartassistant speechrecognition speechrecognition-python text-to-speech voiceassistant voicecontrol

Last synced: 12 Oct 2024

https://github.com/arbazkhan4712/speech-to-text

A program that converts speech into text using Python

pyaudio python pyttsx3 speech-recognition speech-to-text speechrecognition speechrecognition-python

Last synced: 07 Nov 2024

https://github.com/yangr0/speakify

[ Speech to Text ]

bash pip3 python python3 shell speech-recognition speech-to-text speechrecognition

Last synced: 26 Nov 2024

https://github.com/romeusorionaet/nlw-expert-notes

Automatically convert audio notes into text.

nlw note speechrecognition vite

Last synced: 12 Nov 2024

https://github.com/aaaastark/nvidia-speech-artificial-intelligence

NVIDIA Speech Artificial Intelligence. Speech AI Summit 2022.

ai-pipelines alexa common-voice mozilla mozilla-deepspeech nlp-research nvidia python pytorch speech-enhancement speech-recognition speech-synthesis speech-to-text speechrecognition zero-shot-voice-conversion

Last synced: 15 Nov 2024

https://github.com/hrfmmymt/speech-input

A custom element that allows you to easily try the SpeechRecognition API on your site.

custom-elements custom-elements-v1 media-recorder mediarecorder-api speech-recognition speechrecognition web-components webcomponents

Last synced: 17 Nov 2024

https://github.com/detain/magic-spells

Magic Spells is a game to practice Spelling Words using Speech Recognition

game learning school school-education speech speech-recognition speech-to-text speechrecognition spelling spelling-checker spelling-practice study studying

Last synced: 19 Nov 2024

https://github.com/adriwco/nlw-expert-notes

Automatically converts audio notes into text | SpeechRecognition API

date-fns lucide-react notes-app react react-dialog sonner speechrecognition speechrecognition-api tailwind typescript vite

Last synced: 04 Dec 2024

https://github.com/pedrohvfernandes/nlw-expert

date-fns docker fastify localstorage lucide-react nodejs postcss postgresql prisma profanity radix-ui reactjs redis sonner speechrecognition tailwindcss typescript vite websocket

Last synced: 19 Nov 2024

https://github.com/moe131/speech-refiner

Speech Refiner web app to help you practice your English speaking skills using GPT-4

english-grammar english-learning english-sp gpt-4 speech-recognition speech-refiner speech-to-text speechrecognition

Last synced: 23 Dec 2024

https://github.com/omarcoaur3lio/nlw-notes

App for creating text notes

radix-ui reactjs speechrecognition tailwindcss typescript

Last synced: 20 Nov 2024

https://github.com/harshpimpale/customised-invitation

This project creates customized invitations by using tools for text and speech processing. Ideal for sending personalized invites, it uses image manipulation and text-to-speech technologies for a complete multimedia experience.

pandas pillow python shutil speechrecognition

Last synced: 29 Nov 2024

https://github.com/sucodelarangela/secret-number-with-speech

Web Speech API only available for Chrome.

speech-recognition speechrecognition web-speech-api webspeechapi

Last synced: 20 Nov 2024

https://github.com/leticiabhb/wavparatxt

WAV audio to text transcription

audiototext python speechrecognition speechrecognition-python wav wavtotext

Last synced: 18 Nov 2024

https://github.com/vandodev/nlw-expert-react

nlw-expert-react, converts audio notes into text

ai ia lucide-react radix-ui react reactjs sonner speechrecognition tailwindcss typescript vite

Last synced: 07 Dec 2024

https://github.com/harshpimpale/sihlegalassistant

A prototype for the SIH 2024 Police Department, where users can speak about a crime scenario and receive relevant IPC sections that apply to the situation.

faiss gemini langchain llm python speechrecognition streamit transformers vector-embedding