Projects in Awesome Lists tagged with speech-analysis
A curated list of projects in awesome lists tagged with speech-analysis .
https://github.com/jianchang512/clone-voice
A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具,使用你的音色或任意声音来录制音频
clonevoice speech-analysis sts tts voice-assistant
Last synced: 14 May 2025
https://github.com/praat/praat.github.io
Praat: Doing Phonetics By Computer
acoustics phonetics speech speech-analysis
Last synced: 25 Apr 2026
https://github.com/praat/praat
Praat: Doing Phonetics By Computer
acoustics phonetics speech speech-analysis
Last synced: 14 May 2025
https://github.com/mmorise/world
A high-quality speech analysis, manipulation and synthesis system
speech-analysis speech-synthesis vocoder
Last synced: 14 May 2025
https://github.com/mmorise/World
A high-quality speech analysis, manipulation and synthesis system
speech-analysis speech-synthesis vocoder
Last synced: 04 May 2025
https://github.com/haoheliu/voicefixer
General Speech Restoration
declipping denoise dereverberation mel speech speech-analysis speech-enhancement speech-processing speech-synthesis super-resolution tts vocoder
Last synced: 14 May 2025
https://github.com/dmitryryumin/interspeech-2023-24-papers
INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!
acoustic adaptation asr audio-signals interspeech interspeech2023 interspeech2024 language-modeling lexical-analysis linguistic-analysis machine-translation prosody self-supervised-learning signal-processing speech-analysis speech-production speech-recognition speech-synthesis speech-technology transmission
Last synced: 24 Jan 2026
https://github.com/gemengtju/Tutorial_Separation
This repo summarizes the tutorials, datasets, papers, codes and tools for speech separation and speaker extraction task. You are kindly invited to pull requests.
deep-learning deep-neural-networks signal-processing speech-analysis speech-processing speech-separation
Last synced: 01 Apr 2025
https://github.com/jcvasquezc/DisVoice
feature extraction from speech signals
articulation pathological-speech phonation prosody signal-processing speech-analysis
Last synced: 17 Mar 2025
https://github.com/speechbrain/speechbrain.github.io
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
beamforming deep-learning deeplearning librispeech neural-network neural-networks speaker-identification speaker-recognition speaker-verification speech speech-analysis speech-api speech-emotion-recognition speech-processing speech-recognition speech-recognizer speech-separation speech-to-text speechrecognition timit
Last synced: 29 Jan 2026
https://github.com/haoheliu/voicefixer_main
General Speech Restoration
machine-learning speech speech-analysis speech-enhancement speech-processing speech-synthesis speech-to-text tts
Last synced: 06 Apr 2025
https://github.com/shahabks/myprosody
A Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) compared to ones of native speech.
acoustic-features acoustic-model phonemes prosody python-library speech-analysis speech-patterns voice-recognition
Last synced: 03 Jul 2025
https://github.com/philipperemy/tensorflow-ctc-speech-recognition
Application of Connectionist Temporal Classification (CTC) for Speech Recognition (Tensorflow 1.0 but compatible with 2.0).
ctc ctc-loss deep-learning machine-learning speech-analysis speech-recognition speech-to-text tensorflow tensorflow-1-0 tutorial
Last synced: 02 May 2025
https://github.com/at16k/at16k
Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.
asr asr-model automatic-speech-recognition pretrained-models speech-analysis speech-api speech-recognition speech-recognizer speech-to-text voice-commands voice-recognition
Last synced: 13 Jul 2025
https://github.com/JusperLee/Calculate-SNR-SDR
Script to calculate SNR and SDR using python
sdr speech-analysis speech-separation
Last synced: 01 Apr 2025
https://github.com/google/localized-narratives
Localized Narratives
computer-vision image-captioning speech-analysis
Last synced: 13 Apr 2026
https://google.github.io/localized-narratives/
Localized Narratives
computer-vision image-captioning speech-analysis
Last synced: 16 Mar 2025
https://github.com/hyeonsangjeon/computing-korean-stt-error-rates
STT 한글 문장 인식기 출력 스크립트의 외자 오류율(CER), 단어 오류율(WER)을 계산하는 Python 함수 패키지
amazon aws cer character-error-rate computing-error-rates evaluate evaluation-functions evaluation-metrics korean normalization speech-analysis speech-recognition speech-to-text test text-digitisation text-evaluation transcribe wer word-error-rate
Last synced: 22 Apr 2025
https://github.com/lennes/spect
SpeCT - Speech Corpus Toolkit for Praat. Documentation: https://lennes.github.io/spect/
analysis annotation conversational-speech corpus-linguistics corpus-tools praat spect speech speech-analysis speech-corpus spoken-language transcript transcription
Last synced: 03 Apr 2025
https://github.com/montrealcorpustools/polyglotdb
Language data store and linguistic query API
acoustics database influxdb neo4j rest-api speech-analysis speech-processing
Last synced: 06 Apr 2025
https://github.com/tabahi/webspeechanalyzer
JS speech analyzer for fast speech analysis and labeling
audio-analysis audio-processing feature feature-engineering feature-extraction formant-detection music music-information-retrieval music-visualizer phonemes signal-processing spectrum spectrum-analyzer speech speech-analysis speech-processing speech-recognition
Last synced: 11 Mar 2026
https://github.com/ringabout/scim
[wip]Speech recognition tool-box written by Nim. Based on Arraymancer.
arraymancer audio digital-signal-processing mfcc nim scientific-computing speech-analysis speech-processing speech-recognition wav
Last synced: 18 Mar 2025
https://github.com/virajbhutada/speech-emotion-recognition
This repository houses a robust speech emotion recognition system, featuring signal processing scripts, machine learning algorithms, and comprehensive documentation. It accurately classifies emotions in spoken language, enabling applications like sentiment analysis and emotion-aware systems.
audio-processing emotion-recognition machine-learning natural-language-processing python signal-processing speech-analysis speech-recognition
Last synced: 28 Apr 2025
https://github.com/zoobereq/emotional_speech
A script extracting features of emotionally charged speech
parselmouth praat speech speech-analysis speech-emotion-recognition speech-processing
Last synced: 24 Aug 2025
https://github.com/hlorenzi/vowel-analysis
Vowel formant frequency synthesis and analysis on the browser -- https://hlorenzi.github.io/vowel-analysis/
formant-detection frequency-analysis international-phonetic-alphabet ipa speech-analysis speech-recognition speech-synthesis vowel vowel-chart vowel-formants vowel-recognition vowels web-application webapp
Last synced: 10 Jan 2026
https://github.com/jjwroeloffs/dynamicfluency-core
The base python package for DynamicFluency: Monitor and understand the dynamicity of linguistic aspects in (L2) speech
nlp praat praatscript python speech-analysis
Last synced: 15 May 2026
https://github.com/jatin-8898/native-speech-recognition
A native Speech Recognition without using any api made using JS :speech_balloon:
html js json json-api native native-development native-speech-recognition speech speech-analysis speech-balloon speech-recognition speech-synthesis speech-to-text
Last synced: 16 Apr 2026
https://github.com/kasumikitsune/phontracer
一款基于 Praat (Parselmouth) 的高效语音声调(基频)特征批量提取工具。支持长音频自动切分、独立音频匹配、可视化边界微调及标准化数据导出。
acoustic-phonetics dialectology gui linguistics parselmouth phonetics praat python speech-analysis tone-extraction
Last synced: 31 May 2026
https://github.com/jjwroeloffs/dynamicfluency
DynamicFluency - Monitor and understand the dynamicity of linguistic aspects in (L2) speech
nlp praat praatscript speech-analysis
Last synced: 17 Feb 2026
https://github.com/rezadrian01/Simakin
An AI-powered Qur’an recitation companion that listens to users’ recitations, transcribes them, and provides feedback after processing. Built with Remix, Prisma, MySQL, and Gemini API.
ai fullstack gemini islamic-app memorize mysql prisma quran react-router recitation remix speech-analysis
Last synced: 11 Jun 2026
https://github.com/bakrawy2025/emotion-sentiment-classifier
Emotion & sentiment classifier in Python using TF-IDF + Logistic Regression (scikit-learn). Includes joblib model saving, evaluation and CLI prediction. 🐱💻
audio-processing confusion-matrix data-science emotion-detection-emotion-classification emotion-recognition interactive-prediction jupyter-notebook lstm-sentiment-analysis machine-learning naive-bayes-classifier nltk recurrent-neural-networks scikit-learn social speech-analysis tensorflow text-classification tweets
Last synced: 19 Aug 2025
https://github.com/axlerquiza/mental-state-recognizer
A web-based/gui-based mental state recognizer that analyzes audio recordings and predicts the user's mental state using ML models trained on speech features like MFCC.
audio-processing deep-learning machine-learning mental-health python speech-analysis
Last synced: 05 May 2025
https://github.com/meghaarajeev/emosense-emotionanalysis-machine-learning
👩🏿💻IIIT Hyderabad Reasearch Teaser Programme : We developed a robust emotion😃 recognition system utilizing machine learning techniques on the 🗣️CREMA-D dataset to classify various emotions expressed in audio recordings🎙️ accurately.
crema-d dataset emosense emotion-recognition emotion-speech iiit-hyderabad internship miniproject research-teaser speech-analysis speech-recognition
Last synced: 02 Apr 2025
https://github.com/deliprofesor/behavioral-insights-and-data-exploration
This project analyzes Spanish speech data, focusing on acoustic features and demographics. It includes data cleaning, outlier detection, clustering, and time series modeling (ARIMA, Holt-Winters) to uncover patterns in speech duration and word frequency.
acoustic-features arima clustering data-analysis holt-winters k-means machine-learning speech-analysis time-series-analysis
Last synced: 10 Apr 2025
https://github.com/manojc/speech-recognition
poc for speech recognition using annyang speech recognition library.
angular angular2 css html5 speech-analysis speech-processing speech-recognition speech-recognizer speech-to-text speechtotext typescript
Last synced: 07 Oct 2025