Projects in Awesome Lists tagged with speech-analysis

https://github.com/jianchang512/clone-voice

A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具，使用你的音色或任意声音来录制音频

clonevoice speech-analysis sts tts voice-assistant

Last synced: 14 May 2025

https://github.com/praat/praat.github.io

Praat: Doing Phonetics By Computer

acoustics phonetics speech speech-analysis

Last synced: 25 Apr 2026

https://github.com/praat/praat

Praat: Doing Phonetics By Computer

acoustics phonetics speech speech-analysis

Last synced: 14 May 2025

https://github.com/mmorise/world

A high-quality speech analysis, manipulation and synthesis system

speech-analysis speech-synthesis vocoder

Last synced: 14 May 2025

https://github.com/mmorise/World

A high-quality speech analysis, manipulation and synthesis system

speech-analysis speech-synthesis vocoder

Last synced: 04 May 2025

https://github.com/haoheliu/voicefixer

General Speech Restoration

declipping denoise dereverberation mel speech speech-analysis speech-enhancement speech-processing speech-synthesis super-resolution tts vocoder

Last synced: 14 May 2025

https://github.com/dmitryryumin/interspeech-2023-24-papers

INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!

acoustic adaptation asr audio-signals interspeech interspeech2023 interspeech2024 language-modeling lexical-analysis linguistic-analysis machine-translation prosody self-supervised-learning signal-processing speech-analysis speech-production speech-recognition speech-synthesis speech-technology transmission

Last synced: 24 Jan 2026

https://github.com/gemengtju/Tutorial_Separation

This repo summarizes the tutorials, datasets, papers, codes and tools for speech separation and speaker extraction task. You are kindly invited to pull requests.

deep-learning deep-neural-networks signal-processing speech-analysis speech-processing speech-separation

Last synced: 01 Apr 2025

https://github.com/jcvasquezc/DisVoice

feature extraction from speech signals

articulation pathological-speech phonation prosody signal-processing speech-analysis

Last synced: 17 Mar 2025

https://github.com/speechbrain/speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

beamforming deep-learning deeplearning librispeech neural-network neural-networks speaker-identification speaker-recognition speaker-verification speech speech-analysis speech-api speech-emotion-recognition speech-processing speech-recognition speech-recognizer speech-separation speech-to-text speechrecognition timit

Last synced: 29 Jan 2026

https://github.com/haoheliu/voicefixer_main

General Speech Restoration

machine-learning speech speech-analysis speech-enhancement speech-processing speech-synthesis speech-to-text tts

Last synced: 06 Apr 2025

https://github.com/shahabks/myprosody

A Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) compared to ones of native speech.

acoustic-features acoustic-model phonemes prosody python-library speech-analysis speech-patterns voice-recognition

Last synced: 03 Jul 2025

https://github.com/philipperemy/tensorflow-ctc-speech-recognition

Application of Connectionist Temporal Classification (CTC) for Speech Recognition (Tensorflow 1.0 but compatible with 2.0).

ctc ctc-loss deep-learning machine-learning speech-analysis speech-recognition speech-to-text tensorflow tensorflow-1-0 tutorial

Last synced: 02 May 2025

https://github.com/at16k/at16k

Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.

asr asr-model automatic-speech-recognition pretrained-models speech-analysis speech-api speech-recognition speech-recognizer speech-to-text voice-commands voice-recognition

Last synced: 13 Jul 2025

https://github.com/JusperLee/Calculate-SNR-SDR

Script to calculate SNR and SDR using python

sdr speech-analysis speech-separation

Last synced: 01 Apr 2025

https://github.com/google/localized-narratives

Localized Narratives

computer-vision image-captioning speech-analysis

Last synced: 13 Apr 2026

https://google.github.io/localized-narratives/

Localized Narratives

computer-vision image-captioning speech-analysis

Last synced: 16 Mar 2025

https://github.com/hyeonsangjeon/computing-korean-stt-error-rates

STT 한글 문장 인식기 출력 스크립트의 외자 오류율(CER), 단어 오류율(WER)을 계산하는 Python 함수 패키지

amazon aws cer character-error-rate computing-error-rates evaluate evaluation-functions evaluation-metrics korean normalization speech-analysis speech-recognition speech-to-text test text-digitisation text-evaluation transcribe wer word-error-rate

Last synced: 22 Apr 2025

https://github.com/lennes/spect

SpeCT - Speech Corpus Toolkit for Praat. Documentation: https://lennes.github.io/spect/

analysis annotation conversational-speech corpus-linguistics corpus-tools praat spect speech speech-analysis speech-corpus spoken-language transcript transcription

Last synced: 03 Apr 2025

https://github.com/montrealcorpustools/polyglotdb

Language data store and linguistic query API

acoustics database influxdb neo4j rest-api speech-analysis speech-processing

Last synced: 06 Apr 2025

https://github.com/tabahi/webspeechanalyzer

JS speech analyzer for fast speech analysis and labeling

audio-analysis audio-processing feature feature-engineering feature-extraction formant-detection music music-information-retrieval music-visualizer phonemes signal-processing spectrum spectrum-analyzer speech speech-analysis speech-processing speech-recognition

Last synced: 11 Mar 2026

https://github.com/ringabout/scim

[wip]Speech recognition tool-box written by Nim. Based on Arraymancer.

arraymancer audio digital-signal-processing mfcc nim scientific-computing speech-analysis speech-processing speech-recognition wav

Last synced: 18 Mar 2025

https://github.com/virajbhutada/speech-emotion-recognition

This repository houses a robust speech emotion recognition system, featuring signal processing scripts, machine learning algorithms, and comprehensive documentation. It accurately classifies emotions in spoken language, enabling applications like sentiment analysis and emotion-aware systems.

audio-processing emotion-recognition machine-learning natural-language-processing python signal-processing speech-analysis speech-recognition

Last synced: 28 Apr 2025

https://github.com/zoobereq/emotional_speech

A script extracting features of emotionally charged speech

parselmouth praat speech speech-analysis speech-emotion-recognition speech-processing

Last synced: 24 Aug 2025

https://github.com/hlorenzi/vowel-analysis

Vowel formant frequency synthesis and analysis on the browser -- https://hlorenzi.github.io/vowel-analysis/

formant-detection frequency-analysis international-phonetic-alphabet ipa speech-analysis speech-recognition speech-synthesis vowel vowel-chart vowel-formants vowel-recognition vowels web-application webapp

Last synced: 10 Jan 2026

https://github.com/jjwroeloffs/dynamicfluency-core

The base python package for DynamicFluency: Monitor and understand the dynamicity of linguistic aspects in (L2) speech

nlp praat praatscript python speech-analysis

Last synced: 15 May 2026

https://github.com/jatin-8898/native-speech-recognition

A native Speech Recognition without using any api made using JS :speech_balloon:

html js json json-api native native-development native-speech-recognition speech speech-analysis speech-balloon speech-recognition speech-synthesis speech-to-text

Last synced: 16 Apr 2026

https://github.com/kasumikitsune/phontracer

一款基于 Praat (Parselmouth) 的高效语音声调（基频）特征批量提取工具。支持长音频自动切分、独立音频匹配、可视化边界微调及标准化数据导出。

acoustic-phonetics dialectology gui linguistics parselmouth phonetics praat python speech-analysis tone-extraction

Last synced: 31 May 2026

https://github.com/jjwroeloffs/dynamicfluency

DynamicFluency - Monitor and understand the dynamicity of linguistic aspects in (L2) speech

nlp praat praatscript speech-analysis

Last synced: 17 Feb 2026

https://github.com/rezadrian01/Simakin

An AI-powered Qur’an recitation companion that listens to users’ recitations, transcribes them, and provides feedback after processing. Built with Remix, Prisma, MySQL, and Gemini API.

ai fullstack gemini islamic-app memorize mysql prisma quran react-router recitation remix speech-analysis

Last synced: 11 Jun 2026

https://github.com/bakrawy2025/emotion-sentiment-classifier

Emotion & sentiment classifier in Python using TF-IDF + Logistic Regression (scikit-learn). Includes joblib model saving, evaluation and CLI prediction. 🐱💻

audio-processing confusion-matrix data-science emotion-detection-emotion-classification emotion-recognition interactive-prediction jupyter-notebook lstm-sentiment-analysis machine-learning naive-bayes-classifier nltk recurrent-neural-networks scikit-learn social speech-analysis tensorflow text-classification tweets

Last synced: 19 Aug 2025

https://github.com/axlerquiza/mental-state-recognizer

A web-based/gui-based mental state recognizer that analyzes audio recordings and predicts the user's mental state using ML models trained on speech features like MFCC.

audio-processing deep-learning machine-learning mental-health python speech-analysis

Last synced: 05 May 2025

https://github.com/meghaarajeev/emosense-emotionanalysis-machine-learning

👩🏿‍💻IIIT Hyderabad Reasearch Teaser Programme : We developed a robust emotion😃 recognition system utilizing machine learning techniques on the 🗣️CREMA-D dataset to classify various emotions expressed in audio recordings🎙️ accurately.

crema-d dataset emosense emotion-recognition emotion-speech iiit-hyderabad internship miniproject research-teaser speech-analysis speech-recognition

Last synced: 02 Apr 2025

https://github.com/abhinavbammidi1401/speech_processing

speech-analysis speech-processing speech-recognition speech-synthesis

Last synced: 15 Mar 2025

https://github.com/deliprofesor/behavioral-insights-and-data-exploration

This project analyzes Spanish speech data, focusing on acoustic features and demographics. It includes data cleaning, outlier detection, clustering, and time series modeling (ARIMA, Holt-Winters) to uncover patterns in speech duration and word frequency.

acoustic-features arima clustering data-analysis holt-winters k-means machine-learning speech-analysis time-series-analysis

Last synced: 10 Apr 2025

https://github.com/manojc/speech-recognition

poc for speech recognition using annyang speech recognition library.

angular angular2 css html5 speech-analysis speech-processing speech-recognition speech-recognizer speech-to-text speechtotext typescript

Last synced: 07 Oct 2025