An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with speech-analysis

A curated list of projects in awesome lists tagged with speech-analysis .

https://github.com/jianchang512/clone-voice

A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具,使用你的音色或任意声音来录制音频

clonevoice speech-analysis sts tts voice-assistant

Last synced: 14 May 2025

https://github.com/praat/praat.github.io

Praat: Doing Phonetics By Computer

acoustics phonetics speech speech-analysis

Last synced: 25 Apr 2026

https://github.com/praat/praat

Praat: Doing Phonetics By Computer

acoustics phonetics speech speech-analysis

Last synced: 14 May 2025

https://github.com/mmorise/world

A high-quality speech analysis, manipulation and synthesis system

speech-analysis speech-synthesis vocoder

Last synced: 14 May 2025

https://github.com/mmorise/World

A high-quality speech analysis, manipulation and synthesis system

speech-analysis speech-synthesis vocoder

Last synced: 04 May 2025

https://github.com/dmitryryumin/interspeech-2023-24-papers

INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!

acoustic adaptation asr audio-signals interspeech interspeech2023 interspeech2024 language-modeling lexical-analysis linguistic-analysis machine-translation prosody self-supervised-learning signal-processing speech-analysis speech-production speech-recognition speech-synthesis speech-technology transmission

Last synced: 24 Jan 2026

https://github.com/gemengtju/Tutorial_Separation

This repo summarizes the tutorials, datasets, papers, codes and tools for speech separation and speaker extraction task. You are kindly invited to pull requests.

deep-learning deep-neural-networks signal-processing speech-analysis speech-processing speech-separation

Last synced: 01 Apr 2025

https://github.com/speechbrain/speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

beamforming deep-learning deeplearning librispeech neural-network neural-networks speaker-identification speaker-recognition speaker-verification speech speech-analysis speech-api speech-emotion-recognition speech-processing speech-recognition speech-recognizer speech-separation speech-to-text speechrecognition timit

Last synced: 29 Jan 2026

https://github.com/shahabks/myprosody

A Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) compared to ones of native speech.

acoustic-features acoustic-model phonemes prosody python-library speech-analysis speech-patterns voice-recognition

Last synced: 03 Jul 2025

https://github.com/philipperemy/tensorflow-ctc-speech-recognition

Application of Connectionist Temporal Classification (CTC) for Speech Recognition (Tensorflow 1.0 but compatible with 2.0).

ctc ctc-loss deep-learning machine-learning speech-analysis speech-recognition speech-to-text tensorflow tensorflow-1-0 tutorial

Last synced: 02 May 2025

https://github.com/at16k/at16k

Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.

asr asr-model automatic-speech-recognition pretrained-models speech-analysis speech-api speech-recognition speech-recognizer speech-to-text voice-commands voice-recognition

Last synced: 13 Jul 2025

https://github.com/JusperLee/Calculate-SNR-SDR

Script to calculate SNR and SDR using python

sdr speech-analysis speech-separation

Last synced: 01 Apr 2025

https://github.com/lennes/spect

SpeCT - Speech Corpus Toolkit for Praat. Documentation: https://lennes.github.io/spect/

analysis annotation conversational-speech corpus-linguistics corpus-tools praat spect speech speech-analysis speech-corpus spoken-language transcript transcription

Last synced: 03 Apr 2025

https://github.com/ringabout/scim

[wip]Speech recognition tool-box written by Nim. Based on Arraymancer.

arraymancer audio digital-signal-processing mfcc nim scientific-computing speech-analysis speech-processing speech-recognition wav

Last synced: 18 Mar 2025

https://github.com/virajbhutada/speech-emotion-recognition

This repository houses a robust speech emotion recognition system, featuring signal processing scripts, machine learning algorithms, and comprehensive documentation. It accurately classifies emotions in spoken language, enabling applications like sentiment analysis and emotion-aware systems.

audio-processing emotion-recognition machine-learning natural-language-processing python signal-processing speech-analysis speech-recognition

Last synced: 28 Apr 2025

https://github.com/zoobereq/emotional_speech

A script extracting features of emotionally charged speech

parselmouth praat speech speech-analysis speech-emotion-recognition speech-processing

Last synced: 24 Aug 2025

https://github.com/jjwroeloffs/dynamicfluency-core

The base python package for DynamicFluency: Monitor and understand the dynamicity of linguistic aspects in (L2) speech

nlp praat praatscript python speech-analysis

Last synced: 15 May 2026

https://github.com/kasumikitsune/phontracer

一款基于 Praat (Parselmouth) 的高效语音声调(基频)特征批量提取工具。支持长音频自动切分、独立音频匹配、可视化边界微调及标准化数据导出。

acoustic-phonetics dialectology gui linguistics parselmouth phonetics praat python speech-analysis tone-extraction

Last synced: 31 May 2026

https://github.com/jjwroeloffs/dynamicfluency

DynamicFluency - Monitor and understand the dynamicity of linguistic aspects in (L2) speech

nlp praat praatscript speech-analysis

Last synced: 17 Feb 2026

https://github.com/rezadrian01/Simakin

An AI-powered Qur’an recitation companion that listens to users’ recitations, transcribes them, and provides feedback after processing. Built with Remix, Prisma, MySQL, and Gemini API.

ai fullstack gemini islamic-app memorize mysql prisma quran react-router recitation remix speech-analysis

Last synced: 11 Jun 2026

https://github.com/axlerquiza/mental-state-recognizer

A web-based/gui-based mental state recognizer that analyzes audio recordings and predicts the user's mental state using ML models trained on speech features like MFCC.

audio-processing deep-learning machine-learning mental-health python speech-analysis

Last synced: 05 May 2025

https://github.com/meghaarajeev/emosense-emotionanalysis-machine-learning

👩🏿‍💻IIIT Hyderabad Reasearch Teaser Programme : We developed a robust emotion😃 recognition system utilizing machine learning techniques on the 🗣️CREMA-D dataset to classify various emotions expressed in audio recordings🎙️ accurately.

crema-d dataset emosense emotion-recognition emotion-speech iiit-hyderabad internship miniproject research-teaser speech-analysis speech-recognition

Last synced: 02 Apr 2025

https://github.com/deliprofesor/behavioral-insights-and-data-exploration

This project analyzes Spanish speech data, focusing on acoustic features and demographics. It includes data cleaning, outlier detection, clustering, and time series modeling (ARIMA, Holt-Winters) to uncover patterns in speech duration and word frequency.

acoustic-features arima clustering data-analysis holt-winters k-means machine-learning speech-analysis time-series-analysis

Last synced: 10 Apr 2025