An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with mfcc

A curated list of projects in awesome lists tagged with mfcc.

https://github.com/aubio/aubio

a library for audio and music analysis

analysis annotation audio beat c extraction mfcc music onset pitch python sound tempo-tracking

Last synced: 14 May 2025

https://github.com/csukuangfj/kaldifeat

Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd. Provides C++ & Python APIs.

cpp fbank features-extraction kaldi mfcc online-feature-extractor plp python pytorch streaming-feature-extractor

Last synced: 08 Apr 2025

https://github.com/zhuozhuocrayon/acoustickeyboard-web

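❓Acoustic Keyboard | A wild idea: build a "toy" that can recognize which key was pressed from the sound of the keystroke, as a way to learn signal processing / deep learning / Android / Django.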

deep-learning django lstm mfcc tensorflow

Last synced: 23 Jul 2025

https://github.com/mycroftai/sonopy

A simple audio feature extraction library

audio-processing library mel-spectrogram mfcc sound spectrogram

Last synced: 11 Jul 2025

https://github.com/stefantaubert/mel-cepstral-distance

A Python library for computing the Mel-Cepstral Distance (Mel-Cepstral Distortion, MCD) between two inputs. This implementation is based on the method proposed by Robert F. Kubichek in "Mel-Cepstral Distance Measure for Objective Speech Quality Assessment".

cepstral distance distortion divergence dtw dynamic-time-warping language linguistics mcd mel mfcc objective-evaluation spectrogram spectrum speech-quality speech-synthesis text-to-speech tts voice-cloning

Last synced: 28 Jul 2025

https://github.com/k-farruh/speech-accent-detection

People speak a language with an accent, and a particular accent reflects the speaker's linguistic background. The model identifies the accent from an audio recording. Its output could be used to determine accents and to help students learning English reduce or improve their accents through training.

accent accent-detection english-languages mfcc native-speakers

Last synced: 08 Sep 2025

https://github.com/fragit/fragit-main

FragIt main repository

fragments mfcc molecule python

Last synced: 13 Apr 2025

https://github.com/ringabout/scim

[WIP] Speech recognition toolbox written in Nim. Based on Arraymancer.

arraymancer audio digital-signal-processing mfcc nim scientific-computing speech-analysis speech-processing speech-recognition wav

Last synced: 18 Mar 2025

https://github.com/dhruvesh13/audio-genre-classification

Automatic music genre classification using machine learning algorithms such as Logistic Regression and K-Nearest Neighbours

audio-gene logistic-regression machine-learning mfcc python-2

Last synced: 22 Aug 2025

https://github.com/shoyamanishi/androidmfcc

26-Point MFCC & 512-Point FFT Generator & Visualizer in Java, C++, and NEON intrinsics

android asr audio cpp fft java mfcc neon-simd-intrinsics spectrum visualizer

Last synced: 14 Apr 2025

https://github.com/orbxball/timit-preprocessor

Extract MFCC vectors and phones from the TIMIT dataset

data-preprocessing deep-learning mfcc phone speech-recognition timit timit-dataset

Last synced: 18 Mar 2025

https://github.com/8g6-new/cara

A high-performance spectrogram with STFT, Mel, and MFCC support in pure C

audio-visualizer bark bfcc c dsp mel mfcc spectrogram stft

Last synced: 02 Aug 2025

https://github.com/ragibson/mfcc-speech-recognition

Real-time speech recognition via "Mel-Frequency Cepstral Coefficients" neural networks.

machine-learning mfcc neural-network pytorch real-time speech-recognition

Last synced: 05 Jul 2025

https://github.com/dnvt/burn-speech-training

End-to-end speech model training pipeline built on Burn — MFCC features, CTC loss, LibriSpeech loader, SpeechOcean762 evaluation

burn ctc machine-learning mfcc pronunciation rust speech training-pipeline

Last synced: 30 May 2026

https://github.com/oowais/muses

Audio Comparison system for comparing mp3/wav audio using mfcc, rhythm and other features

audio audio-analysis audio-processing feature-extraction mfcc music-information-retrieval python rhythm

Last synced: 30 Apr 2025

https://github.com/javierantoran/moby_dick_whale_audio_detection

Feature extraction, HMMs, Neural Nets, and Boosting for Kaggle Cornell Whale detection challenge.

cross-validation extract-features feature-extraction gradient-boosting hmm mfcc neural-network spectrogram whale

Last synced: 04 Apr 2025

https://github.com/brucewlee/lama-music-genre-dataset

.wav files, training dataset (MFCC), and graph plots (FFTs, MFCCs, waveforms) from Latin America, Asia, the Middle East, and Africa

africa asia audio-processing classification dataset genre genre-classification genre-suggestion genres-classification harvard-dataverse lama mfcc music music-library signal-processing sound

Last synced: 19 Oct 2025

https://github.com/certainlywrong/mfcc_bee

Implementation of the feature extraction algorithm in Dart.

dart mfcc mfcc-extractor

Last synced: 26 Jan 2026

https://github.com/linto-ai/gpu-ne10-mfcc

Some work on accelerating MFCC feature extraction with NEON NE10 and VideoCore IV on Raspberry Pi

arm mfcc raspberry-pi

Last synced: 16 May 2026

https://github.com/parvatijay2901/footstep-voice-identification

MiiCare (Technical test): Detect the footstep

asr footstep gmm mfcc voice-recognition

Last synced: 10 Oct 2025

https://github.com/linto-ai/sfeatpy

Library to extract MFCC features from audio signal

feature-extraction mfcc mfcc-features python3 speech-processing

Last synced: 07 Nov 2025

https://github.com/ankurs287/a-general-purpose-audio-tagging-system

A general purpose audio tagging system that classifies a wide range of sounds (ranging from car horns to strumming of guitar) that we hear on a daily basis.

deep-learning deep-neural-networks mfcc svm transfer-learning

Last synced: 12 Sep 2025

https://github.com/mathquis/node-gist

Node binding for the Gist Audio Analysis Library

analysis audio fft gist mfcc pitch spectrum

Last synced: 05 May 2025

https://github.com/kennykarnama/go-mfcc

MFCC Implementation in Go

dsp golang mfcc signal-processing

Last synced: 06 Oct 2025

https://github.com/Mike014/Audio-Classification

This is a prototype Django application that allows users to upload audio files and classify them using machine learning techniques.

ai audio django django-application machine-learning mfcc pca python

Last synced: 26 Oct 2025

https://github.com/nazir20/gender-classification-using-mfcc-with-lstm-model

Gender classification from audio signals is an important task with various applications, such as speech recognition systems, voice assistants, and speaker verification. This project aims to demonstrate how to use Mel Frequency Cepstral Coefficients as features and a Long Short-Term Memory (LSTM) neural network for gender classification.

audio-processing deep-learning deep-neural-networks gender-classification lstm lstm-neural-networks mfcc mfcc-extractor neural-networks

Last synced: 22 Mar 2025

https://github.com/ozymandiasthegreat/wakeword-zero

Personal wake word detector, ported to TypeScript/WASM

dtw fft hotword mfcc nativescript node wakeword wasm web

Last synced: 29 Sep 2025

https://github.com/nannigalaxy/audio-preprocessing-tool

Audio preprocessing tool for signal processing and machine learning applications.

audio-processing augmentation machine-learning mfcc signal-processing

Last synced: 22 Jun 2025

https://github.com/hamedzarei/ms-speechprocessing-hw03

HW03 of Speech Processing

htk lpc mfcc speech-processing

Last synced: 05 Sep 2025

https://github.com/kennykarnama/mfcc

MFCC Feature Extraction In Matlab

dsp matlab mfcc

Last synced: 07 Oct 2025

https://github.com/loharmurtaza/fog_detection_subject_dependent

This repository is based on my research work "Detecting Freezing of Gait in Parkinson's Disease Patients Using Multi-Modal Machine Learning"

accelerometer detection eeg emg f1-score freezing-of-gait gyroscope machine-learning mfcc multi-modal-learning rf sensitivity skin-conductance specificity svm

Last synced: 20 Jan 2026

https://github.com/jubinjacob03/genre-classification-recommendation_spotify

Project for classifying audio files into different genres using the K-Nearest Neighbors (KNN) algorithm.

knn-classification mfcc python streamlit

Last synced: 15 May 2026

https://github.com/payalmh5/emotionrecognition

Emotion Recognition in Speech: This project leverages advanced machine learning techniques to classify emotions from speech using the Toronto Emotional Speech Set (TESS). By extracting Mel-Frequency Cepstral Coefficients (MFCC) and utilizing an LSTM-based deep learning model, the project accurately identifies emotions like anger, happiness, and sad

data-science deep-learning emotion-recognition keras lstm mfcc model-development model-training

Last synced: 30 Sep 2025

https://github.com/rusenaite/speaker-identification-using-ml

A speaker recognition system using machine learning (SVM) with MFCC, chroma, and tonnetz features extracted from short audio clips.

chroma mfcc ml python svm tonnetz

Last synced: 17 Apr 2026

https://github.com/anishagg17/voice_to_gender_classifier

Identify a voice as male or female, based upon acoustic properties of the voice and speech extracted by processing audio.

gmm librosa mfcc scipy speech-processing

Last synced: 30 Apr 2026

https://github.com/deepthipathlawath20/emotion-recognition-bimodal

Bimodal emotion recognition (face + speech) with feature-level fusion and classic ML classifiers.

audio computer-vision emotion-recognition knn mfcc multimodal navie-bayes-algorithm python scikit-learn svm tensorflow

Last synced: 01 May 2026

https://github.com/dhanushi2620/aquasignature

Deep learning model using CRNN and MFCC features to classify underwater sounds and detect foreign threats based on acoustic frequency shifts.

acoustic-signature ai-for-defense anomaly-detection deep-learning-models keras librosa mfcc spectrogram tensorflow

Last synced: 09 May 2026
