Projects in Awesome Lists tagged with mfcc
A curated list of projects in awesome lists tagged with mfcc.
https://github.com/ddbourgin/numpy-ml
Machine learning, in numpy
attention bayesian-inference gaussian-mixture-models gaussian-processes good-turing-smoothing gradient-boosting hidden-markov-models knn lstm machine-learning mfcc neural-networks reinforcement-learning resnet topic-modeling vae wavenet wgan-gp word2vec
Last synced: 12 May 2025
https://github.com/aubio/aubio
a library for audio and music analysis
analysis annotation audio beat c extraction mfcc music onset pitch python sound tempo-tracking
Last synced: 14 May 2025
https://github.com/libAudioFlux/audioFlux
A library for audio and music analysis, feature extraction.
audio audio-analysis audio-features audio-processing deep-learning machine-learning mfcc mir music music-analysis music-information-retrieval pitch python signal-processing spectral-analysis spectrogram time-frequency-analysis wavelet-analysis wavelet-transform
Last synced: 13 Mar 2025
https://github.com/x4nth055/emotion-recognition-using-speech
Building and training a speech emotion recognizer that predicts human emotions using Python, scikit-learn, and Keras
deep-learning emotion-detection emotion-recognition emotion-recognizer feature-extraction gradient-boosting keras kneighborsclassifier librosa machine-learning mfcc mlp-classifier neural-networks random-forest-classifier recurrent-neural-networks sklearn speech-emotion-recognition support-vector-machine
Last synced: 04 Apr 2025
https://github.com/ar1st0crat/nwaves
.NET DSP library with a lot of audio processing functions
adaptive-filtering audio dsp fda feature-extraction filtering lpc mfcc mir noise pitch psychoacoustics resampling signal sound-effects sound-synthesis time-stretch wav wavelets
Last synced: 15 May 2025
https://superkogito.github.io/spafe/
spafe: Simplified Python Audio Features Extraction
audio audio-analysis beat dsp features-extraction filterbank frequencies frequency frequency-analysis gammatone-filterbanks mfcc music music-information-retrieval pitch python signal-processing sound speech-processing time-frequency-analysis voice
Last synced: 24 May 2026
https://github.com/superkogito/spafe
spafe: Simplified Python Audio Features Extraction
audio audio-analysis beat dsp features-extraction filterbank frequencies frequency frequency-analysis gammatone-filterbanks mfcc music music-information-retrieval pitch python signal-processing sound speech-processing time-frequency-analysis voice
Last synced: 14 May 2025
https://github.com/adamstark/Gist
A C++ Library for Audio Analysis
audio audio-analysis c-plus-plus fft gist mfcc mir music music-information-retrieval onset-detection pitch-tracking spectral-analysis
Last synced: 16 Mar 2025
https://github.com/sp-nitech/sptk
A suite of speech signal processing tools
audio-processing cepstrum cpp dsp lpc lsp mfcc signal-processing speech speech-processing sptk unix-command
Last synced: 24 Dec 2025
https://github.com/jsingh811/pyAudioProcessing
Audio feature extraction and classification
audio-data audio-files chroma-features classifier classifier-options classify classify-audio classify-audio-samples feature-extraction gfcc gfcc-extractor gfcc-features hyperparameter-tuning mfcc mfcc-extractor mfcc-features pyaudioprocessing spectral-features wav-files
Last synced: 15 Mar 2025
https://github.com/csukuangfj/kaldifeat
Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - Provide C++ & Python API
cpp fbank features-extraction kaldi mfcc online-feature-extractor plp python pytorch streaming-feature-extractor
Last synced: 08 Apr 2025
https://github.com/superkogito/voice-based-gender-recognition
Voice-based gender recognition using Mel-frequency cepstral coefficients (MFCC) and Gaussian mixture models (GMM)
data-science gaussian-mixture-models gender gender-classification gender-detection gender-recognition gender-recognition-by-voice gmm machine-learning mel-frequencies mfcc scikit-learn scikit-learn-python signal speaker speech vocal voice
Last synced: 04 Apr 2025
https://github.com/tympanix/subsync
Synchronize your subtitles using machine learning
delay fix machine-learning mfcc neural-network shift shift-subtitle speech-detection subsync subtitle subtitles
Last synced: 09 Apr 2025
https://github.com/suyashmore/mevonai-speech-emotion-recognition
Identify the emotion of multiple speakers in an Audio Segment
artificial-intelligence colab-notebook convolutional-neural-networks deep-learning diarization emotion-analysis emotion-recognition keras-tensorflow machine-learning mfcc mfcc-analysis speech-processing uis-rnn
Last synced: 18 Oct 2025
https://github.com/zhuozhuocrayon/acoustickeyboard-web
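Acoustic keyboard | A wild idea: build a "toy" that can tell which key was pressed from the sound of the keystroke, while learning signal processing / deep learning / Android / Django.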
deep-learning django lstm mfcc tensorflow
Last synced: 23 Jul 2025
https://github.com/mycroftai/sonopy
A simple audio feature extraction library
audio-processing library mel-spectrogram mfcc sound spectrogram
Last synced: 11 Jul 2025
https://github.com/mathquis/node-personal-wakeword
Personal wake word detector
dtw hotword-detection hotword-detector mfcc node wakeword
Last synced: 24 Oct 2025
https://github.com/stefantaubert/mel-cepstral-distance
A Python library for computing the Mel-Cepstral Distance (Mel-Cepstral Distortion, MCD) between two inputs. This implementation is based on the method proposed by Robert F. Kubichek in "Mel-Cepstral Distance Measure for Objective Speech Quality Assessment".
cepstral distance distortion divergence dtw dynamic-time-warping language linguistics mcd mel mfcc objective-evaluation spectrogram spectrum speech-quality speech-synthesis text-to-speech tts voice-cloning
Last synced: 28 Jul 2025
https://github.com/superkogito/voice-based-speaker-identification
Speaker identification using voice MFCCs and GMM
gaussian-mixture-models gmm machine-learning mel-frequencies mel-frequency-cepstral-coefficients mfcc scikit-learn scikit-learn-python signal speaker-identification speaker-recognition speech vocal voice
Last synced: 06 Sep 2025
https://github.com/aubio/vamp-aubio-plugins
aubio plugins for Vamp
analysis aubio audio beat beat-detection beat-tracking mfcc music music-information-retrieval onset onset-detection tempo tempo-detection tempo-tracking vamp-plugins
Last synced: 13 Apr 2025
https://github.com/k-farruh/speech-accent-detection
People speak a language with an accent, and a particular accent reflects the speaker's linguistic background. The model identifies the accent from an audio recording. Its output could be used to determine accents and to help English learners reduce or improve their accents through training.
accent accent-detection english-languages mfcc native-speakers
Last synced: 08 Sep 2025
https://github.com/fragit/fragit-main
FragIt main repository
fragments mfcc molecule python
Last synced: 13 Apr 2025
https://github.com/dataxujing/asr-paper
ASR tutorial: https://dataxujing.github.io/ASR-paper/
asr citrinet conformer contextnet ctc dnn-hmm fbank gmm-hmm jasper las mfcc mocha neural-transducer quartznet rnn-t speech-transformer squeezeformer tandem transformer-transducer wfst
Last synced: 08 Oct 2025
https://github.com/ringabout/scim
[WIP] Speech recognition toolbox written in Nim. Based on Arraymancer.
arraymancer audio digital-signal-processing mfcc nim scientific-computing speech-analysis speech-processing speech-recognition wav
Last synced: 18 Mar 2025
https://github.com/ihabbendidi/voice-authentification-api
A RESTful API implementation of an authentication system using voice fingerprints
api authentication encoding flask gmm machine-learning mfcc mfcc-analysis mfcc-extractor mfcc-features security server voice voice-recognition
Last synced: 16 Aug 2025
https://github.com/dhruvesh13/audio-genre-classification
Automatic music genre classification using machine learning algorithms such as logistic regression and k-nearest neighbours
audio-gene logistic-regression machine-learning mfcc python-2
Last synced: 22 Aug 2025
https://github.com/shoyamanishi/androidmfcc
26-Point MFCC & 512-Point FFT Generator & Visualizer in Java, C++, and NEON intrinsics
android asr audio cpp fft java mfcc neon-simd-intrinsics spectrum visualizer
Last synced: 14 Apr 2025
https://github.com/orbxball/timit-preprocessor
Extract MFCC vectors and phones from the TIMIT dataset
data-preprocessing deep-learning mfcc phone speech-recognition timit timit-dataset
Last synced: 18 Mar 2025
https://github.com/javierantoran/tiger-costume-voice-conversion
Voice Alignment and Conversion with Neural Networks and the WORLD codec.
alignment dwt dynamic-time-warping mfcc mlpg neural-network speaker speech sptk trajectory-generation voice voice-alignment voice-conversion voice-generation
Last synced: 25 Oct 2025
https://github.com/baggepinnen/lpvspectral.jl
Least-squares (sparse) spectral estimation and (sparse) LPV spectral decomposition.
frequencies least-squares lomb-scargle-periodogram lpv mel-spectrogram mfcc periodogram power-spectral-density spectrogram spectrum spectrum-analyzer spectrum-identification system-identification time-series-analysis
Last synced: 03 Jan 2026
https://github.com/lucko515/speech-commands-recognition
Recognizing common speech commands using Keras and Tensorflow.
convolutional-neural-networks gru keras lstm mfcc recurrent-neural-networks speech-recognition speech-to-text tensorflow
Last synced: 03 Mar 2026
https://github.com/8g6-new/cara
A high-performance spectrogram with STFT, Mel, and MFCC support in pure C
audio-visualizer bark bfcc c dsp mel mfcc spectrogram stft
Last synced: 02 Aug 2025
https://github.com/ragibson/mfcc-speech-recognition
Real-time speech recognition via "Mel-Frequency Cepstral Coefficients" neural networks.
machine-learning mfcc neural-network pytorch real-time speech-recognition
Last synced: 05 Jul 2025
https://github.com/dnvt/burn-speech-training
End-to-end speech model training pipeline built on Burn — MFCC features, CTC loss, LibriSpeech loader, SpeechOcean762 evaluation
burn ctc machine-learning mfcc pronunciation rust speech training-pipeline
Last synced: 30 May 2026
https://github.com/oowais/muses
Audio comparison system for comparing mp3/wav audio using MFCC, rhythm, and other features
audio audio-analysis audio-processing feature-extraction mfcc music-information-retrieval python rhythm
Last synced: 30 Apr 2025
https://github.com/javierantoran/moby_dick_whale_audio_detection
Feature extraction, HMMs, Neural Nets, and Boosting for Kaggle Cornell Whale detection challenge.
cross-validation extract-features feature-extraction gradient-boosting hmm mfcc neural-network spectrogram whale
Last synced: 04 Apr 2025
https://github.com/brucewlee/lama-music-genre-dataset
.wav files, training dataset (MFCC), and graph plots (FFTs, MFCCs, waveforms) from Latin America, Asia, the Middle East, and Africa
africa asia audio-processing classification dataset genre genre-classification genre-suggestion genres-classification harvard-dataverse lama mfcc music music-library signal-processing sound
Last synced: 19 Oct 2025
https://github.com/certainlywrong/mfcc_bee
Implementation of the feature extraction algorithm in Dart.
Last synced: 26 Jan 2026
https://github.com/linto-ai/gpu-ne10-mfcc
Work on accelerating MFCC feature extraction with NEON NE10 and VideoCore IV on Raspberry Pi
Last synced: 16 May 2026
https://github.com/pprattis/automatic-speech-recognision-system-asr
A Python script that implements an automatic speech recognition system.
asr automatic-speech-recognition computer-science dtw dynamic-time-warping fir-filter librosa mel-frequency-cepstral-coefficients mfcc nyquist program python short-time-fourier-transform short-time-signal-analysis signal signal-processing student
Last synced: 07 Sep 2025
https://github.com/parvatijay2901/footstep-voice-identification
MiiCare (Technical test): Detect the footstep
asr footstep gmm mfcc voice-recognition
Last synced: 10 Oct 2025
https://github.com/reshalfahsi/music-genre-classification
Music Genre Classification using MFCC + ANN
audio audio-analysis audio-processing cmvn gtzan-dataset mfcc mfcc-features music-genre-classification pytorch pytorch-lightning signal-processing stft
Last synced: 28 Apr 2026
https://github.com/linto-ai/sfeatpy
Library to extract MFCC features from audio signal
feature-extraction mfcc mfcc-features python3 speech-processing
Last synced: 07 Nov 2025
https://github.com/ankurs287/a-general-purpose-audio-tagging-system
A general purpose audio tagging system that classifies a wide range of sounds (ranging from car horns to strumming of guitar) that we hear on a daily basis.
deep-learning deep-neural-networks mfcc svm transfer-learning
Last synced: 12 Sep 2025
https://github.com/kennykarnama/go-mfcc
MFCC implementation in Go
dsp golang mfcc signal-processing
Last synced: 06 Oct 2025
https://github.com/Mike014/Audio-Classification
This is a prototype Django application that allows users to upload audio files and classify them using machine learning techniques.
ai audio django django-application machine-learning mfcc pca python
Last synced: 26 Oct 2025
https://github.com/nazir20/gender-classification-using-mfcc-with-lstm-model
Gender classification from audio signals is an important task with various applications, such as speech recognition systems, voice assistants, and speaker verification. This project aims to demonstrate how to use Mel Frequency Cepstral Coefficients as features and a Long Short-Term Memory (LSTM) neural network for gender classification.
audio-processing deep-learning deep-neural-networks gender-classification lstm lstm-neural-networks mfcc mfcc-extractor neural-networks
Last synced: 22 Mar 2025
https://github.com/ozymandiasthegreat/wakeword-zero
Personal wake word detector, ported to TypeScript/WASM
dtw fft hotword mfcc nativescript node wakeword wasm web
Last synced: 29 Sep 2025
https://github.com/hallowshaw/speech-emotion-recognition-with-mfcc
A project to classify emotions like happiness, sadness, and anger from speech using MFCCs, machine learning models, and visualizations for audio features and model performance.
crema-d kaggle-dataset librosa lstm matplotlib mel-frequency-cepstral-coefficient mfcc mfcc-algorithm python ravdees savee scikit-learn seaborn sentiment-analyser sentiment-analysis speech-emotion-regonition speech-sentiment-analysis tess voice-emotion-recognition voice-sentiment-analysis
Last synced: 07 May 2026
https://github.com/nannigalaxy/audio-preprocessing-tool
Audio preprocessing tool for signal processing and machine learning applications.
audio-processing augmentation machine-learning mfcc signal-processing
Last synced: 22 Jun 2025
https://github.com/hamedzarei/ms-speechprocessing-hw03
HW03 of Speech Processing
htk lpc mfcc speech-processing
Last synced: 05 Sep 2025
https://github.com/codersacademy006/speech-recognition-system
The objective of this deep learning model (DLM) is to recognize emotions from speech.
deep-learning emotion-detection emotion-recognition emotion-recognizer feature-extraction gradient-boosting keras kneighborsclassifier librosa machine-learning mfcc mlp-classifier neural-networks random-forest-classifier recurrent-neural-networks sklearn speech-emotion-recognition support-vector-machine
Last synced: 31 May 2026
https://github.com/loharmurtaza/fog_detection_subject_dependent
This repository is based on my research work "Detecting Freezing of Gait in Parkinson's Disease Patients Using Multi-Modal Machine Learning"
accelerometer detection eeg emg f1-score freezing-of-gait gyroscope machine-learning mfcc multi-modal-learning rf sensitivity skin-conductance specificity svm
Last synced: 20 Jan 2026
https://github.com/jubinjacob03/genre-classification-recommendation_spotify
Project for classifying audio files into different genres using the K-Nearest Neighbors (KNN) algorithm.
knn-classification mfcc python streamlit
Last synced: 15 May 2026
https://github.com/payalmh5/emotionrecognition
Emotion Recognition in Speech: This project leverages advanced machine learning techniques to classify emotions from speech using the Toronto Emotional Speech Set (TESS). By extracting Mel-Frequency Cepstral Coefficients (MFCC) and utilizing an LSTM-based deep learning model, the project accurately identifies emotions like anger, happiness, and sad
data-science deep-learning emotion-recognition keras lstm mfcc model-development model-training
Last synced: 30 Sep 2025
https://github.com/anishagg17/voice_to_gender_classifier
Identify a voice as male or female, based upon acoustic properties of the voice and speech extracted by processing audio.
gmm librosa mfcc scipy speech-processing
Last synced: 30 Apr 2026
https://github.com/deepthipathlawath20/emotion-recognition-bimodal
Bimodal emotion recognition (face + speech) with feature-level fusion and classic ML classifiers.
audio computer-vision emotion-recognition knn mfcc multimodal navie-bayes-algorithm python scikit-learn svm tensorflow
Last synced: 01 May 2026
https://github.com/dhanushi2620/aquasignature
Deep learning model using CRNN and MFCC features to classify underwater sounds and detect foreign threats based on acoustic frequency shifts.
acoustic-signature ai-for-defense anomaly-detection deep-learning-models keras librosa mfcc spectrogram tensorflow
Last synced: 09 May 2026