Projects in Awesome Lists tagged with mfcc

https://github.com/ddbourgin/numpy-ml

Machine learning, in numpy

attention bayesian-inference gaussian-mixture-models gaussian-processes good-turing-smoothing gradient-boosting hidden-markov-models knn lstm machine-learning mfcc neural-networks reinforcement-learning resnet topic-modeling vae wavenet wgan-gp word2vec

Last synced: 31 Dec 2024

https://github.com/aubio/aubio

A library for audio and music analysis

analysis annotation audio beat c extraction mfcc music onset pitch python sound tempo-tracking

Last synced: 31 Dec 2024

https://github.com/libaudioflux/audioflux

A library for audio and music analysis, feature extraction.

audio audio-analysis audio-features audio-processing deep-learning machine-learning mfcc mir music music-analysis music-information-retrieval pitch python signal-processing spectral-analysis spectrogram time-frequency-analysis wavelet-analysis wavelet-transform

Last synced: 24 Dec 2024

https://github.com/x4nth055/emotion-recognition-using-speech

Building and training a Speech Emotion Recognizer that predicts human emotions using Python, scikit-learn, and Keras

deep-learning emotion-detection emotion-recognition emotion-recognizer feature-extraction gradient-boosting keras kneighborsclassifier librosa machine-learning mfcc mlp-classifier neural-networks random-forest-classifier recurrent-neural-networks sklearn speech-emotion-recognition support-vector-machine

Last synced: 27 Dec 2024

https://github.com/ar1st0crat/nwaves

.NET DSP library with a lot of audio processing functions

adaptive-filtering audio dsp fda feature-extraction filtering lpc mfcc mir noise pitch psychoacoustics resampling signal sound-effects sound-synthesis time-stretch wav wavelets

Last synced: 27 Dec 2024

https://github.com/SuperKogito/spafe

spafe: Simplified Python Audio Features Extraction

audio audio-analysis beat dsp features-extraction filterbank frequencies frequency frequency-analysis gammatone-filterbanks mfcc music music-information-retrieval pitch python signal-processing sound speech-processing time-frequency-analysis voice

Last synced: 22 Nov 2024

https://github.com/adamstark/Gist

A C++ Library for Audio Analysis

audio audio-analysis c-plus-plus fft gist mfcc mir music music-information-retrieval onset-detection pitch-tracking spectral-analysis

Last synced: 27 Oct 2024

https://github.com/jsingh811/pyAudioProcessing

Audio feature extraction and classification

audio-data audio-files chroma-features classifier classifier-options classify classify-audio classify-audio-samples feature-extraction gfcc gfcc-extractor gfcc-features hyperparameter-tuning mfcc mfcc-extractor mfcc-features pyaudioprocessing spectral-features wav-files

Last synced: 26 Oct 2024

https://github.com/csukuangfj/kaldifeat

Kaldi-compatible online and offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd. Provides C++ and Python APIs.

cpp fbank features-extraction kaldi mfcc online-feature-extractor plp python pytorch streaming-feature-extractor

Last synced: 29 Dec 2024

https://github.com/sp-nitech/diffsptk

A differentiable version of SPTK

cepstrum cqt ddsp deep-learning digital-signal-processing dsp gmm k-means lpc lsp mdct mfcc nmf plp pqmf python pytorch signal-processing sptk stft

Last synced: 24 Dec 2024

https://github.com/tympanix/subsync

Synchronize your subtitles using machine learning

delay fix machine-learning mfcc neural-network shift shift-subtitle speech-detection subsync subtitle subtitles

Last synced: 24 Dec 2024

https://github.com/mycroftai/sonopy

A simple audio feature extraction library

audio-processing library mel-spectrogram mfcc sound spectrogram

Last synced: 21 Nov 2024

https://github.com/mathquis/node-personal-wakeword

Personal wake word detector

dtw hotword-detection hotword-detector mfcc node wakeword

Last synced: 13 Nov 2024

https://github.com/aubio/vamp-aubio-plugins

aubio plugins for Vamp

analysis aubio audio beat beat-detection beat-tracking mfcc music music-information-retrieval onset onset-detection tempo tempo-detection tempo-tracking vamp-plugins

Last synced: 24 Dec 2024

https://github.com/ringabout/scim

[WIP] Speech recognition toolbox written in Nim, based on Arraymancer.

arraymancer audio digital-signal-processing mfcc nim scientific-computing speech-analysis speech-processing speech-recognition wav

Last synced: 25 Nov 2024

https://github.com/ihabbendidi/voice-authentification-api

A RESTful API implementation of an authentication system using voice fingerprints

api authentication encoding flask gmm machine-learning mfcc mfcc-analysis mfcc-extractor mfcc-features security server voice voice-recognition

Last synced: 16 Dec 2024

https://github.com/dataxujing/asr-paper

ASR tutorial: https://dataxujing.github.io/ASR-paper/

asr citrinet conformer contextnet ctc dnn-hmm fbank gmm-hmm jasper las mfcc mocha neural-transducer quartznet rnn-t speech-transformer squeezeformer tandem transformer-transducer wfst

Last synced: 17 Dec 2024

https://github.com/javierantoran/tiger-costume-voice-conversion

Voice Alignment and Conversion with Neural Networks and the WORLD codec.

alignment dwt dynamic-time-warping mfcc mlpg neural-network speaker speech sptk trajectory-generation voice voice-alignment voice-conversion voice-generation

Last synced: 05 Nov 2024

https://github.com/orbxball/timit-preprocessor

Extract MFCC vectors and phones from the TIMIT dataset

data-preprocessing deep-learning mfcc phone speech-recognition timit timit-dataset

Last synced: 27 Oct 2024

https://github.com/baggepinnen/lpvspectral.jl

Least-squares (sparse) spectral estimation and (sparse) LPV spectral decomposition.

frequencies least-squares lomb-scargle-periodogram lpv mel-spectrogram mfcc periodogram power-spectral-density spectrogram spectrum spectrum-analyzer spectrum-identification system-identification time-series-analysis

Last synced: 21 Nov 2024

https://github.com/lucko515/speech-commands-recognition

Recognizing common speech commands using Keras and TensorFlow.

convolutional-neural-networks gru keras lstm mfcc recurrent-neural-networks speech-recognition speech-to-text tensorflow

Last synced: 15 Oct 2024

https://github.com/ragibson/mfcc-speech-recognition

Real-time speech recognition via "Mel-Frequency Cepstral Coefficients" neural networks.

machine-learning mfcc neural-network pytorch speech-recognition

Last synced: 09 Nov 2024

https://github.com/javierantoran/moby_dick_whale_audio_detection

Feature extraction, HMMs, Neural Nets, and Boosting for Kaggle Cornell Whale detection challenge.

cross-validation extract-features feature-extraction gradient-boosting hmm mfcc neural-network spectrogram whale

Last synced: 05 Nov 2024

https://github.com/certainlywrong/mfcc_bee

Implementation of the feature extraction algorithm in Dart.

dart mfcc mfcc-extractor

Last synced: 21 Nov 2024

https://github.com/brucewlee/lama-music-genre-dataset

.wav files, training dataset (MFCC), and graph plots (FFTs, MFCCs, waveforms) from Latin America, Asia, the Middle East, and Africa

africa asia audio-processing classification dataset genre genre-classification genre-suggestion genres-classification harvard-dataverse lama mfcc music music-library signal-processing sound

Last synced: 19 Dec 2024

https://github.com/linto-ai/sfeatpy

Library to extract MFCC features from audio signal

feature-extraction mfcc mfcc-features python3 speech-processing

Last synced: 27 Dec 2024

https://github.com/peteprattis/automatic-speech-recognision-system-asr

A Python script that implements an automatic speech recognition system.

asr automatic-speech-recognition computer-science dtw dynamic-time-warping fir-filter librosa mel-frequency-cepstral-coefficients mfcc nyquist program python short-time-fourier-transform short-time-signal-analysis signal signal-processing student

Last synced: 17 Nov 2024

https://github.com/linto-ai/gpu-ne10-mfcc

Work on accelerating MFCC feature extraction with NEON Ne10 and VideoCore IV on Raspberry Pi

arm mfcc raspberry-pi

Last synced: 27 Dec 2024

https://github.com/kennykarnama/go-mfcc

MFCC implementation in Go

dsp golang mfcc signal-processing

Last synced: 24 Nov 2024

https://github.com/mathquis/node-gist

Node binding for the Gist Audio Analysis Library

analysis audio fft gist mfcc pitch spectrum

Last synced: 13 Nov 2024

https://github.com/efecanxrd/speech-recognition

Identify the speaker from a given speech signal using MFCC features and Gaussian Mixture Models

gaussian-mixture-models gaussianmixturemodel gmm mfcc mfcc-algorithm mfcc-features python python-speech-features python-speechrecognition python27 recognition sklearn sklearn-gmm speech speech-recognition speech-recognizer tensorflow

Last synced: 22 Dec 2024

https://github.com/nannigalaxy/audio-preprocessing-tool

Audio preprocessing tool for signal processing and machine learning applications.

audio-processing augmentation machine-learning mfcc signal-processing

Last synced: 19 Dec 2024

https://github.com/anishagg17/voice_to_gender_classifier

Identify a voice as male or female, based upon acoustic properties of the voice and speech extracted by processing audio.

gmm librosa mfcc scipy speech-processing

Last synced: 29 Nov 2024

https://github.com/loharmurtaza/fog_detection_subject_dependent

This repository is based on my research work "Detecting Freezing of Gait in Parkinson's Disease Patients Using Multi-Modal Machine Learning"

accelerometer detection eeg emg f1-score freezing-of-gait gyroscope machine-learning mfcc multi-modal-learning rf sensitivity skin-conductance specificity svm

Last synced: 20 Dec 2024

https://github.com/loharmurtaza/fog_detection

This repository is based on my research work "Detecting Freezing of Gait in Parkinson's Disease Patients Using Multi-Modal Machine Learning"

accelerometer detection eeg emg f1-score freezing-of-gait gyroscope machine-learning mfcc multi-modal-learning rf sensitivity skin-conductance specificity svm

Last synced: 26 Sep 2024

https://github.com/hamedzarei/ms-speechprocessing-hw03

HW03 of Speech Processing

htk lpc mfcc speech-processing

Last synced: 08 Nov 2024

https://github.com/reshalfahsi/music-genre-classification

Music Genre Classification using MFCC + ANN

audio audio-analysis audio-processing cmvn gtzan-dataset mfcc mfcc-features music-genre-classification pytorch pytorch-lightning signal-processing stft

Last synced: 15 Nov 2024

https://github.com/kennykarnama/mfcc

MFCC feature extraction in MATLAB

dsp matlab mfcc

Last synced: 24 Nov 2024

https://github.com/nazir20/gender-classification-using-mfcc-with-lstm-model

Gender classification from audio signals is an important task with various applications, such as speech recognition systems, voice assistants, and speaker verification. This project aims to demonstrate how to use Mel Frequency Cepstral Coefficients as features and a Long Short-Term Memory (LSTM) neural network for gender classification.

audio-processing deep-learning deep-neural-networks gender-classification lstm lstm-neural-networks mfcc mfcc-extractor neural-networks

Last synced: 28 Nov 2024

https://github.com/ozymandiasthegreat/wakeword-zero

Personal wake word detector, ported to TypeScript/WASM

dtw fft hotword mfcc nativescript node wakeword wasm web

Last synced: 26 Sep 2024

https://github.com/mike014/audio-classification-

This is a prototype Django application that allows users to upload audio files and classify them using machine learning techniques.

ai audio django django-application machine-learning mfcc pca python

Last synced: 11 Oct 2024

https://github.com/jubinjacob03/genre-classification-recommendation_spotify

Project for classifying audio files into different genres using the K-Nearest Neighbors (KNN) algorithm.

knn-classification mfcc python streamlit

Last synced: 13 Nov 2024