An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with librosa

A curated list of projects in awesome lists tagged with librosa .

https://github.com/librosa/librosa

Python library for audio and music analysis

audio dsp librosa music python scipy

Last synced: 12 May 2025

https://github.com/demfier/multimodal-speech-emotion-recognition

Lightweight and Interpretable ML Model for Speech Emotion Recognition and Ambiguity Resolution (trained on IEMOCAP dataset)

iemocap librosa lstm multimodal-emotion-recognition pandas python3 pytorch scikit-learn speech-emotion-recognition

Last synced: 01 May 2025

https://github.com/scherroman/mugen

A command-line music video generator based on rhythm

amv audio command-line librosa montage moviepy mugen music-video python remix rhythm tesseract video

Last synced: 21 Feb 2025

https://github.com/kaist-maclab/pytsmod

An open-source Python library for audio time-scale modification.

audio dsp librosa music numpy python scipy time-scale tsm

Last synced: 04 Apr 2025

https://github.com/KAIST-MACLab/PyTSMod

An open-source Python library for audio time-scale modification.

audio dsp librosa music numpy python scipy time-scale tsm

Last synced: 14 Jul 2025

https://github.com/spotify/realbook

Easier audio-based machine learning with TensorFlow.

audio cqt librosa machine-learning mel-spectrogram spectrograms stft tensorflow

Last synced: 04 Apr 2025

https://github.com/yeyupiaoling/audioclassification-paddlepaddle

基于PaddlePaddle实现的音频分类,支持EcapaTdnn、PANNS、TDNN、Res2Net、ResNetSE等各种模型,还有多种预处理方法

audio-classification ecapa-tdnn librosa paddlepaddle panns res2net resnet-se tdnn urbansound8k

Last synced: 06 Sep 2025

https://github.com/GianlucaPaolocci/Sound-classification-on-Raspberry-Pi-with-Tensorflow

In this project is presented a simple method to train an MLP neural network for audio signals. The trained model can be exported on a Raspberry Pi (2 or superior suggested) to classify audio signal registered with USB microphone

audio-analysis audio-signals dataset librosa machine-learning multilayer-perceptron-network raspberry raspberry-pi sound-classification tensorflow tensorflow-models

Last synced: 07 Apr 2025

https://github.com/gianlucapaolocci/sound-classification-on-raspberry-pi-with-tensorflow

In this project is presented a simple method to train an MLP neural network for audio signals. The trained model can be exported on a Raspberry Pi (2 or superior suggested) to classify audio signal registered with USB microphone

audio-analysis audio-signals dataset librosa machine-learning multilayer-perceptron-network raspberry raspberry-pi sound-classification tensorflow tensorflow-models

Last synced: 02 May 2025

https://github.com/rohankrgupta/Orca-call-Classifier-Machine-learning

Advanced ML Project : An Orca Call classifier using mel-spectrograms as audio representations to detect Killer whales

advanced-machine-learning feature keras-tensorflow librosa mel-spectrograms opencv

Last synced: 11 Mar 2025

https://github.com/danyalimran93/Music-Genre-Classification

Classifying English Music (.mp3) files using Music Information Retrieval (MIR), Digital/Audio Signal Processing (DIP) and Machine Learning (ML) Strategies

audio-signal-processing librosa machine-learning music-genre music-information-retrieval

Last synced: 26 Aug 2025

https://github.com/adzialocha/tomomibot

Artificial intelligence bot for live voice improvisation

keras librosa machine-learning music

Last synced: 05 Apr 2025

https://github.com/albincorreya/chromacoverid

Methods to compute various chroma audio features and audio similarity measures particularly for the task of cover song identification

audio-processing audio-similarity-measures chroma cover-song-detection cover-song-identification essentia librosa music-information-retrieval

Last synced: 23 Aug 2025

https://github.com/clolsonus/VirtualChoir

Automatically sync, mix, and draw virtual choir videos from raw tracks of individual recordings. You may need some singing skills but you don't need video editing skills or additional software.

audacity audio-tracks choir librosa opencv pydub python sync video-tracks virtual-choirs

Last synced: 09 May 2025

https://github.com/anujdutt9/audio-scene-classification

Scene Classification using Audio in the nearby Environment.

audio-classification deep-learning keras librosa python36 tensorflow

Last synced: 04 May 2025

https://github.com/swapnilkumbhar/dsp-project

Digital Signal Processing mini project: Autotune

digital-signal-processing librosa pitch-correction

Last synced: 13 May 2025

https://github.com/skempin/audio-peak-detection

Python script utilising Librosa to log the timings of audio peaks in an MP3 file

audio-analysis audio-applications librosa mp3 python python-2 wav

Last synced: 09 Apr 2025

https://github.com/xi/infinity-player

infinite jukebox clone using librosa

audio librosa numpy

Last synced: 29 Jul 2025

https://github.com/andi611/conditional-specgan-tensorflow

Text-to-Speech Synthesis by Generating Spectrograms using Generative Adversarial Network

audio-synthesis conditional-gan digital-signal-processing gan librosa machine-learning nlp nlp-machine-learning tensorflow tts

Last synced: 13 Apr 2025

https://github.com/musa11971/manhuw

Recognizing and identifying Quran reciters from audio recordings.

librosa machine-learning python quran speaker-identification speaker-recognition

Last synced: 12 Aug 2025

https://github.com/ryoha000/librosapp

A C++ implementation of stft, melspectrogram and mel_to_stft

librosa melspectrogram spectrogram stft

Last synced: 14 Oct 2025

https://github.com/librosa/data

Example (audio) data for use with librosa

audio librosa

Last synced: 01 May 2025

https://github.com/korseby/py3tag

Write tags to audio files (mp3, flac, and m4a are supported) based on their filenames

audioread flac-files librosa m4a-files mp3-files mutagen python-libraries python3 scheme

Last synced: 11 Apr 2025

https://github.com/georgiosioannoucoder/vera

Voice Emotion Recognition of Audio (VERA) is an open-source project created for the Data Science track for the program CUNY Tech Prep (CTP) in Cohort 8. 🔊

audio-classification classification cnn-model data-science emotion emotion-recognition librosa machine-learning speech-emotion-recognition voice-emotion

Last synced: 09 Aug 2025

https://github.com/inishchith/soundanalysis

2nd Runner-Up @MumbaiHackathon 2017

librosa music numpy python3 scipy sound-clips

Last synced: 20 Aug 2025

https://github.com/agentmaker/paddle-librosa

Paddle-Librosa provides Paddle implementation of some librosa functions

librosa paddlepaddle

Last synced: 24 Mar 2025

https://github.com/akash-rajak/volume-suggester

Python Script to suggest the volume at which the music audio file needs to be played for better experience and feeling.

audio-feature-extraction audio-loudness ffmpeg librosa matplotlib mutagen numpy os path pyaudio pydub pynput python3 subprocess tkinter volume-suggester wave

Last synced: 18 Oct 2025

https://github.com/talkuhulk/music-genres-classification

Tensorflow implementation of music-genres-classification with InceptionResnetV2

audio-classification classification cnn-tensorflow genres-classification inception-resnet-v2 librosa python tensorflow

Last synced: 16 Oct 2025

https://github.com/kr1shnasomani/tonesense

Speech emotion recognition from audio clips using CNN

deep-learning keras librosa matplotlib neural-network numpy pandas scikit-learn tensorflow

Last synced: 06 Apr 2025

https://github.com/parthvadhadiya/tensorflow-speech-recognition-challenge

this repository contains end to end python script to train speech data provided by google, evaluate testing data, and submite to competition

competition kaggle-competition keras librosa spectrum speech-data speech-recognition tensorflow

Last synced: 29 Jun 2025

https://github.com/santiviquez/noisy-human-recognition

Recognized non-speech human sounds such as: clapping, footsteps, brushing teeth, drinking sipping, laughing, etc

audio-classification audio-recognition librosa pytorch

Last synced: 01 Mar 2025

https://github.com/georgiosioannoucoder/vera-deployed

Voice Emotion Recognition of Audio (VERA) is an open-source project created for the Data Science track for the program CUNY Tech Prep (CTP) in Cohort 8. This is the deployed version of Vera. 🔊

audio-classification classification cnn-model data-science emotion emotion-recognition librosa machine-learning speech-emotion-recognition voice-recognition

Last synced: 12 Nov 2025

https://github.com/adzialocha/notebook

Jupyter notebooks for random experiments with audio processing, data analysis and machine learning

jupyter-notebook keras learning librosa music21 scikit-learn

Last synced: 30 Oct 2025

https://github.com/palak-463/tablataalrecognitionsystem

Software built using Python which makes use of CNN and FNN to detect the Taals of the Tabla, an Indian classical music instrument. 🎛️

cnn deep-learning flask fnn librosa numpy os pickle python scikit-learn

Last synced: 30 Dec 2025

https://github.com/kitsuya0828/inpersonation-app

An application that automatically scores your mimics

dtw-algorithm librosa python3 streamlit

Last synced: 04 Apr 2025

https://github.com/natgluons/chronosense

Personalized Sleep Optimizer App, a machine learning project that analyzes sleep audio using librosa, PyTorch, and scikit-learn to detect disturbances and optimize sleep quality through personalized recommendations.

audio-analysis audio-classification audio-processing chronobiology librosa sleep-analysis sleep-research sleep-tracker torchaudio

Last synced: 26 Jun 2025

https://github.com/machinelearningzuu/data-engineering-process-of-audio-data

This Repository Consists of the Feature Engineering Process of Audio Signals in both Time Domain & Frequency Domain. In more the repository contains Jupiter-notebook implementations which uses python & librosa

audio-processing librosa machine-learning python

Last synced: 29 Mar 2025

https://github.com/joshuamhtsang/yt2spec

Convert audio into spectrograms.

ffmpeg flask flask-restful librosa python3 youtube-dl

Last synced: 05 Nov 2025

https://github.com/ptrpaws/augaudio

A simple audio data augmentation package

audio data-augmentation librosa python python3 simple

Last synced: 27 Aug 2025

https://github.com/alihassanml/speech-recognition-system

This project implements a speech recognition system using the LibriSpeech dataset and the `librosa` library for feature extraction, alongside a deep learning model built with TensorFlow/Keras.

deep-learning librosa speech-recognition speech-to-text

Last synced: 31 Mar 2025

https://github.com/rijoslal/mickey

Mickey is a ML web app that captures emotions in music using LSTM and GRU-based neural networks built with TensorFlow. It features a FastAPI backend with Jinja templates for the frontend, and uses Librosa for audio processing. The system analyzes music to classify emotions, making it a powerful tool for mood-based music recommendations

fastapi html-css-javascript jinja2-templates librosa sklearn tensorflow

Last synced: 15 Mar 2025

https://github.com/niranjanchaudhari0929/prediction-of-insect-species-using-acoustic-features

Prediction model built to predict the insect species using the acoustic data gathered.

librosa matplotlib pandas sklearn

Last synced: 23 Mar 2025

https://github.com/thekartikeyamishra/voicecloner

The Voice Cloner is a Python-based project that leverages Tacotron 2 and WaveGlow models for text-to-speech (TTS) synthesis and basic voice cloning. This project supports 22 official Indian languages, including Sanskrit, making it versatile for multilingual text input.

ai indic-transliteration librosa machine-learning numpy nvidia-pyindex nvidia-tacotron2 nvidia-waveglow python torch torchaudio

Last synced: 20 Feb 2025

https://github.com/kitsuyaazuma/inpersonation-app

An application that automatically scores your mimics

dtw-algorithm librosa python3 streamlit

Last synced: 07 May 2025

https://github.com/dhavaltaunk08/gender-classification

I did this project during my internship at IIT Guwahati. It aimed to perform gender classification in video streaming.

deep-learning librosa opencv-python python scikit-learn

Last synced: 10 Oct 2025

https://github.com/alex1iv/asr_ru_numbers

Automatic Speech Recognition (ASR) system for Russian digits

audio-processing librosa numpy speech-recognition tensorflow

Last synced: 03 Nov 2025

https://github.com/ziadasem/audio-processing-for-ml

audio processing with python and librosa for ML

audio-processing librosa machine-learning python

Last synced: 27 Feb 2025

https://github.com/sagartr/deep-audio-classifier-using-machine-learning

Languages Used: Python Developed and implemented a deep audio classifier using CNNs and LSTMs to accurately categorize diverse audio signals, achieving high accuracy and robustness. Utilized Python and TensorFlow for model development and training, incorporating data augmentation techniques to enhance performance

audio-processing capuchin librosa python tensorflow tensorflow-models

Last synced: 07 Apr 2025

https://github.com/beyza-ozben/fft_ses_temizleme

BİL314-Sinyaller ve Sistemler Dersi/Final Projesi (Fourier Dönüşümü)

audio-denoising conda-environment fastfouriertransform fft librosa matplotlib noisereduce numpy python scipy-library soundfile

Last synced: 27 Jun 2025

https://github.com/iamarunbrahma/spoken-digit-recognition

In this notebook, we are recognizing digits from 0 to 9 based on audio recordings file. Input data will be in the form of speech signal and output will be a single digit.

librosa lstm speech-recognition

Last synced: 28 Mar 2025

https://github.com/dhanushi2620/aquasignature

Deep learning model using CRNN and MFCC features to classify underwater sounds and detect foreign threats based on acoustic frequency shifts.

acoustic-signature ai-for-defense anomaly-detection deep-learning-models keras librosa mfcc spectrogram tensorflow

Last synced: 13 Sep 2025

https://github.com/usmana5809/quran-recitation-audio-classification

Quran Recitation Audio Classification project aims to classify different recitations of the Quran using machine learning techniques. It involves preprocessing audio data, extracting features, training models, and evaluating their performance

audio-classification classification-model islamic-studies librosa machine-learning python quran scikit-learn

Last synced: 20 Mar 2025

https://github.com/wxjiao/librosa-audio-features

Temporal audio features extraction by Librosa.

audio-features librosa

Last synced: 22 Jul 2025

https://github.com/psavarmattas/speechtotext

we shall build a very simple speech recognition system that takes our voice as input and produces the corresponding text by hearing the input.

facebook-api ipython librosa machine-learning numpy python pytorch soundfile transformers

Last synced: 29 Dec 2025

https://github.com/khushijtrivedi/speech

The Assistive Speech Technology System is designed to enhance communication by analyzing and processing various speech and audio inputs.

ajax bigru-crf bootstrap flask flask-server html-css-javascript librosa python restapi-framework voice-recognition whisper

Last synced: 08 Oct 2025

https://github.com/hayatiyrtgl/audio_processing_for_cnn_network

Spectrum creation is the most important thing while dealing with audio data

audio audio-processing librosa preprocessing preprocessing-data python stft

Last synced: 08 Apr 2025

https://github.com/gicehajunior/speech-extraction-python

Extraction of text from audio clip using moviepy and speech_recognition library, python

librosa speech speech-recognition speech-to-text

Last synced: 03 Apr 2025

https://github.com/anishagg17/voice_to_gender_classifier

Identify a voice as male or female, based upon acoustic properties of the voice and speech extracted by processing audio.

gmm librosa mfcc scipy speech-processing

Last synced: 12 Jun 2025

https://github.com/namratha2301/dogcat

Web Application that Identifies Animal from their Sound. Right now restricted to binary classification between cat and dog sounds.

ann azure bashscript cnn flake8 flask keras librosa python-3-9 tailwindcss tensorflow voice-processing

Last synced: 30 Dec 2025

https://github.com/sugarcane-mk/finetuning_wav2vec2

This repo provides step by step process from sctatch to fine tune facebook's wav2vec2-large model using transformers

asr asr-model cuda facebook fairseq fine-tuning finetuning huggingface librosa python torch transformers wav2vec2 wav2vec2-large-960h

Last synced: 18 Mar 2025

https://github.com/sudemc/firstvoiceproject

🎵 Müzik Enstrüman Ayrıştırma ve Görselleştirme Projesi Bu proje, bir müzik parçasını Spleeter ve Librosa kullanarak enstrüman ve vokal bileşenlerine ayırır. Ayrıca, ses sinyallerinin spektral ve zamansal analizini görselleştirir.

audio-processing deep-learning librosa machine-learning music musicanalysis python spleeter visualization

Last synced: 09 Apr 2025

https://github.com/sujalk777/signal_systems_lab

This repository contains the assignments for the Signal Systems Laboratory course offered at IIT Jammu Autumn 24

jupyter-notebook librosa linux matplotlib numpy python raspberry-pi

Last synced: 30 Dec 2025

https://github.com/relative-log31/sync-audio-and-video

A program that synchronizes video and audio.

librosa numpy pyqt pyqt6 python python-app python-script python3

Last synced: 21 Jun 2025

https://github.com/costopoulos/ntua-dsp

:signal_strength: NTUA ECE Digital Signal Processing Course Source Codes and Reports

dsp filters fourier-transform librosa numpy pywt scipy short-time-signal-analysis stft

Last synced: 16 May 2025

https://github.com/georgiosioannoucoder/vera-deployed-v2

Voice Emotion Recognition of Audio (VERA) is an open-source project created for the Data Science track for the program CUNY Tech Prep (CTP) in Cohort 8. This is the 2nd deployed version of VERA. 🔊

audio-classification classification cnn-model data-science emotion emotion-recognition librosa machine-learning speech-emotion-recognition voice-emotion

Last synced: 22 Feb 2025

https://github.com/vasugi2003/fusion-ai---multimodal-persuvasiveness-prediction

Developed a system to predict persuasiveness using multi-modal data (text, images, audio). Utilized BERT for text embeddings, ResNet for image features, and Librosa for audio analysis. Fused data from all modalities for enhanced prediction accuracy.

ai bert-model fusion librosa multimodal-deep-learning python resnet-50 tensorflow

Last synced: 08 Apr 2025

https://github.com/sanatren/signal_processing_and_speech_recognition

all the practices related to speech recognition and pytorch for audios.

librosa signal-processing speech-recognition

Last synced: 28 Jul 2025

https://github.com/nannib/audiodf

This program can detect if an audio message is a Deep Fake or it is genuine

audio detection fake features forensic librosa tool wav

Last synced: 27 Jul 2025

https://github.com/theodor94/py-audio-visualizer

A cross-platform GUI tool that transforms audio files into high-resolution visualizations and detailed TXT reports.

audio cross-platform ffmpeg librosa matplotlib python spectrogram tkinter unbound-planet visualizer waveform

Last synced: 04 Oct 2025

https://github.com/sanchariii/musicgenre_classification

Music genre classification is a machine learning model by which the model can predict music and classify the music based on popular genres like pop,jazz,rock,hip-hop,lofi etc.

convolutional-neural-networks jupyer-notebook librosa python

Last synced: 05 Sep 2025

https://github.com/kavayk29/audio-classification-using-python-library

This is a audio classification Project using python Libraries such as librosa to make the visual representation of the audio files, and using numpy to make array of data for manipulation and then extraction the features for classification to train and test of CNN model.

librosa matplotlib-pyplot mfcc-features numpy pandas sklearn-library

Last synced: 26 Feb 2025