Projects in Awesome Lists tagged with librosa
A curated list of projects in awesome lists tagged with librosa .
https://github.com/x4nth055/emotion-recognition-using-speech
Building and training Speech Emotion Recognizer that predicts human emotions using Python, Sci-kit learn and Keras
deep-learning emotion-detection emotion-recognition emotion-recognizer feature-extraction gradient-boosting keras kneighborsclassifier librosa machine-learning mfcc mlp-classifier neural-networks random-forest-classifier recurrent-neural-networks sklearn speech-emotion-recognition support-vector-machine
Last synced: 04 Apr 2025
https://github.com/marcogdepinto/emotion-classification-from-audio-files
Understanding emotions from audio files using neural networks and multiple datasets.
audio audio-processing classification-report datascience deep-learning deep-neural-networks emotion emotion-classification-ravdess keras keras-neural-networks librosa livingstone machine-learning python python3 ravdess-dataset song songs speech tensorflow
Last synced: 05 Apr 2025
https://github.com/marcogdepinto/Emotion-Classification-Ravdess
Understanding emotions from audio files using neural networks and multiple datasets.
audio audio-processing classification-report datascience deep-learning deep-neural-networks emotion emotion-classification-ravdess keras keras-neural-networks librosa livingstone machine-learning python python3 ravdess-dataset song songs speech tensorflow
Last synced: 12 Mar 2025
https://github.com/demfier/multimodal-speech-emotion-recognition
Lightweight and Interpretable ML Model for Speech Emotion Recognition and Ambiguity Resolution (trained on IEMOCAP dataset)
iemocap librosa lstm multimodal-emotion-recognition pandas python3 pytorch scikit-learn speech-emotion-recognition
Last synced: 01 May 2025
https://github.com/scherroman/mugen
A command-line music video generator based on rhythm
amv audio command-line librosa montage moviepy mugen music-video python remix rhythm tesseract video
Last synced: 21 Feb 2025
https://github.com/kaist-maclab/pytsmod
An open-source Python library for audio time-scale modification.
audio dsp librosa music numpy python scipy time-scale tsm
Last synced: 04 Apr 2025
https://github.com/KAIST-MACLab/PyTSMod
An open-source Python library for audio time-scale modification.
audio dsp librosa music numpy python scipy time-scale tsm
Last synced: 14 Jul 2025
https://github.com/spotify/realbook
Easier audio-based machine learning with TensorFlow.
audio cqt librosa machine-learning mel-spectrogram spectrograms stft tensorflow
Last synced: 04 Apr 2025
https://github.com/yeyupiaoling/audioclassification-paddlepaddle
基于PaddlePaddle实现的音频分类,支持EcapaTdnn、PANNS、TDNN、Res2Net、ResNetSE等各种模型,还有多种预处理方法
audio-classification ecapa-tdnn librosa paddlepaddle panns res2net resnet-se tdnn urbansound8k
Last synced: 06 Sep 2025
https://github.com/GianlucaPaolocci/Sound-classification-on-Raspberry-Pi-with-Tensorflow
In this project is presented a simple method to train an MLP neural network for audio signals. The trained model can be exported on a Raspberry Pi (2 or superior suggested) to classify audio signal registered with USB microphone
audio-analysis audio-signals dataset librosa machine-learning multilayer-perceptron-network raspberry raspberry-pi sound-classification tensorflow tensorflow-models
Last synced: 07 Apr 2025
https://github.com/gianlucapaolocci/sound-classification-on-raspberry-pi-with-tensorflow
In this project is presented a simple method to train an MLP neural network for audio signals. The trained model can be exported on a Raspberry Pi (2 or superior suggested) to classify audio signal registered with USB microphone
audio-analysis audio-signals dataset librosa machine-learning multilayer-perceptron-network raspberry raspberry-pi sound-classification tensorflow tensorflow-models
Last synced: 02 May 2025
https://github.com/ztrimus/speech-emotion-recognition
Predicting various emotion in human speech signal by detecting different speech components affected by human emotion.
audio-files colab-notebook convolutional-neural-networks data-science deep-learning emotion-detection emotion-recognition jupyter-notebook keras librosa lstm natural-language-processing neural-network python3 pytorch rnn speech-emotion-recognition speech-recoginition supervised-learning voice
Last synced: 14 Apr 2025
https://github.com/rohankrgupta/Orca-call-Classifier-Machine-learning
Advanced ML Project : An Orca Call classifier using mel-spectrograms as audio representations to detect Killer whales
advanced-machine-learning feature keras-tensorflow librosa mel-spectrograms opencv
Last synced: 11 Mar 2025
https://github.com/bits-bytes-nn/sound-anomaly-detection-with-autoencoders
MIMII Sound Anomaly Detection with AutoEncoders
anomaly-detection autoencoder bokeh librosa matplotlib sagemaker tensorflow variational-autoencoder
Last synced: 06 Jul 2025
https://github.com/danyalimran93/Music-Genre-Classification
Classifying English Music (.mp3) files using Music Information Retrieval (MIR), Digital/Audio Signal Processing (DIP) and Machine Learning (ML) Strategies
audio-signal-processing librosa machine-learning music-genre music-information-retrieval
Last synced: 26 Aug 2025
https://github.com/adzialocha/tomomibot
Artificial intelligence bot for live voice improvisation
keras librosa machine-learning music
Last synced: 05 Apr 2025
https://github.com/albincorreya/chromacoverid
Methods to compute various chroma audio features and audio similarity measures particularly for the task of cover song identification
audio-processing audio-similarity-measures chroma cover-song-detection cover-song-identification essentia librosa music-information-retrieval
Last synced: 23 Aug 2025
https://github.com/clolsonus/VirtualChoir
Automatically sync, mix, and draw virtual choir videos from raw tracks of individual recordings. You may need some singing skills but you don't need video editing skills or additional software.
audacity audio-tracks choir librosa opencv pydub python sync video-tracks virtual-choirs
Last synced: 09 May 2025
https://github.com/anujdutt9/audio-scene-classification
Scene Classification using Audio in the nearby Environment.
audio-classification deep-learning keras librosa python36 tensorflow
Last synced: 04 May 2025
https://github.com/swapnilkumbhar/dsp-project
Digital Signal Processing mini project: Autotune
digital-signal-processing librosa pitch-correction
Last synced: 13 May 2025
https://github.com/skempin/audio-peak-detection
Python script utilising Librosa to log the timings of audio peaks in an MP3 file
audio-analysis audio-applications librosa mp3 python python-2 wav
Last synced: 09 Apr 2025
https://github.com/andi611/conditional-specgan-tensorflow
Text-to-Speech Synthesis by Generating Spectrograms using Generative Adversarial Network
audio-synthesis conditional-gan digital-signal-processing gan librosa machine-learning nlp nlp-machine-learning tensorflow tts
Last synced: 13 Apr 2025
https://github.com/musa11971/manhuw
Recognizing and identifying Quran reciters from audio recordings.
librosa machine-learning python quran speaker-identification speaker-recognition
Last synced: 12 Aug 2025
https://github.com/ryoha000/librosapp
A C++ implementation of stft, melspectrogram and mel_to_stft
librosa melspectrogram spectrogram stft
Last synced: 14 Oct 2025
https://github.com/kookmin-sw/capstone-2022-15
IN4U - 면접 연습 웹 서비스
aws django interview interview-practice librosa mysql-database opencv react stylegan2 wav2lip
Last synced: 06 May 2025
https://github.com/bhattsameer/eyeshield
Data Transmission Between two devices using Sound
datasharing filesharing librosa matplotlib pied-piper pyaudio pydub python python3 sound soundcompare sounddatatransmission soundjoin soundsplit textsharing wave
Last synced: 19 Apr 2025
https://github.com/matlab-deep-learning/use-a-python-speech-command-recognition-system-to-matlab
Use a Python speech command recognition system in MATLAB
audio audio-processing co-execution deep deeplearning example librosa matlab pytorch speech-recognition
Last synced: 07 May 2025
https://github.com/zionc27/speech-emotion-recognition
Speech Emotion Recognition (SER) using Deep neural networks CNN and RNN
clstm cnn ipython-notebook keras librosa lstm machine-learning python rnn speech speech-emotion-classification speech-emotion-recognition tensorflow
Last synced: 09 May 2025
https://github.com/mehdihosseinimoghadam/signal-processing
Signal Processing with Python and Librosa
griffinlim librosa melspectrogram python signal-processing spectrogram torchaudio variational-autoencoder vector-quantization voice voice-reconstruction voice-synthesis vq-vae
Last synced: 22 Jul 2025
https://github.com/korseby/py3tag
Write tags to audio files (mp3, flac, and m4a are supported) based on their filenames
audioread flac-files librosa m4a-files mp3-files mutagen python-libraries python3 scheme
Last synced: 11 Apr 2025
https://github.com/georgiosioannoucoder/vera
Voice Emotion Recognition of Audio (VERA) is an open-source project created for the Data Science track for the program CUNY Tech Prep (CTP) in Cohort 8. 🔊
audio-classification classification cnn-model data-science emotion emotion-recognition librosa machine-learning speech-emotion-recognition voice-emotion
Last synced: 09 Aug 2025
https://github.com/inishchith/soundanalysis
2nd Runner-Up @MumbaiHackathon 2017
librosa music numpy python3 scipy sound-clips
Last synced: 20 Aug 2025
https://github.com/agentmaker/paddle-librosa
Paddle-Librosa provides Paddle implementation of some librosa functions
Last synced: 24 Mar 2025
https://github.com/rupeshs/audio-regen
audio fourier-transform librosa matlabplot python signalprocessing spectrogram stft
Last synced: 10 Jul 2025
https://github.com/akash-rajak/volume-suggester
Python Script to suggest the volume at which the music audio file needs to be played for better experience and feeling.
audio-feature-extraction audio-loudness ffmpeg librosa matplotlib mutagen numpy os path pyaudio pydub pynput python3 subprocess tkinter volume-suggester wave
Last synced: 18 Oct 2025
https://github.com/akashmodak97/genre-detection-of-bengali-songs-based-on-audio-data
Genre Detection of Bengali Rabindranath Tagore's Song Based On Audio Data.
bengali-songs deep-learning deep-neural-networks feature-extraction genre-classification keras-tensorflow librosa lstm mfcc-features music-classification neural-network python3 song-genre tensorflow2
Last synced: 08 Oct 2025
https://github.com/pprattis/automatic-speech-recognision-system-ASR
A python script that implements an automatic speech recognision system.
asr automatic-speech-recognition computer-science dtw dynamic-time-warping fir-filter librosa mel-frequency-cepstral-coefficients mfcc nyquist program python short-time-fourier-transform short-time-signal-analysis signal signal-processing student
Last synced: 28 Sep 2025
https://github.com/matlab-deep-learning/convert-librosa-audio-feature-extraction-to-matlab
Convert librosa Audio Feature Extraction To MATLAB
audio deep-learning librosa matlab matlab-deep-learning pytorch
Last synced: 02 Jan 2026
https://github.com/talkuhulk/music-genres-classification
Tensorflow implementation of music-genres-classification with InceptionResnetV2
audio-classification classification cnn-tensorflow genres-classification inception-resnet-v2 librosa python tensorflow
Last synced: 16 Oct 2025
https://github.com/pprattis/automatic-speech-recognision-system-asr
A python script that implements an automatic speech recognision system.
asr automatic-speech-recognition computer-science dtw dynamic-time-warping fir-filter librosa mel-frequency-cepstral-coefficients mfcc nyquist program python short-time-fourier-transform short-time-signal-analysis signal signal-processing student
Last synced: 07 Sep 2025
https://github.com/kr1shnasomani/tonesense
Speech emotion recognition from audio clips using CNN
deep-learning keras librosa matplotlib neural-network numpy pandas scikit-learn tensorflow
Last synced: 06 Apr 2025
https://github.com/parthvadhadiya/tensorflow-speech-recognition-challenge
this repository contains end to end python script to train speech data provided by google, evaluate testing data, and submite to competition
competition kaggle-competition keras librosa spectrum speech-data speech-recognition tensorflow
Last synced: 29 Jun 2025
https://github.com/santiviquez/noisy-human-recognition
Recognized non-speech human sounds such as: clapping, footsteps, brushing teeth, drinking sipping, laughing, etc
audio-classification audio-recognition librosa pytorch
Last synced: 01 Mar 2025
https://github.com/saadarazzaq/speech-to-text-transformer
ASR with Facebook's Wav2Vec2 model for accurate 🎙️ to 📝 conversion.
asr huggingface-transformer librosa speech-recognition speech-to-text transformer wav2vec2-base-960h wav2vec2ctc wav2vec2tokenizer
Last synced: 17 Mar 2025
https://github.com/harmanveer2546/music-genre-classification
Classifying the genre of a music using deep neural networks.
cnn deep-learning ipython keras knn labelencoder librosa music numpy pandas pickle python scipy sequential svc tensorflow
Last synced: 29 Dec 2025
https://github.com/georgiosioannoucoder/vera-deployed
Voice Emotion Recognition of Audio (VERA) is an open-source project created for the Data Science track for the program CUNY Tech Prep (CTP) in Cohort 8. This is the deployed version of Vera. 🔊
audio-classification classification cnn-model data-science emotion emotion-recognition librosa machine-learning speech-emotion-recognition voice-recognition
Last synced: 12 Nov 2025
https://github.com/adzialocha/notebook
Jupyter notebooks for random experiments with audio processing, data analysis and machine learning
jupyter-notebook keras learning librosa music21 scikit-learn
Last synced: 30 Oct 2025
https://github.com/palak-463/tablataalrecognitionsystem
Software built using Python which makes use of CNN and FNN to detect the Taals of the Tabla, an Indian classical music instrument. 🎛️
cnn deep-learning flask fnn librosa numpy os pickle python scikit-learn
Last synced: 30 Dec 2025
https://github.com/djdhairya/speech-emotion-recognition-
Predicting various emotion in human speech signal by detecting different speech components affected by human emotion.
audio-processing convolutional-neural-networks data-science deep-learning emotion-detection emotion-recognition kersa librosa natural-language-processing nuralnetwork overfitting python pytorch rnn speech-emotion-recognition speech-processing speech-recognition voice
Last synced: 13 Apr 2025
https://github.com/kitsuya0828/inpersonation-app
An application that automatically scores your mimics
dtw-algorithm librosa python3 streamlit
Last synced: 04 Apr 2025
https://github.com/natgluons/chronosense
Personalized Sleep Optimizer App, a machine learning project that analyzes sleep audio using librosa, PyTorch, and scikit-learn to detect disturbances and optimize sleep quality through personalized recommendations.
audio-analysis audio-classification audio-processing chronobiology librosa sleep-analysis sleep-research sleep-tracker torchaudio
Last synced: 26 Jun 2025
https://github.com/machinelearningzuu/data-engineering-process-of-audio-data
This Repository Consists of the Feature Engineering Process of Audio Signals in both Time Domain & Frequency Domain. In more the repository contains Jupiter-notebook implementations which uses python & librosa
audio-processing librosa machine-learning python
Last synced: 29 Mar 2025
https://github.com/joshuamhtsang/yt2spec
Convert audio into spectrograms.
ffmpeg flask flask-restful librosa python3 youtube-dl
Last synced: 05 Nov 2025
https://github.com/ptrpaws/augaudio
A simple audio data augmentation package
audio data-augmentation librosa python python3 simple
Last synced: 27 Aug 2025
https://github.com/alihassanml/speech-recognition-system
This project implements a speech recognition system using the LibriSpeech dataset and the `librosa` library for feature extraction, alongside a deep learning model built with TensorFlow/Keras.
deep-learning librosa speech-recognition speech-to-text
Last synced: 31 Mar 2025
https://github.com/rijoslal/mickey
Mickey is a ML web app that captures emotions in music using LSTM and GRU-based neural networks built with TensorFlow. It features a FastAPI backend with Jinja templates for the frontend, and uses Librosa for audio processing. The system analyzes music to classify emotions, making it a powerful tool for mood-based music recommendations
fastapi html-css-javascript jinja2-templates librosa sklearn tensorflow
Last synced: 15 Mar 2025
https://github.com/niranjanchaudhari0929/prediction-of-insect-species-using-acoustic-features
Prediction model built to predict the insect species using the acoustic data gathered.
librosa matplotlib pandas sklearn
Last synced: 23 Mar 2025
https://github.com/brayvid/engine-detection
Flatiron School Data Science Bootcamp Phase 4 Project
audio-analysis classification convolution data-science kaggle keras librosa machine-learning neural-network spectrogram xgboost
Last synced: 11 Jun 2025
https://github.com/thekartikeyamishra/voicecloner
The Voice Cloner is a Python-based project that leverages Tacotron 2 and WaveGlow models for text-to-speech (TTS) synthesis and basic voice cloning. This project supports 22 official Indian languages, including Sanskrit, making it versatile for multilingual text input.
ai indic-transliteration librosa machine-learning numpy nvidia-pyindex nvidia-tacotron2 nvidia-waveglow python torch torchaudio
Last synced: 20 Feb 2025
https://github.com/kitsuyaazuma/inpersonation-app
An application that automatically scores your mimics
dtw-algorithm librosa python3 streamlit
Last synced: 07 May 2025
https://github.com/dhavaltaunk08/gender-classification
I did this project during my internship at IIT Guwahati. It aimed to perform gender classification in video streaming.
deep-learning librosa opencv-python python scikit-learn
Last synced: 10 Oct 2025
https://github.com/alex1iv/asr_ru_numbers
Automatic Speech Recognition (ASR) system for Russian digits
audio-processing librosa numpy speech-recognition tensorflow
Last synced: 03 Nov 2025
https://github.com/ziadasem/audio-processing-for-ml
audio processing with python and librosa for ML
audio-processing librosa machine-learning python
Last synced: 27 Feb 2025
https://github.com/sagartr/deep-audio-classifier-using-machine-learning
Languages Used: Python Developed and implemented a deep audio classifier using CNNs and LSTMs to accurately categorize diverse audio signals, achieving high accuracy and robustness. Utilized Python and TensorFlow for model development and training, incorporating data augmentation techniques to enhance performance
audio-processing capuchin librosa python tensorflow tensorflow-models
Last synced: 07 Apr 2025
https://github.com/najdbinrabah/deep-learning-with-tensorflow-and-keras
This project explores emotion recognition in audio data, focusing on feature extraction techniques while also comparing the performance of LSTM and 1D CNN models.
1d-convolutional-neural-network audio-analysis chroma-features convolutional-neural-networks data-science deep-learning deep-neural-networks emotion-detection feature-extraction keras librosa long-short-term-memory-network machine-learning mel-frequency-cepstral-coefficients mel-spectrogram multiclass-classification python recurrent-neural-networks tensorflow
Last synced: 06 Apr 2025
https://github.com/beyza-ozben/fft_ses_temizleme
BİL314-Sinyaller ve Sistemler Dersi/Final Projesi (Fourier Dönüşümü)
audio-denoising conda-environment fastfouriertransform fft librosa matplotlib noisereduce numpy python scipy-library soundfile
Last synced: 27 Jun 2025
https://github.com/iamarunbrahma/spoken-digit-recognition
In this notebook, we are recognizing digits from 0 to 9 based on audio recordings file. Input data will be in the form of speech signal and output will be a single digit.
librosa lstm speech-recognition
Last synced: 28 Mar 2025
https://github.com/dhanushi2620/aquasignature
Deep learning model using CRNN and MFCC features to classify underwater sounds and detect foreign threats based on acoustic frequency shifts.
acoustic-signature ai-for-defense anomaly-detection deep-learning-models keras librosa mfcc spectrogram tensorflow
Last synced: 13 Sep 2025
https://github.com/usmana5809/quran-recitation-audio-classification
Quran Recitation Audio Classification project aims to classify different recitations of the Quran using machine learning techniques. It involves preprocessing audio data, extracting features, training models, and evaluating their performance
audio-classification classification-model islamic-studies librosa machine-learning python quran scikit-learn
Last synced: 20 Mar 2025
https://github.com/wxjiao/librosa-audio-features
Temporal audio features extraction by Librosa.
Last synced: 22 Jul 2025
https://github.com/psavarmattas/speechtotext
we shall build a very simple speech recognition system that takes our voice as input and produces the corresponding text by hearing the input.
facebook-api ipython librosa machine-learning numpy python pytorch soundfile transformers
Last synced: 29 Dec 2025
https://github.com/khushijtrivedi/speech
The Assistive Speech Technology System is designed to enhance communication by analyzing and processing various speech and audio inputs.
ajax bigru-crf bootstrap flask flask-server html-css-javascript librosa python restapi-framework voice-recognition whisper
Last synced: 08 Oct 2025
https://github.com/zvikinoza/masr
Mini Automatic Speech Recognition
fourier-transform keras-tensorflow librosa sound-processing speech-recognition speech-to-text
Last synced: 21 Mar 2025
https://github.com/harmanveer-2546/music-genre-classification
Classifying the genre of a music using deep neural networks.
cnn deep-neural-networks keras labelencoder librosa matplotlib music numpy pandas pickle python scipy seaborn sequential-models tensorflow
Last synced: 03 Sep 2025
https://github.com/hayatiyrtgl/audio_processing_for_cnn_network
Spectrum creation is the most important thing while dealing with audio data
audio audio-processing librosa preprocessing preprocessing-data python stft
Last synced: 08 Apr 2025
https://github.com/gicehajunior/speech-extraction-python
Extraction of text from audio clip using moviepy and speech_recognition library, python
librosa speech speech-recognition speech-to-text
Last synced: 03 Apr 2025
https://github.com/anishagg17/voice_to_gender_classifier
Identify a voice as male or female, based upon acoustic properties of the voice and speech extracted by processing audio.
gmm librosa mfcc scipy speech-processing
Last synced: 12 Jun 2025
https://github.com/jaay7/emotion-detection
jupyter-notebook librosa mlp-classifier python3 recurrent-neural-networks tensorflow
Last synced: 09 Apr 2025
https://github.com/namratha2301/dogcat
Web Application that Identifies Animal from their Sound. Right now restricted to binary classification between cat and dog sounds.
ann azure bashscript cnn flake8 flask keras librosa python-3-9 tailwindcss tensorflow voice-processing
Last synced: 30 Dec 2025
https://github.com/sugarcane-mk/finetuning_wav2vec2
This repo provides step by step process from sctatch to fine tune facebook's wav2vec2-large model using transformers
asr asr-model cuda facebook fairseq fine-tuning finetuning huggingface librosa python torch transformers wav2vec2 wav2vec2-large-960h
Last synced: 18 Mar 2025
https://github.com/sudemc/firstvoiceproject
🎵 Müzik Enstrüman Ayrıştırma ve Görselleştirme Projesi Bu proje, bir müzik parçasını Spleeter ve Librosa kullanarak enstrüman ve vokal bileşenlerine ayırır. Ayrıca, ses sinyallerinin spektral ve zamansal analizini görselleştirir.
audio-processing deep-learning librosa machine-learning music musicanalysis python spleeter visualization
Last synced: 09 Apr 2025
https://github.com/hallowshaw/speech-emotion-recognition-with-mfcc
A project to classify emotions like happiness, sadness, and anger from speech using MFCCs, machine learning models, and visualizations for audio features and model performance.
crema-d kaggle-dataset librosa lstm matplotlib mel-frequency-cepstral-coefficient mfcc mfcc-algorithm python ravdees savee scikit-learn seaborn sentiment-analyser sentiment-analysis speech-emotion-regonition speech-sentiment-analysis tess voice-emotion-recognition voice-sentiment-analysis
Last synced: 23 Feb 2025
https://github.com/sujalk777/signal_systems_lab
This repository contains the assignments for the Signal Systems Laboratory course offered at IIT Jammu Autumn 24
jupyter-notebook librosa linux matplotlib numpy python raspberry-pi
Last synced: 30 Dec 2025
https://github.com/relative-log31/sync-audio-and-video
A program that synchronizes video and audio.
librosa numpy pyqt pyqt6 python python-app python-script python3
Last synced: 21 Jun 2025
https://github.com/costopoulos/ntua-dsp
:signal_strength: NTUA ECE Digital Signal Processing Course Source Codes and Reports
dsp filters fourier-transform librosa numpy pywt scipy short-time-signal-analysis stft
Last synced: 16 May 2025
https://github.com/georgiosioannoucoder/vera-deployed-v2
Voice Emotion Recognition of Audio (VERA) is an open-source project created for the Data Science track for the program CUNY Tech Prep (CTP) in Cohort 8. This is the 2nd deployed version of VERA. 🔊
audio-classification classification cnn-model data-science emotion emotion-recognition librosa machine-learning speech-emotion-recognition voice-emotion
Last synced: 22 Feb 2025
https://github.com/vasugi2003/fusion-ai---multimodal-persuvasiveness-prediction
Developed a system to predict persuasiveness using multi-modal data (text, images, audio). Utilized BERT for text embeddings, ResNet for image features, and Librosa for audio analysis. Fused data from all modalities for enhanced prediction accuracy.
ai bert-model fusion librosa multimodal-deep-learning python resnet-50 tensorflow
Last synced: 08 Apr 2025
https://github.com/sanatren/signal_processing_and_speech_recognition
all the practices related to speech recognition and pytorch for audios.
librosa signal-processing speech-recognition
Last synced: 28 Jul 2025
https://github.com/theodor94/py-audio-visualizer
A cross-platform GUI tool that transforms audio files into high-resolution visualizations and detailed TXT reports.
audio cross-platform ffmpeg librosa matplotlib python spectrogram tkinter unbound-planet visualizer waveform
Last synced: 04 Oct 2025
https://github.com/hayatiyrtgl/audio_classification_with_ann
Ann for Audio classification.
artificial-intelligence artificial-intelligence-algorithms artificial-neural-networks audio audio-processing keras librosa python
Last synced: 26 Jul 2025
https://github.com/sanchariii/musicgenre_classification
Music genre classification is a machine learning model by which the model can predict music and classify the music based on popular genres like pop,jazz,rock,hip-hop,lofi etc.
convolutional-neural-networks jupyer-notebook librosa python
Last synced: 05 Sep 2025
https://github.com/kavayk29/audio-classification-using-python-library
This is a audio classification Project using python Libraries such as librosa to make the visual representation of the audio files, and using numpy to make array of data for manipulation and then extraction the features for classification to train and test of CNN model.
librosa matplotlib-pyplot mfcc-features numpy pandas sklearn-library
Last synced: 26 Feb 2025