Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/sagartr/deep-audio-classifier-using-machine-learning

Languages Used: Python Developed and implemented a deep audio classifier using CNNs and LSTMs to accurately categorize diverse audio signals, achieving high accuracy and robustness. Utilized Python and TensorFlow for model development and training, incorporating data augmentation techniques to enhance performance
https://github.com/sagartr/deep-audio-classifier-using-machine-learning

audio-processing capuchin librosa python tensorflow tensorflow-models

Last synced: 7 days ago
JSON representation

Host: GitHub
URL: https://github.com/sagartr/deep-audio-classifier-using-machine-learning
Owner: SAGARTR
Created: 2024-07-28T09:16:50.000Z (2 months ago)
Default Branch: main
Last Pushed: 2024-07-28T09:24:55.000Z (2 months ago)
Last Synced: 2024-09-26T21:04:37.507Z (7 days ago)
Topics: audio-processing, capuchin, librosa, python, tensorflow, tensorflow-models
Language: Jupyter Notebook
Homepage:
Size: 104 KB
Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# Deep-Audio-Classifier-using-Machine-Learning
Languages Used: Python

Developed and implemented a deep audio classifier using CNNs and LSTMs to accurately categorize diverse audio signals, achieving high accuracy and robustness. Utilized Python and TensorFlow for model development and training, incorporating data augmentation techniques to enhance performance.

The code need the data file to be downloded from kaggel :
link to the data : https://www.kaggle.com/datasets/kenjee/z-by-hp-unlocked-challenge-3-signal-processing

Key Components:
a. Audio Preprocessing: Convert raw audio waveforms into spectrograms for processing by convolutional neural networks (CNNs).

b. Deep Learning Model: Train a CNN or recurrent neural network (RNN) on preprocessed audio data to learn features and classify audio into categories like speech, music or environmental sounds.
c. Sliding Window Classification: Divide longer audio clips into shorter segments, apply the trained model to each segment, and aggregate the individual classifications to determine overall density of target audio events.
d. Model Training and Optimization: Train the model on a diverse dataset using techniques like data augmentation and regularization to improve generalization.

By leveraging the power of deep learning, the deep audio classifier can significantly enhance the efficiency and accuracy of various audio processing tasks, making it a valuable tool in a wide range of industries.