An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with audio-classification

A curated list of projects in awesome lists tagged with audio-classification.

https://github.com/towhee-io/examples

Analyze unstructured data with Towhee, with examples such as reverse image search, reverse video search, audio classification, question and answer systems, molecular search, etc.

audio-classification cross-modal embeddings image-classification machine-learning nlp video-tagging

Last synced: 04 Apr 2025

https://github.com/RetroCirce/HTS-Audio-Transformer

The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"

audio-classification music-information-retrieval python sound-event-detection transformer-models

Last synced: 14 Jul 2025

https://github.com/ksanjeevan/crnn-audio-classification

UrbanSound classification using Convolutional Recurrent Networks in PyTorch

audio audio-classification convnet crnn lstm melspectrogram pytorch rnn spectrogram

Last synced: 16 Jan 2026

https://github.com/YuanGongND/whisper-at

Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event Taggers"

audio audio-classification audio-processing audio-tagging speech-recognition

Last synced: 01 Apr 2025

https://github.com/drscotthawley/panotti

A multi-channel neural network audio classifier using Keras

audio-classification convolutional-neural-networks keras music-tagging neural-network tensorflow

Last synced: 24 Apr 2025

https://github.com/cwx-worst-one/eat

[IJCAI 2024] EAT: Self-Supervised Pre-Training with Efficient Audio Transformer

audio audio-classification deep-learning eat fairseq pytorch representation-learning self-supervised-learning

Last synced: 05 Apr 2025

https://github.com/siavashshams/ssamba

[SLT'24] The official implementation of SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model

audio audio-classification deep-learning emotion-recognition keyword-spotting mamba representation-learning self-supervised-learning speaker-identification state-space-model

Last synced: 06 Apr 2025

https://github.com/jonnor/esc-cnn-microcontroller

Environmental Sound Classification on Microcontrollers using Convolutional Neural Networks

audio-classification embedded-devices machine-learning master-thesis microcontroller thesis

Last synced: 06 Oct 2025

https://github.com/kaistmm/Audio-Mamba-AuM

Official Implementation of the work "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning"

audio audio-classification audio-mamba deep-learning mamba pytorch representation-learning speaker-identification speech-classification state-space-model

Last synced: 04 Aug 2025

https://github.com/yeyupiaoling/audioclassification-paddlepaddle

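Audio classification implemented with PaddlePaddle, supporting a range of models such as EcapaTdnn, PANNS, TDNN, Res2Net, and ResNetSE, along with multiple preprocessing methods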

audio-classification ecapa-tdnn librosa paddlepaddle panns res2net resnet-se tdnn urbansound8k

Last synced: 06 Sep 2025

https://github.com/JohannesBuchner/spoken-command-recognition

A large, free audio sample database (10M words pronounced), a test bed for voice activity detection algorithms and for single-syllable word recognition

audio audio-classification dataset machine-learning machine-learning-dataset spoken-english

Last synced: 11 Mar 2025

https://github.com/caldarie/flutter_tflite_audio

Audio classification TFLite package for Flutter (iOS & Android). Supports Google Teachable Machine models

android audio-classification flutter google-teachable-machine ios tflite

Last synced: 21 Feb 2026

https://github.com/chen0040/mxnet-audio

Implementation of music genre classification, audio-to-vec, song recommender, and music search in mxnet

audio-classification music-recommendation music-search mxnet song-recommender

Last synced: 15 Aug 2025

https://github.com/mtg/dcase-models

Python library for rapid prototyping of environmental sound analysis systems

audio-classification audio-tagging deep-learning python sound-event-detection

Last synced: 16 Oct 2025

https://github.com/pooya-mohammadi/audio-classification-pytorch

This project provides several approaches for training and fine-tuning an audio gender recognition model. The code can be reused for any other audio classification task by changing the number of classes and the input dataset.

audio-classification deep-learning deep-utils lstm python pytorch transformers wav2vec2

Last synced: 11 Apr 2025

https://github.com/emuell/afec

Cross platform audio feature extraction and sound classification tool

afec audio-classification audio-feature-extraction audio-features audio-formats classification-model lightgbm machine-learning

Last synced: 09 Oct 2025

https://github.com/otonomee/streamstem

Applies an ML audio separation algorithm to audio from YouTube or Spotify, producing "stems" for download (e.g. vocals, drums, bass) in MP3, WAV, or FLAC.

audio-classification deep-learning demucs fastapi machine-learning source-separation spotify-stemmer youtube-stemmer

Last synced: 26 Oct 2025

https://github.com/anujdutt9/audio-scene-classification

Scene Classification using Audio in the nearby Environment.

audio-classification deep-learning keras librosa python36 tensorflow

Last synced: 04 May 2025

https://github.com/awsaf49/sonics

[ICLR 2025] SONICS: Synthetic Or Not - Identifying Counterfeit Songs

audio-classification cnn deepfake-detection fake-song-detection music-dataset transformer

Last synced: 17 Jan 2026

https://github.com/f0k/birdclef2018

BirdCLEF 2018 implementation

audio-classification bioacoustics deep-learning

Last synced: 12 Apr 2025

https://github.com/sainathadapa/mediaeval-2019-moodtheme-detection

4th-place solution to the MediaEval 2019 task Emotion and Themes in Music using Jamendo

audio-classification deep-learning mediaeval music-information-retrieval

Last synced: 30 Jun 2025

https://github.com/zabir-nabil/audioperm

A python library for generating different permutations of audible segments from audio files.

audio-augmentation audio-classification audio-processing augmentation speaker-recognition speech-augmentation

Last synced: 26 Jul 2025

https://github.com/yas-sim/openvino-sound-classification-demo-rt

Real-time version of sound_classification_demo in the OpenVINO toolkit. Captures audio from the microphone, classifies it, and displays the result on screen with an illustration.

audio-classification deep-learning deep-learning-demo demo intel openvino python real-time sound-classification

Last synced: 22 Apr 2025

https://github.com/labbeti/sslh

Deep Semi-Supervised Learning with Holistic methods for audio classification.

audio-classification deep-learning machine-learning pytorch pytorch-lightning semi-supervised

Last synced: 08 May 2025

https://github.com/eonu/fsdd

Approximate Dynamic Time Warping (+kNN) and BiLSTM-RNN approaches to isolated word recognition on the Free Spoken Digit Dataset.

audio-classification dtw dynamic-time-warping free-spoken-digit-dataset fsdd k-nearest-neighbors knn long-short-term-memory lstm mel-frequency-cepstral-coefficients mfcc-features pytorch sequence-classification variable-length-data

Last synced: 09 May 2025

https://github.com/iver56/live-audio-ml

A system that should classify audio in real-time

audio audio-classification laughter machine-learning real-time

Last synced: 27 Aug 2025

https://github.com/glefundes/misophonia-bot

🤖 Telegram bot powered by Deep Learning. Automatically assesses the safety of audios and voice messages for people suffering from misophonia.

audio audio-classification deep-learning pytorch telegram telegram-bot telegram-bot-api torchaudio

Last synced: 25 Oct 2025

https://github.com/dharness/sqwak-app

Audio classifying service

audio-classification machine-learning

Last synced: 14 Jun 2025

https://github.com/georgiosioannoucoder/vera

Voice Emotion Recognition of Audio (VERA) is an open-source project created for the Data Science track of the CUNY Tech Prep (CTP) program, Cohort 8. 🔊

audio-classification classification cnn-model data-science emotion emotion-recognition librosa machine-learning speech-emotion-recognition voice-emotion

Last synced: 09 Aug 2025

https://github.com/viig99/mixmatch-freesound

Multi label audio classification using mixmatch & a noisy loss

audio-classification freesound mix-match

Last synced: 09 Jun 2026

https://github.com/amirivojdan/chickensense

ChickenSense: A Low-Cost Deep Learning-Based Solution for Poultry Feed Consumption Monitoring Using Sound Technology

audio-classification broiler deep-learning poultry-farming precision-agriculture

Last synced: 21 May 2026

https://github.com/jesssullivan/merlinai-interpreters

Experiments, interpreter implementations, demos, data ingress tangents and lots of notes for birdsong identification machine learning

annotation-tool audio-classification birding macaulay-library

Last synced: 11 Apr 2025

https://github.com/capjamesg/awsnap.js

Navigate websites by clicking your fingers and saying the link you want to visit.

audio-classification speech-transcription tensorflow-js webaudio-api

Last synced: 18 Mar 2026

https://github.com/musevarg/ai-neural-network-classifying-guitar-distortions

Convolutional neural network to classify audio files (Python, Keras, Tensorflow) and its GUI (C#).

artificial-intelligence audio-classification c-sharp convolutional-neural-networks jupyter-notebooks keras-tensorflow python

Last synced: 09 Apr 2026

https://github.com/amgawishx/convmixer-for-audio-classification

Implementation of ConvMixer for bird audio classification instead of a conventional CNN in PyTorch.

audio-classification convmixer convolutional-neural-networks neural-network pytorch

Last synced: 16 Apr 2026

https://github.com/talkuhulk/music-genres-classification

Tensorflow implementation of music-genres-classification with InceptionResnetV2

audio-classification classification cnn-tensorflow genres-classification inception-resnet-v2 librosa python tensorflow

Last synced: 16 May 2026

https://github.com/georgiosioannoucoder/vera-deployed

Voice Emotion Recognition of Audio (VERA) is an open-source project created for the Data Science track of the CUNY Tech Prep (CTP) program, Cohort 8. This is the deployed version of VERA. 🔊

audio-classification classification cnn-model data-science emotion emotion-recognition librosa machine-learning speech-emotion-recognition voice-recognition

Last synced: 17 Apr 2026

https://github.com/rfcx/tfk-audio

Tools for TensorFlow/Keras audio recognition workflows

audio audio-analysis audio-classification audio-processing deep-learning keras tensorflow tensorflow2

Last synced: 20 Feb 2026

https://github.com/atharv-naik/whale-call-recognition-flask-app

Web app that lets you upload audio files and recognizes whether each one is a blue whale A-call or not

audio-classification audio-processing flask machine-learning

Last synced: 04 May 2026

https://github.com/maxidonkey/delphihuggingface

The Hugging Face API wrapper for Delphi leverages cutting-edge models to deliver powerful features, including object detection, music generation, text classification, sentiment analysis, image segmentation, speech-to-text transcription, and text generation.

api-wrapper audio-classification bert chatbot delphi gpt huggingface image-classification image-prompting music-generation object-detection text-classification

Last synced: 26 Jan 2026

https://github.com/datarohit/deep-audio-classification

This project aims to build a deep learning model that counts the number of Capuchinbird calls in a given audio clip. It uses audio spectrograms and a split-window approach to count the bird calls in the audio.

audio-classification audio-processing deep-learning image-classification spectrogram tensorflow

Last synced: 19 May 2026

https://github.com/anyantudre/audio-transformers-hugging-face

Explore the application of transformers to audio data in this course. Learn to tackle tasks like speech recognition, audio classification, and text-to-speech generation using cutting-edge transformer models.

audio-classification audio-processing automatic huggingface speech-recognition speech-synthesis speech-to-text

Last synced: 24 Jun 2026

https://github.com/santiviquez/noisy-human-recognition

Recognizes non-speech human sounds such as clapping, footsteps, brushing teeth, drinking/sipping, laughing, etc.

audio-classification audio-recognition librosa pytorch

Last synced: 11 May 2026

https://github.com/hridxyz/music-genre-classification

A deep learning model to classify music audio into 10 genres using Convolutional Neural Networks (CNNs). Achieved over 97% training accuracy and 90% validation accuracy.

audio-classification cnn deep-learning keras machine-learning music-genre-classification tensorflow

Last synced: 21 Jan 2026

https://github.com/natgluons/chronosense

Personalized Sleep Optimizer App, a machine learning project that analyzes sleep audio using librosa, PyTorch, and scikit-learn to detect disturbances and optimize sleep quality through personalized recommendations.

audio-analysis audio-classification audio-processing chronobiology librosa sleep-analysis sleep-research sleep-tracker torchaudio

Last synced: 26 Jun 2025

https://github.com/chayuto/yamnet-cry-distill-int8

Python ML for training a custom on-device cry model (knowledge-distilled from YAMNet, INT8, deployed on ESP32-S3)

audio-classification audioset cry-detection embedded-ml esp32 esp32-s3 int8-quantization knowledge-distillation on-device-ml tensorflow tflite tinyml yamnet

Last synced: 04 Jun 2026

https://github.com/aldomann/speech-recognition

Speech recognition and audio classification with Keras

audio-classification keras r

Last synced: 29 Apr 2026

https://github.com/cyblx/cnn_urbansound8k

Full pipeline for urban sound classification using PyTorch and the UrbanSound8K dataset. Converts audio into MEL spectrograms, applies data augmentation, and trains a CNN to recognize sounds like horns, barks, and sirens.

audio-classification covolution-neural-network deep-learning pytorch spectrogram urbansound8k

Last synced: 30 Apr 2026

https://github.com/ladbaby/insrec

🎹 A Musical Instrument Recognition App Using Neural Networks.

audio-classification deep-learning time-series time-series-classification

Last synced: 02 May 2026

https://github.com/georgiosioannoucoder/vera-deployed-v2

Voice Emotion Recognition of Audio (VERA) is an open-source project created for the Data Science track of the CUNY Tech Prep (CTP) program, Cohort 8. This is the 2nd deployed version of VERA. 🔊

audio-classification classification cnn-model data-science emotion emotion-recognition librosa machine-learning speech-emotion-recognition voice-emotion

Last synced: 16 May 2026

https://github.com/shivendrra/ava

building AVA from ex-machina; a lightweight multi-modal system from scratch, just for learning & experimentation

audio-classification audio-engine audio-transformers large-language-models llm machine-learning swin-transformer transformer vision vision-engine vision-models vision-transformer

Last synced: 31 Mar 2025

https://github.com/daniel-furman/audio-classification-lesson

An audio deep learning mini-lesson from the 2021 DAT/Artathon fellowship.

ai audio-classification deep-learning sound-event-detection vision-transformer

Last synced: 26 Jul 2025

https://github.com/zulhaditya/musicari

The Music Recognition program uses the Librosa Library and the SHA256 Hashing Method

audio-classification audio-recognition machine-learning python

Last synced: 11 Jun 2025

https://github.com/andercruz/audio-classification-neural-networks-cnn

This project explores various approaches for audio classification using neural networks with TensorFlow and Keras. The notebook demonstrates the complete process from data loading and preprocessing to model building, training, evaluation, and inference.

audio-classification audio-processing cnn deep-learning environmental-sound-classification keras machine-learning neural-networks spectrogram-analysis speech-recognition tensorflow transfer-learning yamnet

Last synced: 29 Apr 2026

https://github.com/ml13571/audio-classifier

Classification model to detect water, alarm and other sounds, including training, inference and dataset

ai audio-classification audio-processing classification ml

Last synced: 12 Jun 2026

https://github.com/edward62740/lpnn

Always-on μW neural network for audio classification

aiot audio-classification edge-ai

Last synced: 24 Jun 2025

https://github.com/stephanielees/birdsoundclassification

Sound classification model for distinguishing five birds

audio-classification dtw shapedtw

Last synced: 17 May 2026

https://github.com/tirovo/emotionai-voice

An AI-powered application for detecting human emotions

audio-classification cnn deep-learning emotion-detection pytorch

Last synced: 09 Sep 2025

https://github.com/usmana5809/quran-recitation-audio-classification

The Quran Recitation Audio Classification project aims to classify different recitations of the Quran using machine learning techniques. It involves preprocessing audio data, extracting features, training models, and evaluating their performance.

audio-classification classification-model islamic-studies librosa machine-learning python quran scikit-learn

Last synced: 20 Mar 2025

https://github.com/sadmansakib93/arctic-mammal-classification

Deep learning model for acoustic detection and classification of bowhead whales. This repo contains code for the binary classification model; the two classes are bowhead whales (BH) and other/background.

annotations audio-analysis audio-classification classification deep-learning deep-neural-networks machine-learning python spectrogram tensorflow

Last synced: 01 May 2026