{"id":15159324,"url":"https://github.com/sagartr/deep-audio-classifier-using-machine-learning","last_synced_at":"2026-01-21T16:33:38.449Z","repository":{"id":250537350,"uuid":"834758511","full_name":"SAGARTR/Deep-Audio-Classifier-using-Machine-Learning","owner":"SAGARTR","description":"Languages Used: Python                                  Developed and implemented a deep audio classifier using CNNs and LSTMs to accurately  categorize diverse audio signals, achieving high accuracy and robustness. Utilized Python and  TensorFlow for model development and training, incorporating data augmentation techniques  to enhance performance","archived":false,"fork":false,"pushed_at":"2024-07-28T09:24:55.000Z","size":107,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-02-13T18:49:31.197Z","etag":null,"topics":["audio-processing","capuchin","librosa","python","tensorflow","tensorflow-models"],"latest_commit_sha":null,"homepage":"","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/SAGARTR.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2024-07-28T09:16:50.000Z","updated_at":"2024-07-28T09:29:53.000Z","dependencies_parsed_at":"2024-07-28T09:39:52.916Z","dependency_job_id":null,"html_url":"https://github.com/SAGARTR/Deep-Audio-Classifier-using-Machine-Learning","commit_stats":null,"previous_names":["sagartr/deep-audio-classifier-using-machine-learning"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/SAGARTR%2FDeep-Audio-Classifier-using-Machine-Learning","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/SAGARTR%2FDeep-Audio-Classifier-using-Machine-Learning/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/SAGARTR%2FDeep-Audio-Classifier-using-Machine-Learning/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/SAGARTR%2FDeep-Audio-Classifier-using-Machine-Learning/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/SAGARTR","download_url":"https://codeload.github.com/SAGARTR/Deep-Audio-Classifier-using-Machine-Learning/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":247687208,"owners_count":20979419,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["audio-processing","capuchin","librosa","python","tensorflow","tensorflow-models"],"created_at":"2024-09-26T21:04:39.304Z","updated_at":"2026-01-21T16:33:38.412Z","avatar_url":"https://github.com/SAGARTR.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Deep-Audio-Classifier-using-Machine-Learning\nLanguages Used: Python   \n\nDeveloped and implemented a deep audio classifier using CNNs and LSTMs to accurately  categorize diverse audio signals, achieving high accuracy and robustness. Utilized Python and  TensorFlow for model development and training, incorporating data augmentation techniques  to enhance performance.\n\n\nThe code need the data file to be downloded from kaggel : \nlink to the data : https://www.kaggle.com/datasets/kenjee/z-by-hp-unlocked-challenge-3-signal-processing\n\nKey Components:\na.\tAudio Preprocessing: Convert raw audio waveforms into spectrograms for processing by convolutional neural networks (CNNs).\n\n\nb.\tDeep Learning Model: Train a CNN or recurrent neural network (RNN) on preprocessed audio data to learn features and classify audio into categories like speech, music or environmental sounds.\nc.\tSliding Window Classification: Divide longer audio clips into shorter segments, apply the trained model to each segment, and aggregate the individual classifications to determine overall density of target audio events.\nd.\tModel Training and Optimization: Train the model on a diverse dataset using techniques like data augmentation and regularization to improve generalization.\n\nBy leveraging the power of deep learning, the deep audio classifier can significantly enhance the efficiency and accuracy of various audio processing tasks, making it a valuable tool in a wide range of industries.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fsagartr%2Fdeep-audio-classifier-using-machine-learning","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fsagartr%2Fdeep-audio-classifier-using-machine-learning","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fsagartr%2Fdeep-audio-classifier-using-machine-learning/lists"}