{"id":13396958,"url":"https://github.com/tyiannak/pyAudioAnalysis","last_synced_at":"2025-03-13T23:31:58.466Z","repository":{"id":20119419,"uuid":"23389271","full_name":"tyiannak/pyAudioAnalysis","owner":"tyiannak","description":"Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications","archived":false,"fork":false,"pushed_at":"2024-03-31T17:27:35.000Z","size":171627,"stargazers_count":5999,"open_issues_count":203,"forks_count":1212,"subscribers_count":208,"default_branch":"master","last_synced_at":"2025-03-11T20:11:19.268Z","etag":null,"topics":["audio","audio-analysis-tasks","audio-data","machine-learning","pyaudioanalysis","python","signal-processing"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"apache-2.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/tyiannak.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":".github/FUNDING.yml","license":"LICENSE.md","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null},"funding":{"github":"tyiannak"}},"created_at":"2014-08-27T12:43:13.000Z","updated_at":"2025-03-10T08:40:50.000Z","dependencies_parsed_at":"2022-07-07T18:22:28.801Z","dependency_job_id":"a0b53b5e-3550-4fe1-96b0-e3b416c7eab4","html_url":"https://github.com/tyiannak/pyAudioAnalysis","commit_stats":{"total_commits":723,"total_committers":28,"mean_commits":"25.821428571428573","dds":"0.11065006915629327","last_synced_commit":"ae00ebd446d497dbbb654c40a71810e7fdf2cc1b"},"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/tyiannak%2FpyAudioAnalysis","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/tyiannak%2FpyAudioAnalysis/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/tyiannak%2FpyAudioAnalysis/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/tyiannak%2FpyAudioAnalysis/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/tyiannak","download_url":"https://codeload.github.com/tyiannak/pyAudioAnalysis/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":243500226,"owners_count":20300760,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["audio","audio-analysis-tasks","audio-data","machine-learning","pyaudioanalysis","python","signal-processing"],"created_at":"2024-07-30T18:01:08.420Z","updated_at":"2025-03-13T23:31:58.457Z","avatar_url":"https://github.com/tyiannak.png","language":"Python","readme":"\n# \u003cimg src=\"icon.png\" align=\"left\" height=\"130\"/\u003e A Python library for audio feature extraction, classification, segmentation and applications\n\n*This is general info. Click [here](https://github.com/tyiannak/pyAudioAnalysis/wiki) for the complete wiki and [here](https://hackernoon.com/audio-handling-basics-how-to-process-audio-files-using-python-cli-jo283u3y) for a more generic intro to audio data handling*\n\n## News\n * [2022-01-01] If you are not interested in training audio models from your own data, you can check the [Deep Audio API](https://labs-repos.iit.demokritos.gr/MagCIL/deep_audio_api.html), were you can directly send audio data and receive predictions with regards to the respective audio content (speech vs silence, musical genre, speaker gender, etc). \n * [2021-08-06] [deep-audio-features](https://github.com/tyiannak/deep_audio_features) deep audio classification and feature extraction using CNNs and Pytorch \n * Check out [paura](https://github.com/tyiannak/paura) a Python script for realtime recording and analysis of audio data\n\n## General\npyAudioAnalysis is a Python library covering a wide range of audio analysis tasks. Through pyAudioAnalysis you can:\n * Extract audio *features* and representations (e.g. mfccs, spectrogram, chromagram)\n * *Train*, parameter tune and *evaluate* classifiers of audio segments\n * *Classify* unknown sounds\n * *Detect* audio events and exclude silence periods from long recordings\n * Perform *supervised segmentation* (joint segmentation - classification)\n * Perform *unsupervised segmentation* (e.g. speaker diarization) and extract audio *thumbnails*\n * Train and use *audio regression* models (example application: emotion recognition)\n * Apply dimensionality reduction to *visualize* audio data and content similarities\n\n## Installation\n * Clone the source of this library: `git clone https://github.com/tyiannak/pyAudioAnalysis.git`\n * Install dependencies: `pip install -r ./requirements.txt `\n * Install using pip: `pip install -e .`\n\n## An audio classification example\n\u003e More examples and detailed tutorials can be found [at the wiki](https://github.com/tyiannak/pyAudioAnalysis/wiki)\n\npyAudioAnalysis provides easy-to-call wrappers to execute audio analysis tasks. Eg, this code first trains an audio segment classifier, given a set of WAV files stored in folders (each folder representing a different class) and then the trained classifier is used to classify an unknown audio WAV file\n\n```python\nfrom pyAudioAnalysis import audioTrainTest as aT\naT.extract_features_and_train([\"classifierData/music\",\"classifierData/speech\"], 1.0, 1.0, aT.shortTermWindow, aT.shortTermStep, \"svm\", \"svmSMtemp\", False)\naT.file_classification(\"data/doremi.wav\", \"svmSMtemp\",\"svm\")\n```\n\n\u003eResult:\n(0.0, array([ 0.90156761,  0.09843239]), ['music', 'speech'])\n\nIn addition, command-line support is provided for all functionalities. E.g. the following command extracts the spectrogram of an audio signal stored in a WAV file: `python audioAnalysis.py fileSpectrogram -i data/doremi.wav`\n\n## Further reading\n\nApart from this README file, to bettern understand how to use this library one should read the following:\n  * [Audio Handling Basics: Process Audio Files In Command-Line or Python](https://hackernoon.com/audio-handling-basics-how-to-process-audio-files-using-python-cli-jo283u3y), if you want to learn how to handle audio files from command line, and some basic programming on audio signal processing. Start with that if you don't know anything about audio. \n  * [Intro to Audio Analysis: Recognizing Sounds Using Machine Learning](https://hackernoon.com/intro-to-audio-analysis-recognizing-sounds-using-machine-learning-qy2r3ufl) This goes a bit deeper than the previous article, by providing a complete intro to theory and practice of audio feature extraction, classification and segmentation (includes many Python examples).\n * [The library's wiki](https://github.com/tyiannak/pyAudioAnalysis/wiki)\n * [How to Use Machine Learning to Color Your Lighting Based on Music Mood](https://hackernoon.com/how-to-use-machine-learning-to-color-your-lighting-based-on-music-mood-bi163u8l). An interesting use-case of using this lib to train a real-time music mood estimator.\n  * A more general and theoretic description of the adopted methods (along with several experiments on particular use-cases) is presented [in this publication](http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0144610). *Please use the following citation when citing pyAudioAnalysis in your research work*:\n```python\n@article{giannakopoulos2015pyaudioanalysis,\n  title={pyAudioAnalysis: An Open-Source Python Library for Audio Signal Analysis},\n  author={Giannakopoulos, Theodoros},\n  journal={PloS one},\n  volume={10},\n  number={12},\n  year={2015},\n  publisher={Public Library of Science}\n}\n```\n\nFor Matlab-related audio analysis material check  [this book](http://www.amazon.com/Introduction-Audio-Analysis-MATLAB%C2%AE-Approach/dp/0080993885).\n\n## Author\n\u003cimg src=\"https://tyiannak.github.io/files/3.JPG\" align=\"left\" height=\"100\"/\u003e\n\n[Theodoros Giannakopoulos](https://tyiannak.github.io),\nPrincipal Researcher of Multimodal Machine Learning at the [Multimedia Analysis Group of the Computational Intelligence Lab (MagCIL)](https://labs-repos.iit.demokritos.gr/MagCIL/index.html) of the Institute of Informatics and Telecommunications, of the National Center for Scientific Research \"Demokritos\"\n\n","funding_links":["https://github.com/sponsors/tyiannak"],"categories":["Audio","How-To Analyze Audio","Software","Python","资源列表","Tools","音频处理","Natural Language Processing","音频","HarmonyOS","语音识别与合成_其他","Audio [🔝](#readme)","python","Audio ##","📚 فهرست","Audio Related Packages","Awesome Python","Music","Analysis \u0026 Feature Extraction"],"sub_categories":["Framework","音频","Coming soon...","Speech \u0026 Audio","Windows Manager","网络服务_其他","SSL+","کار با فایل های صوتی","Audio","Drone Frames"],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Ftyiannak%2FpyAudioAnalysis","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Ftyiannak%2FpyAudioAnalysis","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Ftyiannak%2FpyAudioAnalysis/lists"}