{"id":13827278,"url":"https://github.com/SuperKogito/pydiogment","last_synced_at":"2025-07-09T03:31:57.742Z","repository":{"id":40969654,"uuid":"226846055","full_name":"SuperKogito/pydiogment","owner":"SuperKogito","description":":mega: Python library for audio augmentation","archived":false,"fork":false,"pushed_at":"2023-07-06T21:48:22.000Z","size":92989,"stargazers_count":84,"open_issues_count":13,"forks_count":16,"subscribers_count":8,"default_branch":"master","last_synced_at":"2025-07-06T11:48:02.886Z","etag":null,"topics":["audio","audio-effects","audio-processing","augmentation","deep-learning","machine-learning","python","sound","sound-processing"],"latest_commit_sha":null,"homepage":"https://superkogito.github.io/pydiogment/","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"bsd-3-clause","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/SuperKogito.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":"CONTRIBUTING.md","funding":null,"license":"LICENSE","code_of_conduct":"CODE_OF_CONDUCT.md","threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null}},"created_at":"2019-12-09T10:34:32.000Z","updated_at":"2025-04-23T11:20:49.000Z","dependencies_parsed_at":"2024-01-08T12:17:49.899Z","dependency_job_id":null,"html_url":"https://github.com/SuperKogito/pydiogment","commit_stats":{"total_commits":123,"total_committers":5,"mean_commits":24.6,"dds":0.3414634146341463,"last_synced_commit":"074543dc9483b450653f8a00c8279bf1eb873199"},"previous_names":[],"tags_count":4,"template":false,"template_full_name":null,"purl":"pkg:github/SuperKogito/pydiogment","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/SuperKogito%2Fpydiogment","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/SuperKogito%2Fpydiogment/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/SuperKogito%2Fpydiogment/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/SuperKogito%2Fpydiogment/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/SuperKogito","download_url":"https://codeload.github.com/SuperKogito/pydiogment/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/SuperKogito%2Fpydiogment/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":263980063,"owners_count":23538917,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["audio","audio-effects","audio-processing","augmentation","deep-learning","machine-learning","python","sound","sound-processing"],"created_at":"2024-08-04T09:01:53.226Z","updated_at":"2025-07-09T03:31:57.724Z","avatar_url":"https://github.com/SuperKogito.png","language":"Python","funding_links":[],"categories":["Python","Audio Related Packages"],"sub_categories":[],"readme":"\u003cp align=\"center\"\u003e\n\u003cimg src=\"https://github.com/SuperKogito/pydiogment/blob/master/docs/icon.png?raw=true\"\u003e\n\u003c/p\u003e\n\n:bell:\tPydiogment\n==========\n\n[![Build Status](https://travis-ci.org/SuperKogito/pydiogment.svg?branch=master)](https://travis-ci.org/SuperKogito/pydiogment) [![Build status](https://ci.appveyor.com/api/projects/status/bnxaa6dw82cyhl5h?svg=true)](https://ci.appveyor.com/project/SuperKogito/pydiogment) [![Documentation Status](https://readthedocs.org/projects/pydiogment/badge/?version=latest)](https://pydiogment.readthedocs.io/en/latest/?badge=latest) [![License](https://img.shields.io/badge/license-BSD%203--Clause%20License%20(Revised)%20-blue)](https://github.com/SuperKogito/pydiogment/blob/master/LICENSE) [![Python](https://img.shields.io/badge/python-3.5%20%7C%203.6%20%7C%203.7-blue)](https://www.python.org/doc/versions/) [![Coverage Status](https://codecov.io/gh/SuperKogito/pydiogment/graph/badge.svg)](https://codecov.io/gh/SuperKogito/pydiogment) [![Coverage Status](https://coveralls.io/repos/github/SuperKogito/pydiogment/badge.svg?branch=master)](https://coveralls.io/github/SuperKogito/pydiogment?branch=master) [![CodeFactor](https://www.codefactor.io/repository/github/superkogito/pydiogment/badge/master)](https://www.codefactor.io/repository/github/superkogito/pydiogment/overview/master)\n\n**Pydiogment** aims to simplify audio augmentation. It generates multiple audio files based on a starting mono audio file. The library can generates files with higher speed, slower, and different tones etc.\n\n:inbox_tray: Installation\n============\n\nDependencies\n------------\n\n**Pydiogment** requires:\n\n-\t[Python](https://www.python.org/download/releases/3.0/) (\u003e= 3.5)  \n-\t[NumPy](https://numpy.org/) (\u003e= 1.17.2)\n  \n-\t[SciPy](https://www.scipy.org/)  (\u003e= 1.3.1)\n\n- [FFmpeg](https://www.ffmpeg.org/)\n\n### On Linux\nOn Linux you can use the following commands to get the libraries:\n- Numpy: `pip install numpy`\n- Scipy:  `pip install scipy`\n- FFmpeg: `sudo apt install ffmpeg`\n\n### On Windows\nOn Windows you can use the following installation binaries:\n- Numpy: https://www.lfd.uci.edu/~gohlke/pythonlibs/#numpy or if you have Python already installed you can use install it using `pip3 install numpy`\n- Scipy: https://www.lfd.uci.edu/~gohlke/pythonlibs/#scipy\n- FFmpeg: https://ffmpeg.org/download.html#build-windows\n\n### On MacOS\nOn MacOs, use homebrew to install the packages:\n- Numpy:  `brew install numpy --with-python3`\n- Scipy:  You need to first install a compilation tool like Gfortran using homebrew `brew install gfortran` when it's done, install Scipy `pip install scipy`\nfor more information and guidelines you can check this link: https://github.com/scipy/scipy/blob/master/INSTALL.rst.txt#mac-os-x\n- FFmpeg: `brew install ffmpeg`\n\n\nInstallation\n-------------\nIf you already have a working installation of [NumPy](https://numpy.org/) and [SciPy](https://www.scipy.org/) , you can simply install **Pydiogment** using pip:\n\n```\npip install pydiogment\n```\nTo update an existing version of  **Pydiogment**, use:\n```\npip install -U pydiogment\n```\n\n:bulb:  How to use\n==========\n\n- ## Amplitude related augmentation\n  - ### Apply a fade in and fade out effect\n    ```python3\n    from pydiogment.auga import fade_in_and_out\n\n    test_file = \"path/test.wav\"\n    fade_in_and_out(test_file)\n    ```\n\n  - ### Apply gain to file\n    ```python3\n    from pydiogment.auga import apply_gain\n\n    test_file = \"path/test.wav\"\n    apply_gain(test_file, -100)\n    apply_gain(test_file, -50)\n    ```\n\n  - ### Add Random Gaussian Noise based on SNR to file\n    ```python3\n    from pydiogment.auga import add_noise\n\n    test_file = \"path/test.wav\"\n    add_noise(test_file, 10)\n    ```\n\n\n- ## Frequency related augmentation\n  - ### Change file tone\n    ```python3\n    from pydiogment.augf import change_tone\n\n    test_file = \"path/test.wav\"\n    change_tone(test_file, 0.9)\n    change_tone(test_file, 1.1)\n    ```\n\n- ## Time related augmentation    \n  - #### Slow-down/ speed-up file\n    ```python3\n    from pydiogment.augt import slowdown, speed\n\n    test_file = \"path/test.wav\"\n    slowdown(test_file, 0.8)\n    speed(test_file, 1.2)\n    ```\n\n  - ### Apply random cropping to the file\n    ```python3\n    from pydiogment.augt import random_cropping\n\n    test_file = \"path/test.wav\"\n    random_cropping(test_file, 1)\n    ```\n\n  - ### Change shift data on the time axis in a certain direction\n    ```python3\n    from pydiogment.augt import shift_time\n\n    test_file = \"path/test.wav\"\n    shift_time(test_file, 1, \"right\")\n    shift_time(test_file, 1, \"left\")\n    ```\n- ## Audio files format \nThis library currently supports mono WAV files only.\n\n:bookmark_tabs:  Documentation  \n==============\nA thorough documentation of the library is available under [pydiogment.readthedocs.io](https://pydiogment.readthedocs.io/en/latest/index.html).\n\n:construction_worker:\t Contributing and bugs report      \n============\n\nContributions are welcome and encouraged. To learn more about how to contribute to **Pydiogment** please refer to the [Contributing guidelines](https://github.com/SuperKogito/pydiogment/blob/master/CONTRIBUTING.md)\n\nTo report bugs, request a feature or just ask for help you can refer to the [issues](https://github.com/SuperKogito/pydiogment/issuesif) section.\nBefore reporting a bug please make sure it is not addressed by an older issue and make sure to add your operating system type, its version number and the versions of the dependencies used.\n\n:tada:\tAcknowledgment and credits     \n============================\n- The test file used in the pytests is [OSR_us_000_0060_8k.wav](https://www.voiptroubleshooter.com/open_speech/american/OSR_us_000_0060_8k.wav) from the [Open Speech Repository](https://www.voiptroubleshooter.com/open_speech/american.html).\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2FSuperKogito%2Fpydiogment","html_url":"https://awesome.ecosyste.ms/projects/github.com%2FSuperKogito%2Fpydiogment","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2FSuperKogito%2Fpydiogment/lists"}