{"id":26664420,"url":"https://github.com/shahules786/mayavoz","last_synced_at":"2025-04-09T07:06:21.532Z","repository":{"id":61842512,"uuid":"526827145","full_name":"shahules786/mayavoz","owner":"shahules786","description":"Pytorch based speech enhancement toolkit.","archived":false,"fork":false,"pushed_at":"2024-02-29T13:32:25.000Z","size":1148,"stargazers_count":336,"open_issues_count":11,"forks_count":26,"subscribers_count":12,"default_branch":"main","last_synced_at":"2025-04-02T04:08:25.175Z","etag":null,"topics":["audio-enhancement","deep-learning","denoiser","pretrained-models","python","pytorch","speech-enhancement"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/shahules786.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":"CONTRIBUTING.md","funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2022-08-20T04:44:35.000Z","updated_at":"2025-03-03T10:10:29.000Z","dependencies_parsed_at":"2024-11-16T03:26:42.576Z","dependency_job_id":"64e6660f-2820-4be6-9a7c-328038fe7560","html_url":"https://github.com/shahules786/mayavoz","commit_stats":null,"previous_names":["shahules786/enhancer"],"tags_count":1,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/shahules786%2Fmayavoz","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/shahules786%2Fmayavoz/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/shahules786%2Fmayavoz/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/shahules786%2Fmayavoz/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/shahules786","download_url":"https://codeload.github.com/shahules786/mayavoz/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":247994119,"owners_count":21030050,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["audio-enhancement","deep-learning","denoiser","pretrained-models","python","pytorch","speech-enhancement"],"created_at":"2025-03-25T16:28:15.987Z","updated_at":"2025-04-09T07:06:21.514Z","avatar_url":"https://github.com/shahules786.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"\u003cp align=\"center\"\u003e\n  \u003cimg src=\"https://user-images.githubusercontent.com/25312635/195514652-e4526cd1-1177-48e9-a80d-c8bfdb95d35f.png\" /\u003e\n\u003c/p\u003e\n\n![GitHub Workflow Status](https://img.shields.io/github/actions/workflow/status/shahules786/mayavoz/ci.yaml?branch=main)\n![GitHub](https://img.shields.io/github/license/shahules786/enhancer)\n![GitHub issues](https://img.shields.io/github/issues/shahules786/enhancer?logo=GitHub)\n![GitHub Repo stars](https://img.shields.io/github/stars/shahules786/enhancer?style=social)\n\nmayavoz is a Pytorch-based opensource toolkit for speech enhancement. It is designed to save time for audio practioners \u0026 researchers. It provides easy to use pretrained speech enhancement models and facilitates highly customisable model training.\n\n| **[Quick Start](#quick-start-fire)** | **[Installation](#installation)** | **[Tutorials](https://github.com/shahules786/enhancer/tree/main/notebooks)** | **[Available Recipes](#recipes)** | **[Demo](#demo)**\n## Key features :key:\n\n* Various pretrained models nicely integrated with [huggingface hub](https://huggingface.co/docs/hub/index) :hugs: that users can select and use without any hastle.\n* :package: Ability to train and validate your own custom speech enhancement models with just under 10 lines of code!\n* :magic_wand: A command line tool that facilitates training of highly customisable speech enhacement models from the terminal itself!\n* :zap: Supports multi-gpu training integrated with [Pytorch Lightning](https://pytorchlightning.ai/).\n* :shield: data augmentations integrated using [torch-augmentations](https://github.com/asteroid-team/torch-audiomentations)\n\n\n## Demo\n\nNoisy speech followed by enhanced version.\n\nhttps://user-images.githubusercontent.com/25312635/203756185-737557f4-6e21-4146-aa2c-95da69d0de4c.mp4\n\n\n\n## Quick Start :fire:\n``` python\nfrom mayavoz.models import Mayamodel\n\nmodel = Mayamodel.from_pretrained(\"shahules786/mayavoz-waveunet-valentini-28spk\")\nmodel.enhance(\"noisy_audio.wav\")\n```\n\n## Recipes\n\n| Model     | Dataset      | STOI    | PESQ  | URL                           |\n| :---:     |  :---:       | :---:   | :---: | :---:                         |\n| WaveUnet  | Valentini-28spk   | 0.836   | 2.78  |  shahules786/mayavoz-waveunet-valentini-28spk      |\n| Demucs    | Valentini-28spk   | 0.961   | 2.56  |  shahules786/mayavoz-demucs-valentini-28spk       |\n| DCCRN     | Valentini-28spk   | 0.724   | 2.55  |  shahules786/mayavoz-dccrn-valentini-28spk         |\n| Demucs     | MS-SNSD-20hrs  | 0.56 | 1.26  | shahules786/mayavoz-demucs-ms-snsd-20       |\n\nTest scores are based on respective test set associated with train dataset.\n\n**See [tutorials](/notebooks/) to train your custom model**\n\n## Installation\nOnly Python 3.8+ is officially supported (though it might work with Python 3.7)\n\n- With Pypi\n```\npip install mayavoz\n```\n\n- With conda\n\n```\nconda env create -f environment.yml\nconda activate mayavoz\n```\n\n- From source code\n```\ngit clone url\ncd mayavoz\npip install -e .\n```\n\n## Support\n\nFor commercial enquiries and scientific consulting, please [contact me](https://shahules786.github.io/).\n\n### Acknowledgements\nSincere gratitude to [AMPLYFI](https://amplyfi.com/) for supporting this project.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fshahules786%2Fmayavoz","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fshahules786%2Fmayavoz","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fshahules786%2Fmayavoz/lists"}