{"id":16978172,"url":"https://github.com/adkcodexd/anime-audio-dataset-maker","last_synced_at":"2025-06-24T14:36:27.330Z","repository":{"id":213309496,"uuid":"731690041","full_name":"ADKcodeXD/Anime-Audio-Dataset-Maker","owner":"ADKcodeXD","description":"Anime audio speaker recognize and classify. A easy python script for vits data set make.","archived":false,"fork":false,"pushed_at":"2024-01-27T15:00:52.000Z","size":29941,"stargazers_count":3,"open_issues_count":0,"forks_count":0,"subscribers_count":2,"default_branch":"master","last_synced_at":"2025-01-26T16:44:16.686Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/ADKcodeXD.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null}},"created_at":"2023-12-14T16:40:14.000Z","updated_at":"2024-09-28T12:25:26.000Z","dependencies_parsed_at":"2024-01-27T16:34:38.985Z","dependency_job_id":null,"html_url":"https://github.com/ADKcodeXD/Anime-Audio-Dataset-Maker","commit_stats":null,"previous_names":["adkcodexd/autoslice","adkcodexd/anime-audio-dataset-maker"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ADKcodeXD%2FAnime-Audio-Dataset-Maker","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ADKcodeXD%2FAnime-Audio-Dataset-Maker/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ADKcodeXD%2FAnime-Audio-Dataset-Maker/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ADKcodeXD%2FAnime-Audio-Dataset-Maker/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/ADKcodeXD","download_url":"https://codeload.github.com/ADKcodeXD/Anime-Audio-Dataset-Maker/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":244875046,"owners_count":20524591,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-10-14T01:42:43.333Z","updated_at":"2025-03-21T22:15:21.018Z","avatar_url":"https://github.com/ADKcodeXD.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Anime Audio DataSet Maker \n\n- [English README](README_en.md)\n- [中文说明](README.md)\n\n## Introduce\n\n此项目旨在为番剧提供一个快速高效提取角色音频的解决方案。\n\nWEBUI下载链接：\n\u003ca href=\"https://github.com/ADKcodeXD/Anime-Audio-Dataset-Maker-WEBUI/releases\"\u003eAnime-Audio-Dataset-Maker-WEBUI Release\u003c/a\u003e\n\n## 安装\u0026使用\n- 第一个方法 整合包一键下载使用方法:\n链接: https://pan.baidu.com/s/1T9GbDo6enrV__G0j7pXbwQ?pwd=s556 提取码: s556\n下载后使用 整合包使用这个.bat 即可\n\n- 安装使用 首先先安装pytorch，\n这个需要根据系统的cuda版本来进行安装\n以我的Cuda11.8为例\n\n```sh\npip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118\n```\n\n在这里\u003ca href=\"https://pytorch.org/get-started/locally/\"\u003epytorch\u003c/a\u003e 选择你对应的版本并运行相对应的命令即可~\n\n- 然后安装该仓库需要的依赖\n```sh\npip3 install -r requirement.txt\n```\n\n- 下载webui\n\u003ca href=\"https://github.com/ADKcodeXD/Anime-Audio-Dataset-Maker-WEBUI/releases\"\u003eAnime-Audio-Dataset-Maker-WEBUI Release\u003c/a\u003e\n下载最新版本的webui解压至该项目根目录下\n\n- 运行launch.bat\n\n项目会运行在7896端口\n\n## How it work\n\n- 通过pyannote.audio对原音频进行说话人的识别和切割\n- 通过字幕时间线对原音频进行切割\n- 通过匹配检测最佳匹配的说话人\n- 分类到各个说话人的文件夹中\n\n## WebUI操作流程\n\n- 开始预处理音频\n![Alt text](tutorial/1.gif)\n\n...\n\n## Feature\n\n- Support automaticly split long audio by each speaker\n- Support sub upload and slice by sub timeline.\n- Support edit the sub text and export it by bert-vits config\n- Support split ever single audio (WebUI)\n- Support merge audio with interval (WebUI)\n- Support management folders or files (WebUI)\n- Support use Arrow key to handle data (WebUI)\n- Support batch rename (WebUI)\n- Support batch move or remove (WebUI)\n\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fadkcodexd%2Fanime-audio-dataset-maker","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fadkcodexd%2Fanime-audio-dataset-maker","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fadkcodexd%2Fanime-audio-dataset-maker/lists"}