{"id":18004711,"url":"https://github.com/oscarknagg/voicemap","last_synced_at":"2025-05-02T22:31:07.926Z","repository":{"id":41081501,"uuid":"147032769","full_name":"oscarknagg/voicemap","owner":"oscarknagg","description":"Identifying people from small audio fragments","archived":false,"fork":false,"pushed_at":"2020-04-10T08:03:57.000Z","size":3337,"stargazers_count":170,"open_issues_count":9,"forks_count":73,"subscribers_count":6,"default_branch":"master","last_synced_at":"2025-04-07T07:42:52.145Z","etag":null,"topics":["convolutional-neural-networks","machine-learning","speaker-identification","speaker-recognition"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/oscarknagg.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2018-09-01T21:04:00.000Z","updated_at":"2025-01-15T20:47:02.000Z","dependencies_parsed_at":"2022-08-10T01:42:42.045Z","dependency_job_id":null,"html_url":"https://github.com/oscarknagg/voicemap","commit_stats":null,"previous_names":[],"tags_count":1,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/oscarknagg%2Fvoicemap","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/oscarknagg%2Fvoicemap/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/oscarknagg%2Fvoicemap/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/oscarknagg%2Fvoicemap/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/oscarknagg","download_url":"https://codeload.github.com/oscarknagg/voicemap/tar.gz/refs/he
ads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":252116140,"owners_count":21697318,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["convolutional-neural-networks","machine-learning","speaker-identification","speaker-recognition"],"created_at":"2024-10-30T00:15:33.011Z","updated_at":"2025-05-02T22:31:05.483Z","avatar_url":"https://github.com/oscarknagg.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# voicemap\n\nThis repository contains code to build deep learning models to identify\ndifferent speakers based on audio samples containing their voice.\n\nThe eventual aim is for this repository to become a pip-installable\npython package for quickly and easily performing speaker identification\nrelated tasks.\n\n**This tensorflow/Keras/python2.7 branch is discontinued. 
Work is\ncontinuing on the pytorch-python-3.6 branch which will become the\nmaster branch.**\n\n## Instructions\n#### Requirements\nMake a new virtualenv and install requirements from `requirements.txt`\nwith the following command.\n```\npip install -r requirements.txt\n```\nThis project was written in Python 2.7.12 so I cannot guarantee it works\non any other version.\n\n#### Data\nGet training data here: http://www.openslr.org/12\n- train-clean-100.tar.gz\n- train-clean-360.tar.gz\n- dev-clean.tar.gz\n\nPlace the unzipped training data into the `data/` folder so the file\nstructure is as follows:\n```\ndata/\n    LibriSpeech/\n        dev-clean/\n        train-clean-100/\n        train-clean-360/\n        SPEAKERS.TXT\n```\n\nPlease use the `SPEAKERS.TXT` supplied in the repo as I've made a few\ncorrections to the one found at openslr.org.\n\n#### Run tests\n\nThis requires the LibriSpeech data.\n```\npython -m unittest tests.tests\n```\n\n## Contents\n### voicemap\nThis package contains re-usable code for defining network architectures,\ninteracting with datasets and many utility functions.\n\n### experiments\nThis package contains experiments in the form of python scripts.\n\n### notebooks\nThis folder contains Jupyter notebooks used for interactive\nvisualisation and analysis.","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Foscarknagg%2Fvoicemap","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Foscarknagg%2Fvoicemap","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Foscarknagg%2Fvoicemap/lists"}