{"id":18946864,"url":"https://github.com/klaudiozdrava/digit-recognition","last_synced_at":"2025-08-30T20:11:09.395Z","repository":{"id":257481883,"uuid":"553279724","full_name":"klaudiozdrava/Digit-Recognition","owner":"klaudiozdrava","description":"A Python project that was developed as a university assignment for the subject of Signal Processing and Voice Recognition. The goal of this assignment was to make an ASR system that predict digits from a voice signal using Neural Network.","archived":false,"fork":false,"pushed_at":"2022-10-18T03:35:11.000Z","size":16,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-01-01T00:43:22.225Z","etag":null,"topics":["audio-processing","digits-recognition","neural-network","python","pytorch","spectrogram"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/klaudiozdrava.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2022-10-18T01:37:55.000Z","updated_at":"2022-10-18T03:33:54.000Z","dependencies_parsed_at":"2024-09-20T13:00:50.397Z","dependency_job_id":null,"html_url":"https://github.com/klaudiozdrava/Digit-Recognition","commit_stats":null,"previous_names":["klaudiozdrava/digit-recognition"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/klaudiozdrava%2FDigit-Recognition","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/klaudiozdrava%2FDigit-Recognition/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/klaudiozdrava%2FDigit-Recognition/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/klaudiozdrava%2FDigit-Recognition/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/klaudiozdrava","download_url":"https://codeload.github.com/klaudiozdrava/Digit-Recognition/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":239946876,"owners_count":19723014,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["audio-processing","digits-recognition","neural-network","python","pytorch","spectrogram"],"created_at":"2024-11-08T13:08:13.365Z","updated_at":"2025-02-21T02:41:43.713Z","avatar_url":"https://github.com/klaudiozdrava.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Digit-Recognition\n\nA Python project that was developed as a university assignment for the subject of Signal Processing and Voice Recognition.\nThe goal of this assignment was to make an ASR system that predict digits from a voice signal using Neural Network.\nThe dataset that was used for the purpose of this assigment is AudioMNIST.\n\nThe steps of the algorithm are :\n1) We train a simple Feed Forward Neural Network model using only Mel Spectogram as features. \n2) Seperate foreground from background information using REPET algorithm.\n3) In the foreground signal,we extract digits information using sliding window technique.\n4) Finally we feed our model with these digits and make predictions.\n\nTo run this project :\n1) You should download the necessary libraries from requirement.txt and also the audio dataset.\n2) Run Dataset.py first and after run Network.py.\n3) Finally you should run the prediction.py and insert the file path when prompted.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fklaudiozdrava%2Fdigit-recognition","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fklaudiozdrava%2Fdigit-recognition","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fklaudiozdrava%2Fdigit-recognition/lists"}