{"id":15895662,"url":"https://github.com/jrmeyer/interspeech-2018","last_synced_at":"2026-01-17T22:51:58.698Z","repository":{"id":89745047,"uuid":"123611589","full_name":"JRMeyer/interspeech-2018","owner":"JRMeyer","description":"I submitted this paper to Interspeech 2018. The paper was not accepted. The reviewer comments are included in the repo.","archived":false,"fork":false,"pushed_at":"2018-06-22T15:15:28.000Z","size":3611,"stargazers_count":1,"open_issues_count":0,"forks_count":0,"subscribers_count":3,"default_branch":"master","last_synced_at":"2025-02-08T09:11:13.765Z","etag":null,"topics":["interspeech2018","kaldi","multi-task-learning","rejection"],"latest_commit_sha":null,"homepage":"","language":"TeX","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/JRMeyer.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2018-03-02T17:49:41.000Z","updated_at":"2019-04-21T13:42:14.000Z","dependencies_parsed_at":null,"dependency_job_id":"c8d650f4-06c4-43bb-a827-11baa51a28de","html_url":"https://github.com/JRMeyer/interspeech-2018","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/JRMeyer%2Finterspeech-2018","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/JRMeyer%2Finterspeech-2018/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/JRMeyer%2Finterspeech-2018/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/JRMeyer%2Finterspeech-2018/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/JRMeyer","download_url":"https://codeload.github.com/JRMeyer/interspeech-2018/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":246867055,"owners_count":20846671,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["interspeech2018","kaldi","multi-task-learning","rejection"],"created_at":"2024-10-06T09:01:59.496Z","updated_at":"2026-01-17T22:51:58.687Z","avatar_url":"https://github.com/JRMeyer.png","language":"TeX","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Multilingual Multi-Task Learning for Low-Resource Languages\n\n## Abstract\n\n  The following study investigates low-resource multilingual acoustic model training with Multi-Task Learning (MTL) for Automatic Speech Recognition. The main question of this research is: *What is the best way to represent a source language with MTL to improve performance on the target language?* The two parameters of interest are (1) the level of detail at which the source language is modeled, and (2) the relative weighting of source vs. target languages during backprop.\n\nResults show that when the source task is weighted \\textit{higher} than the target task, a *more* detailed task representation (ie. the triphone) leads to better performance on the target language. On the other hand, when the source task is weighted *lower*, then a *less* detailed level of source task representation (ie. the monophone) is better for performance in the target language. Given all levels of detail in the source task, a 1-to-1 weighting ratio of source-to-target leads to best results on average.\n\nThis study uses Kyrgyz (audiobook recordings) as a target language and English (LibriSpeech subset) as a source language.\n\n## Reviewer Comments\n\nYou can find the Interspeech Committee's comments in the `REVIEWER_COMMENTS.txt` file.","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fjrmeyer%2Finterspeech-2018","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fjrmeyer%2Finterspeech-2018","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fjrmeyer%2Finterspeech-2018/lists"}