{"id":18651600,"url":"https://github.com/gsriram7/pos","last_synced_at":"2025-11-05T14:30:27.042Z","repository":{"id":88507671,"uuid":"124019516","full_name":"gsriram7/POS","owner":"gsriram7","description":"Part of speech tagger using HMM and Viterbi algorithm","archived":false,"fork":false,"pushed_at":"2018-03-06T04:40:11.000Z","size":4420,"stargazers_count":1,"open_issues_count":0,"forks_count":0,"subscribers_count":2,"default_branch":"master","last_synced_at":"2024-12-27T13:40:40.859Z","etag":null,"topics":["hmm","hmm-viterbi-algorithm","part-of-speech-tagger","viterbi-algorithm"],"latest_commit_sha":null,"homepage":"http://ron.artstein.org/csci544-2018/coding-1.html","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/gsriram7.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":"support_files/1.txt","governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2018-03-06T04:15:11.000Z","updated_at":"2019-10-01T05:25:29.000Z","dependencies_parsed_at":null,"dependency_job_id":"0eb580b0-a91a-4744-8c08-3abb1fc08daa","html_url":"https://github.com/gsriram7/POS","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/gsriram7%2FPOS","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/gsriram7%2FPOS/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/gsriram7%2FPOS/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/gsriram7%2FPOS/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/gsriram7","download_url":"https://codeload.github.com/gsriram7/POS/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":239461313,"owners_count":19642550,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["hmm","hmm-viterbi-algorithm","part-of-speech-tagger","viterbi-algorithm"],"created_at":"2024-11-07T06:52:03.093Z","updated_at":"2025-11-05T14:30:26.989Z","avatar_url":"https://github.com/gsriram7.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Part of Speech Tagger\n\nThe tagger uses Hidden Markov Model to encode the a language corpus with words tagged with corresponding tags.\nUses Viterbi algorithm to decode and tag sentences from test data.\n\nThe encoder is generic and it works for ***ANY*** language.\n\nThe encoder models the [corpus](en_train_tagged.txt) and writes the probabilities into [hmmmodel.txt](hmmmodel.txt)\nThe decoder consumes the model and tags the [test data](en_dev_raw.txt) and writes the output into [hmmoutput.txt](hmmoutput.txt)\n\n## Accuracy for the model trained on given corpa\n\n * English  - 88.93%\n * Chinese  - 87.08%\n * Hindi    - 92.34%\n \n These accuracies are obtained using a single generic encoder for 3 different languages.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fgsriram7%2Fpos","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fgsriram7%2Fpos","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fgsriram7%2Fpos/lists"}