{"id":19056836,"url":"https://github.com/andrianllmm/akl-dictionary-parser","last_synced_at":"2025-11-12T14:03:24.758Z","repository":{"id":252387316,"uuid":"840276562","full_name":"andrianllmm/akl-dictionary-parser","owner":"andrianllmm","description":"A Python script that parses an Aklanon dictionary and converts it into several useful formats.","archived":false,"fork":false,"pushed_at":"2024-08-09T10:50:55.000Z","size":17804,"stargazers_count":1,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-01-02T11:44:09.468Z","etag":null,"topics":["aklanon","dictionary","parser","text-mining"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"apache-2.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/andrianllmm.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2024-08-09T10:49:14.000Z","updated_at":"2024-09-07T14:41:42.000Z","dependencies_parsed_at":"2024-08-09T13:10:40.189Z","dependency_job_id":null,"html_url":"https://github.com/andrianllmm/akl-dictionary-parser","commit_stats":null,"previous_names":["andrianllmm/akl-dictionary-parser"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/andrianllmm%2Fakl-dictionary-parser","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/andrianllmm%2Fakl-dictionary-parser/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/andrianllmm%2Fakl-dictionary-parser/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/andrianllmm%2Fakl-dictionary-parser/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/andrianllmm","download_url":"https://codeload.github.com/andrianllmm/akl-dictionary-parser/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":240113070,"owners_count":19749732,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["aklanon","dictionary","parser","text-mining"],"created_at":"2024-11-08T23:52:03.923Z","updated_at":"2025-11-12T14:03:24.745Z","avatar_url":"https://github.com/andrianllmm.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Aklanon Dictionary Parser\n\n**A Python script that parses an Aklanon dictionary and converts it into several\nuseful formats**\n\n## About\n\nThis parser parses an [Aklanon](https://en.wikipedia.org/wiki/Aklanon_language)\ndictionary from the book\n[A Study of the Aklanon Dialect (Vol. 2)](https://files.eric.ed.gov/fulltext/ED145704.pdf)\nin Excel format and outputs it to [JSON format](output/akl_dictionary.json),\n[frequency list](output/akl_freqlist.csv), and\n[word list](output/akl_wordlist.txt). Since the book is in PDF format, the\ndictionary is manually encoded from the book to the Excel file. The encoded data\nis still incomplete as it is very time consuming to manually encode. Thus,\n[contribute](#contributing) if you can.\n\n## Output\n\n\u003e \u003cstrong style=\"font-size: large;\"\u003e4,471 words collected\u003c/strong\u003e \u003csmall\u003e(as of\n\u003e 08/09/2024)\u003c/small\u003e\n\n| Resource       | Format | Link                                                     |\n| -------------- | ------ | -------------------------------------------------------- |\n| Dictionary     | json   | [output/akl_dictionary.json](output/akl_dictionary.json) |\n| Frequency list | csv    | [output/akl_freqlist.csv](output/akl_freqlist.csv)       |\n| Word list      | txt    | [output/akl_wordlist.txt](output/akl_wordlist.txt)       |\n\n### JSON Dictionary\n\nThe JSON dictionary is structured as a list of words and its corresponding list\nof attributes. The attributes include part of speech, definition, etymology,\nclassifications, synonyms, antonyms, example sentences, inflections, and\nsources. The entries are sorted alphabetically.\n\n```json\n[\n  {\n    \"word\": \"The word itself\",\n    \"attributes\": [\n      {\n        \"pos\": \"Simplified arts of speech\",\n        \"definition\": \"The definition\",\n        \"origin\": \"The etymology\",\n        \"classification\": \"Any classification\",\n        \"similar\": [\"List of synonyms\"],\n        \"opposite\": [\"List of antonyms\"],\n        \"examples\": [\"List of example sentences that use the word\"],\n        \"inflections\": [\"List of inflected forms\"],\n        \"sources\": [\"List of sources\"]\n      }\n    ]\n  }\n]\n```\n\n### Frequency list\n\nThe frequency list is structured as a list of words and its corresponding\nfrequency value sorted from highest to lowest frequency value. Since there's no\navailable Aklanon frequency list yet, all frequency values are set to 1.\n\n```csv\na,1\nab-ab,1\naba,1\n```\n\n### Word list\n\nThe word list is simply the list of words sorted alphabetically.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fandrianllmm%2Fakl-dictionary-parser","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fandrianllmm%2Fakl-dictionary-parser","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fandrianllmm%2Fakl-dictionary-parser/lists"}