{"id":28547109,"url":"https://github.com/skywind3000/lemma.en","last_synced_at":"2025-07-23T06:03:27.051Z","repository":{"id":82864411,"uuid":"86447073","full_name":"skywind3000/lemma.en","owner":"skywind3000","description":"English Lemma Database - Compiled by Referencing British National Corpus","archived":false,"fork":false,"pushed_at":"2024-09-23T08:53:41.000Z","size":789,"stargazers_count":31,"open_issues_count":2,"forks_count":4,"subscribers_count":4,"default_branch":"master","last_synced_at":"2025-07-07T07:43:04.933Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":null,"has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/skywind3000.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null}},"created_at":"2017-03-28T10:30:43.000Z","updated_at":"2025-05-27T07:42:05.000Z","dependencies_parsed_at":"2025-07-07T07:36:53.587Z","dependency_job_id":"324af7a7-cf5b-4ab3-b427-855cba011281","html_url":"https://github.com/skywind3000/lemma.en","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/skywind3000/lemma.en","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/skywind3000%2Flemma.en","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/skywind3000%2Flemma.en/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/skywind3000%2Flemma.en/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/skywind3000%2Flemma.en/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/skywind3000","download_url":"https://codeload.github.com/skywind3000/lemma.en/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/skywind3000%2Flemma.en/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":266626115,"owners_count":23958344,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-07-23T02:00:09.312Z","response_time":66,"last_error":null,"robots_txt_status":null,"robots_txt_updated_at":null,"robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2025-06-10T00:09:20.664Z","updated_at":"2025-07-23T06:03:27.021Z","avatar_url":"https://github.com/skywind3000.png","language":null,"funding_links":[],"categories":[],"sub_categories":[],"readme":"## Preface\n\nEnglish Lemma Database - Compiled by Referencing British National Corpus\n\nCompiled by Lin Wei (https://github.com/skywind3000), Mar 28, 2017 by referencing the 100M+ words in the British Nation Corpus (BNC), NodeBox Linguistics and Yasumasa Someya's lemma list.\n\nThis lemma list is provided \"as is\" and is free to use for any research and/or educational purposes. The list currently contains 186,523 words (tokens) in 84,487 lemma groups. \n\n\n## Data Format\n\nDefinition\n\n```text\nword/bnc-frequence -\u003e form1 (, form2 (, form3...))\n```\n\nData Sample:\n```text\nbe/4109826 -\u003e is,was,are,were,'s,been,being,'re,'m,am,m\nhave/1315648 -\u003e had,has,'ve,having,'s,'d,d,ve\nit/1213224 -\u003e its,they\nhe/1196022 -\u003e his,him,they\ni/1133697 -\u003e my,me,we,is\nthey/841960 -\u003e their,them,'em\nyou/804279 -\u003e your,ya,ye\nnot/767330 -\u003e n't\nshe/653505 -\u003e her\ndo/535646 -\u003e did,does,done,doing,du,d'\nwe/503360 -\u003e our,us\nwill/334612 -\u003e 'll,wo,ll\nsay/317317 -\u003e said,says,saying\nwould/278414 -\u003e 'd\ncan/263138 -\u003e ca,cans,can,could\ngo/227247 -\u003e going,went,gone,goes,goin'\nget/212569 -\u003e got,getting,gets,gotten\nmake/209818 -\u003e made,making,makes\nup/206976 -\u003e ups,upping,upped\nsee/184969 -\u003e seen,saw,seeing,sees\nother/181277 -\u003e others\ntime/181080 -\u003e times,timed,timing\nknow/177717 -\u003e knew,known,knows,knowing\ntake/172773 -\u003e took,taken,taking,takes\nyear/161649 -\u003e years\n```\n\n\n## About\n\nIf you have any questions or comments about this lemma list, feel free to contact me (skywind3000@163.com), at any time...\n\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fskywind3000%2Flemma.en","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fskywind3000%2Flemma.en","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fskywind3000%2Flemma.en/lists"}