{"id":13416266,"url":"https://github.com/google/corpuscrawler","last_synced_at":"2025-03-14T23:31:32.974Z","repository":{"id":26419424,"uuid":"102909145","full_name":"google/corpuscrawler","owner":"google","description":"Crawler for linguistic corpora","archived":false,"fork":false,"pushed_at":"2023-12-05T23:11:10.000Z","size":499,"stargazers_count":204,"open_issues_count":17,"forks_count":55,"subscribers_count":20,"default_branch":"master","last_synced_at":"2025-03-11T14:47:43.873Z","etag":null,"topics":["corpus-builder","corpus-linguistics","crawling","linguistics","minority-language"],"latest_commit_sha":null,"homepage":null,"language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"other","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/google.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":"CONTRIBUTING.md","funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2017-09-08T22:21:03.000Z","updated_at":"2025-03-09T11:45:11.000Z","dependencies_parsed_at":"2022-07-27T08:18:41.125Z","dependency_job_id":null,"html_url":"https://github.com/google/corpuscrawler","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/google%2Fcorpuscrawler","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/google%2Fcorpuscrawler/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/google%2Fcorpuscrawler/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/google%2Fcorpuscrawler/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/google","download_url":"https://codeload.github.com/google/corpuscrawler/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":243663512,"owners_count":20327300,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["corpus-builder","corpus-linguistics","crawling","linguistics","minority-language"],"created_at":"2024-07-30T21:00:56.189Z","updated_at":"2025-03-14T23:31:28.459Z","avatar_url":"https://github.com/google.png","language":"Python","funding_links":[],"categories":["Python"],"sub_categories":[],"readme":"# Corpus Crawler\n\n_Corpus Crawler_ is a tool for\n[Corpus Linguistics](https://en.wikipedia.org/wiki/Corpus_linguistics).\n\nModern linguistic research works on language corpora, which are large samples of\n“real world” text.  This crawler helps to build such corpora: it follows links\nto publicly accessible web pages known to be written in a certain language; it\nremoves boilerplate and HTML markup; finally, it writes its output into\nplaintext files.  The crawler implements the\n[Robots Exclusion Standard](https://en.wikipedia.org/wiki/Robots_exclusion_standard),\nand it is intentionally slow so it does not cause much load on the crawled\nweb sites.\n\nThis is not an official Google product.  But if you’re a linguistic researcher,\nor if you’re writing a spell checker (or similar language-processing software)\nfor an “exotic” language, you might find _Corpus Crawler_ useful.\n\nTo build corpora for not-yet-supported languages, please read the\n[contribution guidelines](./CONTRIBUTING.md) and send us\n[GitHub pull requests](https://help.github.com/categories/collaborating-with-issues-and-pull-requests/).\n\nThe crawled corpora have been used to compute word frequencies in\nUnicode’s [Unilex project](https://github.com/unicode-org/unilex).\n\n\n## Supported Languages\n\n| IETF BCP47 Code     | Language                     |  Tokens¹                                                                            |\n| :------------------ | :--------------------------- | ----------------------------------------------------------------------------------: |\n| `aai`               | Arifama-Miniafia             |    181K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/aai.txt)               |\n| `aak`               | Ankave                       |    194K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/aak.txt)               |\n| `aau`               | Abau                         |    313K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/aau.txt)               |\n| `aaz`               | Amarasi                      |    308K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/aaz.txt)               |\n| `abt`               | Ambulas                      |    297K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/abt.txt)               |\n| `aby`               | Aneme Wake                   |    233K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/aby.txt)               |\n| `acd`               | Gikyode                      |    323K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/acd.txt)               |\n| `ace`               | Aceh/Acehnese                |    817K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ace.txt)               |\n| `acf`               | Saint Lucian Creole French   |    236K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/acf.txt)               |\n| `ach`               | Acoli                        |    178K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ach.txt)               |\n| `acn`               | Achang                       |    232K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/acn.txt)               |\n| `acr`               | Achi                         |    239K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/acr.txt)               |\n| `acu`               | Achuar-Shiwiar               |    174K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/acu.txt)               |\n| `ade`               | Adele                        |    267K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ade.txt)               |\n| `adh`               | Adhola                       |    166K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/adh.txt)               |\n| `adj`               | Adioukrou                    |    233K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/adj.txt)               |\n| `ae`                | Avestan                      |    129K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ae.txt)                |\n| `ae-Latn`           | Avestan (Latin)              |    141K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ae-Latn.txt)           |\n| `aey`               | Amele                        |    218K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/aey.txt)               |\n| `agd`               | Agarabi                      |    256K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/agd.txt)               |\n| `agg`               | Angor                        |    214K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/agg.txt)               |\n| `agm`               | Angaataha                    |    238K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/agm.txt)               |\n| `agn`               | Agutaynen                    |    234K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/agn.txt)               |\n| `agr`               | Aguaruna                     |    149K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/agr.txt)               |\n| `ahk`               | Akha                         |    367K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ahk.txt)               |\n| `aia`               | Arosi                        |    223K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/aia.txt)               |\n| `akb`               | Batak Angkola                |    220K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/akb.txt)               |\n| `ake`               | Akawaio                      |    190K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ake.txt)               |\n| `akh`               | Akha                         |    408K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/akh.txt)               |\n| `akp`               | Siwu                         |    191K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/akp.txt)               |\n| `alj`               | Alangan                      |    185K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/alj.txt)               |\n| `alp`               | Alune                        |    225K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/alp.txt)               |\n| `alt`               | Southern Altai               |    121K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/alt.txt)               |\n| `alz`               | Alur                         |    160K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/alz.txt)               |\n| `am`                | Amharic                      |  2,170K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/am.txt)                |\n| `ame`               | Yanesha'                     |    221K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ame.txt)               |\n| `amf`               | Hamer-Banna                  |    152K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/amf.txt)               |\n| `amk`               | Ambai                        |    229K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/amk.txt)               |\n| `amm`               | Ama (Papua New Guinea)       |    246K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/amm.txt)               |\n| `amn`               | Amanab                       |    207K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/amn.txt)               |\n| `amp`               | Alamblak                     |    241K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/amp.txt)               |\n| `amr`               | Amarakaeri                   |    151K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/amr.txt)               |\n| `amu`               | Guerrero Amuzgo              |    202K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/amu.txt)               |\n| `ann`               | Obolo                        |    236K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ann.txt)               |\n| `anv`               | Denya                        |    214K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/anv.txt)               |\n| `aoj`               | Mufian                       |    217K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/aoj.txt)               |\n| `aom`               | Ömie                         |    231K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/aom.txt)               |\n| `aon`               | Bumbita Arapesh              |    294K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/aon.txt)               |\n| `aoz`               | Uab Meto                     |    197K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/aoz.txt)               |\n| `ape`               | Bukiyip                      |    294K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ape.txt)               |\n| `apr`               | Arop-Lokep                   |    373K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/apr.txt)               |\n| `apz`               | Safeyoka                     |    235K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/apz.txt)               |\n| `ar`                | Arabic                       | 19,593K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ar.txt)                |\n| `arl`               | Arabela                      |    206K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/arl.txt)               |\n| `asg`               | Cishingini                   |    270K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/asg.txt)               |\n| `aso`               | Dano                         |    290K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/aso.txt)               |\n| `ata`               | Pele-Ata                     |    248K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ata.txt)               |\n| `atb`               | Zaiwa                        |    291K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/atb.txt)               |\n| `atg`               | Ivbie North-Okpela-Arhe      |    229K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/atg.txt)               |\n| `atq`               | Aralle-Tabulahan             |    202K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/atq.txt)               |\n| `auy`               | Awiyaana                     |    164K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/auy.txt)               |\n| `av`                | Avaric                       |    111K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/av.txt)                |\n| `avn`               | Avatime                      |    229K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/avn.txt)               |\n| `avt`               | Au                           |    263K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/avt.txt)               |\n| `avu`               | Avokaya                      |    391K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/avu.txt)               |\n| `awa`               | Awadhi                       |    211K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/awa.txt)               |\n| `awb`               | Awa (Papua New Guinea)       |    179K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/awb.txt)               |\n| `ay`                | Aymara                       |    482K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ay.txt)                |\n| `ayo`               | Ayoreo                       |    264K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ayo.txt)               |\n| `az`                | Azerbaijani                  |  3,413K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/az.txt)                |\n| `azg`               | San Pedro Amuzgos Amuzgo     |    271K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/azg.txt)               |\n| `azz`               | Highland Puebla Nahuatl      |    265K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/azz.txt)               |\n| `ba`                | Bashkir                      |    666K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ba.txt)                |\n| `ban`               | Balinese                     |    211K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ban.txt)               |\n| `bao`               | Waimaha                      |    232K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bao.txt)               |\n| `bav`               | Vengo                        |    250K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bav.txt)               |\n| `bba`               | Baatonum                     |    792K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bba.txt)               |\n| `bbb`               | Barai                        |    289K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bbb.txt)               |\n| `bbo`               | Northern Bobo Madaré         |    211K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bbo.txt)               |\n| `bbr`               | Girawa                       |    245K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bbr.txt)               |\n| `bch`               | Bariai                       |    248K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bch.txt)               |\n| `bcw`               | Bana                         |    304K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bcw.txt)               |\n| `bdd`               | Bunama                       |    171K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bdd.txt)               |\n| `be`                | Belarusian                   |  1,441K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/be.txt)                |\n| `be-tarask`         | Belarusian (Taraškievica)    | 108,431K [💾](http://www.gstatic.com/i18n/corpora/wordcounts/be-tarask.txt)         |\n| `bef`               | Benabena                     |    239K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bef.txt)               |\n| `bep`               | Besoa                        |    204K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bep.txt)               |\n| `bex`               | Jur Modo                     |    254K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bex.txt)               |\n| `bfd`               | Bafut                        |    276K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bfd.txt)               |\n| `bfo`               | Malba Birifor                |    260K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bfo.txt)               |\n| `bg`                | Bulgarian                    | 10,597K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bg.txt)                |\n| `bgr`               | Bawm Chin                    |    213K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bgr.txt)               |\n| `bgz`               | Banggai                      |    186K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bgz.txt)               |\n| `bhl`               | Bimin                        |    324K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bhl.txt)               |\n| `bhw`               | Biak                         |    164K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bhw.txt)               |\n| `bi`                | Bislama                      |    315K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bi.txt)                |\n| `bib`               | Bissa                        |    243K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bib.txt)               |\n| `big`               | Biangai                      |    229K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/big.txt)               |\n| `bik`               | Central Bikol                |    183K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bik.txt)               |\n| `bim`               | Bimoba                       |    215K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bim.txt)               |\n| `biv`               | Southern Birifor             |    221K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/biv.txt)               |\n| `bjr`               | Binumarien                   |    226K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bjr.txt)               |\n| `bjv`               | Bedjond                      |    268K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bjv.txt)               |\n| `bkl`               | Berik                        |    306K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bkl.txt)               |\n| `bku`               | Buhid                        |    204K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bku.txt)               |\n| `bkv`               | Bekwarra                     |    244K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bkv.txt)               |\n| `blh`               | Kuwaa                        |    259K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/blh.txt)               |\n| `blt-Latn`          | Tai Dam (Latin)              |    262K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/blt-Latn.txt)          |\n| `blz`               | Balantak                     |    199K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/blz.txt)               |\n| `bm`                | Bambara                      |     30K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bm.txt)                |\n| `bmh`               | Kein                         |    253K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bmh.txt)               |\n| `bmq`               | Bomu                         |    207K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bmq.txt)               |\n| `bmr`               | Muinane                      |    122K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bmr.txt)               |\n| `bmu`               | Somba-Siawari                |    234K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bmu.txt)               |\n| `bmv`               | Bum                          |    258K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bmv.txt)               |\n| `bn`                | Bangla                       |  7,258K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bn.txt)                |\n| `bnj`               | Eastern Tawbuid              |    239K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bnj.txt)               |\n| `bnp`               | Bola                         |    263K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bnp.txt)               |\n| `bo`                | Tibetan                      |  5,642K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bo.txt)                |\n| `boa`               | Bora                         |    133K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/boa.txt)               |\n| `boj`               | Anjam                        |    255K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/boj.txt)               |\n| `bon`               | Bine                         |    244K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bon.txt)               |\n| `bov`               | Tuwuli                       |    203K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bov.txt)               |\n| `box`               | Buamu                        |    274K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/box.txt)               |\n| `bpr`               | Koronadal Blaan              |    204K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bpr.txt)               |\n| `bps`               | Sarangani Blaan              |    214K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bps.txt)               |\n| `bqc`               | Boko                         |    567K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bqc.txt)               |\n| `bqj`               | Bandial                      |    175K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bqj.txt)               |\n| `bqp`               | Busa                         |    162K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bqp.txt)               |\n| `bru`               | Eastern Bru                  |    261K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bru.txt)               |\n| `bs`                | Bosnian                      |  8,993K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bs.txt)                |\n| `bsn`               | Barasana-Eduria              |    225K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bsn.txt)               |\n| `bss`               | Akoose                       |    199K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bss.txt)               |\n| `btd`               | Batak Dairi                  |    192K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/btd.txt)               |\n| `bts`               | Batak Simalungun             |    175K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bts.txt)               |\n| `btt`               | Bete-Bendi                   |    266K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/btt.txt)               |\n| `btx`               | Batak Karo                   |    189K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/btx.txt)               |\n| `bua`               | Buriat                       |    143K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bua.txt)               |\n| `bud`               | Ntcham                       |    207K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bud.txt)               |\n| `buk`               | Bugawac                      |    264K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/buk.txt)               |\n| `bus`               | Bokobaru                     |    159K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bus.txt)               |\n| `bvc`               | Baelelea                     |    308K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bvc.txt)               |\n| `bvz`               | Bauzi                        |    509K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bvz.txt)               |\n| `bwq`               | Southern Bobo Madaré         |    214K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bwq.txt)               |\n| `bwu`               | Buli                         |    285K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bwu.txt)               |\n| `byr`               | Baruya                       |    182K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/byr.txt)               |\n| `byx`               | Qaqet                        |    387K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/byx.txt)               |\n| `bzh`               | Mapos Buang                  |    251K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bzh.txt)               |\n| `bzi`               | Bisu                         |    381K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bzi.txt)               |\n| `bzj`               | Belize Kriol English         |    240K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/bzj.txt)               |\n| `ca-valencia`       | Valencian                    | 24,295K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ca-valencia.txt)       |\n| `caa`               | Chortí                       |    307K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/caa.txt)               |\n| `cab`               | Garifuna                     |    154K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/cab.txt)               |\n| `cac`               | Chuj                         |    244K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/cac.txt)               |\n| `cak`               | Kaqchikel                    |    259K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/cak.txt)               |\n| `cap`               | Chipaya                      |    154K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/cap.txt)               |\n| `car`               | Galibi Carib                 |    160K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/car.txt)               |\n| `cax`               | Chiquitano                   |    149K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/cax.txt)               |\n| `cbc`               | Carapana                     |    256K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/cbc.txt)               |\n| `cbi`               | Chachi                       |    187K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/cbi.txt)               |\n| `cbl`               | Bualkhaw Chin                |    210K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/cbl.txt)               |\n| `cbr`               | Cashibo-Cacataibo            |    236K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/cbr.txt)               |\n| `cbs`               | Cashinahua                   |    198K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/cbs.txt)               |\n| `cbt`               | Chayahuita                   |    150K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/cbt.txt)               |\n| `cbv`               | Cacua                        |    265K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/cbv.txt)               |\n| `cce`               | Chopi                        |    204K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/cce.txt)               |\n| `ccp`               | Chakma                       |     79K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ccp.txt)               |\n| `cdf`               | Chiru                        |    193K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/cdf.txt)               |\n| `ce`                | Chechen                      |    669K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ce.txt)                |\n| `ceb`               | Cebuano                      |  1,067K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ceb.txt)               |\n| `ceg`               | Chamacoco                    |    232K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ceg.txt)               |\n| `cfm`               | Falam Chin                   |    438K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/cfm.txt)               |\n| `cgc`               | Kagayanen                    |    299K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/cgc.txt)               |\n| `chj`               | Ojitlán Chinantec            |    305K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/chj.txt)               |\n| `chm`               | Mari                         |    132K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/chm.txt)               |\n| `chr`               | Cherokee                     |    119K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/chr.txt)               |\n| `chz`               | Ozumacín Chinantec           |    205K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/chz.txt)               |\n| `cjo`               | Ashéninka Pajonal            |    141K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/cjo.txt)               |\n| `cjp`               | Cabécar                      |    199K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/cjp.txt)               |\n| `cjv`               | Chuave                       |    286K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/cjv.txt)               |\n| `cko`               | Anufo                        |    272K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/cko.txt)               |\n| `cle`               | Lealao Chinantec             |    313K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/cle.txt)               |\n| `cme`               | Cerma                        |    230K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/cme.txt)               |\n| `cmr`               | Mro-Khimi Chin               |    275K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/cmr.txt)               |\n| `cnh`               | Hakha Chin                   |    934K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/cnh.txt)               |\n| `cni`               | Asháninka                    |    122K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/cni.txt)               |\n| `cnk`               | Khumi Chin                   |    237K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/cnk.txt)               |\n| `cnl`               | Lalana Chinantec             |    308K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/cnl.txt)               |\n| `cnt`               | Tepetotutla Chinantec        |    261K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/cnt.txt)               |\n| `coe`               | Koreguaje                    |    181K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/coe.txt)               |\n| `cof`               | Colorado                     |    183K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/cof.txt)               |\n| `cok`               | Santa Teresa Cora            |    230K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/cok.txt)               |\n| `con`               | Cofán                        |    151K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/con.txt)               |\n| `cot`               | Caquinte                     |    128K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/cot.txt)               |\n| `crh`               | Crimean Tatar                |    505K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/crh.txt)               |\n| `cs`                | Czech                        |  3,141K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/cs.txt)                |\n| `csk`               | Jola-Kasa                    |    177K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/csk.txt)               |\n| `cso`               | Sochiapam Chinantec          |    328K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/cso.txt)               |\n| `ctd-Latn`          | Tedim Chin (Latin)           |    852K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ctd-Latn.txt)          |\n| `ctu`               | Chol                         |    203K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ctu.txt)               |\n| `cub`               | Cubeo                        |    220K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/cub.txt)               |\n| `cuc`               | Usila Chinantec              |    278K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/cuc.txt)               |\n| `cui`               | Cuiba                        |    292K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/cui.txt)               |\n| `cuk`               | San Blas Kuna                |    187K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/cuk.txt)               |\n| `cul`               | Culina                       |    221K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/cul.txt)               |\n| `cv`                | Chuvash                      |    111K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/cv.txt)                |\n| `cwe`               | Kwere                        |    144K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/cwe.txt)               |\n| `cwt`               | Kuwaataay                    |    168K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/cwt.txt)               |\n| `cy`                | Welsh                        | 11,519K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/cy.txt)                |\n| `cya`               | Nopala Chatino               |    245K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/cya.txt)               |\n| `czt`               | Zotung Chin                  |    227K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/czt.txt)               |\n| `da`                | Danish                       |    655K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/da.txt)                |\n| `daa`               | Dangaléat                    |    208K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/daa.txt)               |\n| `dad`               | Marik                        |    197K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/dad.txt)               |\n| `dah`               | Gwahatike                    |    274K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/dah.txt)               |\n| `ddn`               | Dendi                        |    210K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ddn.txt)               |\n| `de`                | German                       | 46,431K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/de.txt)                |\n| `ded`               | Dedua                        |    146K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ded.txt)               |\n| `des`               | Desano                       |    210K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/des.txt)               |\n| `dga`               | Southern Dagaare             |    458K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/dga.txt)               |\n| `dgi`               | Northern Dagara              |    257K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/dgi.txt)               |\n| `dgz`               | Daga                         |    219K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/dgz.txt)               |\n| `din`               | Southwestern Dinka           |    196K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/din.txt)               |\n| `dip`               | Northeastern Dinka           |    193K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/dip.txt)               |\n| `djk`               | Eastern Maroon Creole        |    307K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/djk.txt)               |\n| `dln`               | Darlong                      |    776K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/dln.txt)               |\n| `dnw`               | Western Dani                 |    254K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/dnw.txt)               |\n| `dob`               | Dobu                         |    179K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/dob.txt)               |\n| `dop`               | Lukpa                        |    226K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/dop.txt)               |\n| `dsh`               | Daasanach                    |    211K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/dsh.txt)               |\n| `dtb`               | Labuk-Kinabatangan Kadazan   |    248K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/dtb.txt)               |\n| `dtp`               | Kadazan Dusun                |  1,038K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/dtp.txt)               |\n| `dts`               | Toro So Dogon                |    202K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/dts.txt)               |\n| `due`               | Umiray Dumaget Agta          |    247K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/due.txt)               |\n| `dug`               | Duruma                       |    172K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/dug.txt)               |\n| `duo`               | Dupaninan Agta               |    266K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/duo.txt)               |\n| `dwr`               | Dawro                        |    254K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/dwr.txt)               |\n| `dww`               | Dawawa                       |    208K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/dww.txt)               |\n| `dyi`               | Djimini Senoufo              |    268K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/dyi.txt)               |\n| `dyo`               | Jola-Fonyi                   |    158K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/dyo.txt)               |\n| `dyu`               | Dyula                        |  1,156K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/dyu.txt)               |\n| `dz`                | Dzongkha                     |     61K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/dz.txt)                |\n| `ee`                | Ewe                          |    421K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ee.txt)                |\n| `eka`               | Ekajuk                       |    213K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/eka.txt)               |\n| `el`                | Greek                        |  5,470K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/el.txt)                |\n| `emi`               | Mussau-Emira                 |    176K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/emi.txt)               |\n| `emp`               | Northern Emberá              |    158K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/emp.txt)               |\n| `enb`               | Markweeta                    |    147K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/enb.txt)               |\n| `enq`               | Enga                         |    217K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/enq.txt)               |\n| `enx`               | Enxet                        |    772K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/enx.txt)               |\n| `eri`               | Ogea                         |    269K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/eri.txt)               |\n| `es`                | Spanish                      | 32,670K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/es.txt)                |\n| `ese`               | Ese Ejja                     |    226K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ese.txt)               |\n| `et`                | Estonian                     |  3,658K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/et.txt)                |\n| `eu`                | Basque                       |    130K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/eu.txt)                |\n| `ewo`               | Ewondo                       |    158K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ewo.txt)               |\n| `eza`               | Ezaa                         |    963K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/eza.txt)               |\n| `fa`                | Persian                      |  9,114K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/fa.txt)                |\n| `fa-AF`             | Dari                         |  7,363K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/fa-AF.txt)             |\n| `faa`               | Fasu                         |    238K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/faa.txt)               |\n| `fai`               | Faiwol                       |    256K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/fai.txt)               |\n| `fal`               | South Fali                   |    198K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/fal.txt)               |\n| `far`               | Fataleka                     |    286K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/far.txt)               |\n| `fi`                | Finnish                      |  4,837K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/fi.txt)                |\n| `fil`               | Tagalog                      |    184K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/fil.txt)               |\n| `fip`               | Fipa                         |    134K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/fip.txt)               |\n| `fit`               | Tornedalen Finnish           |    292K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/fit.txt)               |\n| `fj`                | Fijian                       |    257K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/fj.txt)                |\n| `fo`                | Faroese                      |    851K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/fo.txt)                |\n| `fon`               | Fon                          |    266K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/fon.txt)               |\n| `for`               | Fore                         |    169K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/for.txt)               |\n| `fr`                | French                       |  5,488K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/fr.txt)               |\n| `fue`               | Borgu Fulfulde               |    148K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/fue.txt)               |\n| `fuf`               | Pular                        |    174K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/fuf.txt)               |\n| `fuq`               | Central-Eastern Niger Fulfulde |  156K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/fuq.txt)               |\n| `fuv`               | Nigerian Fulfulde            |     13K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/fuv.txt)               |\n| `ga`                | Irish                        |  7,587K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ga.txt)                |\n| `gag`               | Gagauz                       |    245K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/gag.txt)               |\n| `gah`               | Alekano                      |    210K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/gah.txt)               |\n| `gam`               | Kandawo                      |    250K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/gam.txt)               |\n| `gaw`               | Nobonob                      |    246K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/gaw.txt)               |\n| `gbi`               | Galela                       |    288K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/gbi.txt)               |\n| `gd`                | Scottish Gaelic              | 17,105K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/gd.txt)                |\n| `gde`               | Gude                         |    217K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/gde.txt)               |\n| `gdn`               | Umanakaina                   |    306K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/gdn.txt)               |\n| `gdr`               | Wipi                         |    271K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/gdr.txt)               |\n| `gej`               | Gen                          |    236K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/gej.txt)               |\n| `gfk`               | Patpatar                     |    294K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/gfk.txt)               |\n| `ghs`               | Guhu-Samane                  |    186K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ghs.txt)               |\n| `gil`               | Gilbertese                   |    228K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/gil.txt)               |\n| `gkn`               | Gokana                       |    267K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/gkn.txt)               |\n| `gmv-Latn`          | Gamo (Latin)                 |    127K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/gmv-Latn.txt)          |\n| `gn`                | Guarani                      |    142K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/gn.txt)                |\n| `gnd`               | Zulgo-Gemzek                 |    364K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/gnd.txt)               |\n| `gng`               | Ngangam                      |    219K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/gng.txt)               |\n| `gnw`               | Western Bolivian Guaraní     |    263K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/gnw.txt)               |\n| `gof`               | Gofa                         |    124K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/gof.txt)               |\n| `gog`               | Gogo                         |    173K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/gog.txt)               |\n| `gor`               | Gorontalo                    |    211K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/gor.txt)               |\n| `gqr`               | Gor                          |    218K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/gqr.txt)               |\n| `grb`               | Northern Grebo               |    270K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/grb.txt)               |\n| `grt`               | Garo                         |    141K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/grt.txt)               |\n| `gso`               | Southwest Gbaya              |    228K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/gso.txt)               |\n| `gsw-u-sd-chag`     | Swiss German (Aargau)        |     99K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/gsw-u-sd-chag.txt)     |\n| `gsw-u-sd-chbe`     | Swiss German (Bern)          |     73K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/gsw-u-sd-chbe.txt)     |\n| `gsw-u-sd-chfr`     | Swiss German (Fribourg)      |     42K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/gsw-u-sd-chfr.txt)     |\n| `gu`                | Gujarati                     |    702K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/gu.txt)                |\n| `gub`               | Guajajára                    |    997K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/gub.txt)               |\n| `guc`               | Wayuu                        |    211K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/guc.txt)               |\n| `gud`               | Yocoboué Dida                |    216K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/gud.txt)               |\n| `guh`               | Guahibo                      |    204K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/guh.txt)               |\n| `gui`               | Eastern Bolivian Guaraní     |    197K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/gui.txt)               |\n| `gum`               | Guambiano                    |    186K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/gum.txt)               |\n| `gun`               | Mbyá Guaraní                 |    176K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/gun.txt)               |\n| `guo`               | Guayabero                    |    203K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/guo.txt)               |\n| `guq`               | Aché                         |    184K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/guq.txt)               |\n| `gur`               | Farefare                     |    240K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/gur.txt)               |\n| `gux`               | Gourmanchéma                 |    215K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/gux.txt)               |\n| `gv`                | Manx Gaelic                  |    152K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/gv.txt)                |\n| `gvc`               | Guanano                      |    241K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/gvc.txt)               |\n| `gvf`               | Golin                        |    276K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/gvf.txt)               |\n| `gvl`               | Gulay                        |    270K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/gvl.txt)               |\n| `gwr`               | Gwere                        |    157K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/gwr.txt)               |\n| `gym`               | Ngäbere                      |    294K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/gym.txt)               |\n| `gyr`               | Guarayu                      |    176K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/gyr.txt)               |\n| `ha`                | Hausa                        |  1,775K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ha.txt)                |\n| `hae`               | Eastern Oromo                |    163K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/hae.txt)               |\n| `hag`               | Hanga                        |    202K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/hag.txt)               |\n| `haw`               | Hawaiian                     |  2,221K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/haw.txt)               |\n| `hay`               | Haya                         |    112K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/hay.txt)               |\n| `heh`               | Hehe                         |    136K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/heh.txt)               |\n| `hi`                | Hindi                        | 10,004K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/hi.txt)                |\n| `hif`               | Fiji Hindi                   |    204K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/hif.txt)               |\n| `hig`               | Kamwe                        |    261K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/hig.txt)               |\n| `hil`               | Hiligaynon                   |    208K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/hil.txt)               |\n| `hla`               | Halia                        |    273K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/hla.txt)               |\n| `hne`               | Chhattisgarhi                |    207K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/hne.txt)               |\n| `hnn`               | Hanunoo                      |    212K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/hnn.txt)               |\n| `hns`               | Caribbean Hindustani         |    312K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/hns.txt)               |\n| `ho`                | Hiri Motu                    |    240K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ho.txt)                |\n| `hot`               | Hote                         |    222K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/hot.txt)               |\n| `hr`                | Croatian                     |  8,188K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/hr.txt)                |\n| `ht`                | Haitian                      |  1,101K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ht.txt)                |\n| `hto`               | Minica Huitoto               |    182K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/hto.txt)               |\n| `hu`                | Hungarian                    |    600K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/hu.txt)                |\n| `hub`               | Huambisa                     |    160K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/hub.txt)               |\n| `hui`               | Huli                         |    232K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/hui.txt)               |\n| `hus`               | Huastec                      |    236K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/hus.txt)               |\n| `huu`               | Murui Huitoto                |    165K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/huu.txt)               |\n| `huv`               | San Mateo Del Mar Huave      |    197K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/huv.txt)               |\n| `hvn`               | Sabu                         |    312K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/hvn.txt)               |\n| `hy`                | Armenian                     | 25,972K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/hy.txt)                |\n| `ian`               | Iatmul                       |    224K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ian.txt)               |\n| `iba`               | Iban                         |    179K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/iba.txt)               |\n| `icr`               | Islander Creole English      |    248K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/icr.txt)               |\n| `id`                | Indonesian                   |  6,634K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/id.txt)                |\n| `ifa`               | Amganad Ifugao               |    810K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ifa.txt)               |\n| `ifb`               | Batad Ifugao                 |    835K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ifb.txt)               |\n| `ife`               | Ifè                          |    300K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ife.txt)               |\n| `ifk`               | Tuwali Ifugao                |    214K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ifk.txt)               |\n| `ifu`               | Mayoyao Ifugao               |    258K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ifu.txt)               |\n| `ify`               | Keley-I Kallahan             |    863K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ify.txt)               |\n| `ig`                | Igbo                         |     13K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ig.txt)                |\n| `ign`               | Ignaciano                    |    161K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ign.txt)               |\n| `ik`                | Inupiaq                      |     96K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ik.txt)                |\n| `ilo`               | Iloko                        |    169K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ilo.txt)               |\n| `imo`               | Imbongu                      |    280K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/imo.txt)               |\n| `inb`               | Inga                         |    151K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/inb.txt)               |\n| `ino`               | Inoke-Yate                   |    236K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ino.txt)               |\n| `iou`               | Tuma-Irumu                   |    225K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/iou.txt)               |\n| `ipi`               | Ipili                        |    312K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ipi.txt)               |\n| `iri`               | Irigwe                       |    243K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/iri.txt)               |\n| `irk`               | Iraqw                        |    184K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/irk.txt)               |\n| `iry`               | Iraya                        |    205K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/iry.txt)               |\n| `it`                | Italian                      | 13,569K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/it.txt)                |\n| `itv`               | Itawit                       |    242K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/itv.txt)               |\n| `iu`                | Inuktitut                    |     98K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/iu.txt)                |\n| `iws`               | Sepik Iwam                   |    307K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/iws.txt)               |\n| `izr`               | Izere                        |    216K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/izr.txt)               |\n| `izz`               | Izii                         |    908K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/izz.txt)               |\n| `ja`                | Japanese                     |  2,116K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ja.txt)                |\n| `jac`               | Popti'                       |    221K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/jac.txt)               |\n| `jae`               | Yabem                        |    186K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/jae.txt)               |\n| `jam`               | Jamaican Creole English      |    254K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/jam.txt)               |\n| `jbu`               | Jukun Takum                  |    264K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/jbu.txt)               |\n| `jic`               | Tol                          |    285K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/jic.txt)               |\n| `jiv`               | Shuar                        |    134K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/jiv.txt)               |\n| `jmc`               | Machame                      |    150K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/jmc.txt)               |\n| `jun`               | Juang                        |    178K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/jun.txt)               |\n| `jv`                | Javanese                     |    177K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/jv.txt)                |\n| `jvn`               | Caribbean Javanese           |    211K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/jvn.txt)               |\n| `ka`                | Georgian                     |  4,978K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ka.txt)                |\n| `kaa`               | Kara-Kalpak                  |    135K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kaa.txt)               |\n| `kab-Arab`          | Kabyle (Arabic)              |    715K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kab-Arab.txt)          |\n| `kab-Tfng`          | Kabyle (Tifinagh)            |  1,338K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kab-Tfng.txt)          |\n| `kab`               | Kabyle                       |     66K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kab.txt)               |\n| `kac`               | Kachin                       |  1,057K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kac.txt)               |\n| `kao`               | Xaasongaxango                |    205K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kao.txt)               |\n| `kaq`               | Capanahua                    |    164K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kaq.txt)               |\n| `kbh`               | Camsá                        |    193K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kbh.txt)               |\n| `kbm`               | Iwal                         |    298K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kbm.txt)               |\n| `kbp`               | Kabiyè                       |    571K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kbp.txt)               |\n| `kbq`               | Kamano                       |    156K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kbq.txt)               |\n| `kbr`               | Kafa                         |    147K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kbr.txt)               |\n| `kcg`               | Tyap                         |    279K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kcg.txt)               |\n| `kdc`               | Kutu                         |    140K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kdc.txt)               |\n| `kdi`               | Kumam                        |    195K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kdi.txt)               |\n| `kdj`               | Karamojong                   |    163K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kdj.txt)               |\n| `kdn`               | Kunda                        |    144K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kdn.txt)               |\n| `kek`               | Kekchí                       |    406K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kek.txt)               |\n| `ken`               | Kenyang                      |    200K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ken.txt)               |\n| `keo`               | Kakwa                        |    215K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/keo.txt)               |\n| `ker`               | Kera                         |    267K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ker.txt)               |\n| `kew`               | West Kewa                    |    247K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kew.txt)               |\n| `kez`               | Kukele                       |    173K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kez.txt)               |\n| `kgf`               | Kube                         |    175K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kgf.txt)               |\n| `kgr`               | Abun                         |    356K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kgr.txt)               |\n| `khz`               | Keapara                      |    196K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/khz.txt)               |\n| `kia`               | Kim                          |    525K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kia.txt)               |\n| `kij`               | Kilivila                     |    155K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kij.txt)               |\n| `kj`                | Kuanyama                     |  1,474K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kj.txt)                |\n| `kjb`               | Q'anjob'al                   |    263K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kjb.txt)               |\n| `kje`               | Kisar                        |    235K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kje.txt)               |\n| `kjh`               | Khakas                       |    128K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kjh.txt)               |\n| `kjs`               | East Kewa                    |    251K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kjs.txt)               |\n| `kk`                | Kazakh                       |    642K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kk.txt)                |\n| `kki`               | Kagulu                       |    125K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kki.txt)               |\n| `kkj`               | Kako                         |    263K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kkj.txt)               |\n| `kln`               | Kalenjin                     |    149K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kln.txt)               |\n| `km`                | Khmer                        | 29,110K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/km.txt)                |\n| `kma`               | Konni                        |    230K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kma.txt)               |\n| `kmg`               | Kâte                         |    127K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kmg.txt)               |\n| `kmo`               | Kwoma                        |    213K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kmo.txt)               |\n| `kms`               | Kamasau                      |    293K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kms.txt)               |\n| `kmu`               | Kanite                       |    214K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kmu.txt)               |\n| `kn`                | Kannada                      |    126K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kn.txt)                |\n| `kne`               | Kankanaey                    |    230K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kne.txt)               |\n| `knf`               | Mankanya                     |    164K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/knf.txt)               |\n| `knj`               | Western Kanjobal             |  1,350K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/knj.txt)               |\n| `knk`               | Kuranko                      |    228K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/knk.txt)               |\n| `kno`               | Kono                         |    360K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kno.txt)               |\n| `knv`               | Tabo                         |    243K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/knv.txt)               |\n| `kog`               | Cogui                        |    189K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kog.txt)               |\n| `kpf`               | Komba                        |    174K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kpf.txt)               |\n| `kpg`               | Kapingamarangi               |    967K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kpg.txt)               |\n| `kpr`               | Korafe-Yegha                 |    262K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kpr.txt)               |\n| `kpw`               | Kobon                        |    288K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kpw.txt)               |\n| `kpx`               | Mountain Koiali              |    190K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kpx.txt)               |\n| `kpz`               | Kupsabiny                    |    166K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kpz.txt)               |\n| `kqc`               | Doromu-Koki                  |    209K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kqc.txt)               |\n| `kqe`               | Kalagan                      |    241K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kqe.txt)               |\n| `kqp`               | Kimré                        |    254K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kqp.txt)               |\n| `kqw`               | Kandas                       |    201K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kqw.txt)               |\n| `kqy`               | Koorete                      |    156K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kqy.txt)               |\n| `krc`               | Karachay-Balkar              |    132K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/krc.txt)               |\n| `kri`               | Krio                         |    256K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kri.txt)               |\n| `krj`               | Kinaray-A                    |    228K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/krj.txt)               |\n| `kru`               | Kurukh                       |    182K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kru.txt)               |\n| `ksd`               | Kuanua                       |    228K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ksd.txt)               |\n| `ksr`               | Borong                       |    233K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ksr.txt)               |\n| `ktb`               | Kambaata                     |    113K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ktb.txt)               |\n| `ktj`               | Plapo Krumen                 |    356K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ktj.txt)               |\n| `kto`               | Kuot                         |    286K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kto.txt)               |\n| `ku`                | Kurdish                      |  2,479K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ku.txt)                |\n| `kub`               | Kutep                        |    281K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kub.txt)               |\n| `kud`               | ‘Auhelawa                    |    167K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kud.txt)               |\n| `kue`               | Kuman (Papua New Guinea)     |    230K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kue.txt)               |\n| `kum`               | Kumyk                        |    142K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kum.txt)               |\n| `kup`               | Kunimaipa                    |    279K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kup.txt)               |\n| `kus`               | Kusaal                       |    200K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kus.txt)               |\n| `kv`                | Komi                         |    122K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kv.txt)                |\n| `kvn`               | Border Kuna                  |    212K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kvn.txt)               |\n| `kwf`               | Kwara'ae                     |    296K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kwf.txt)               |\n| `kwi`               | Awa-Cuaiquer                 |    165K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kwi.txt)               |\n| `kwj`               | Kwanga                       |    290K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kwj.txt)               |\n| `kxc`               | Konso                        |    148K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kxc.txt)               |\n| `kxm`               | Northern Khmer               |    257K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kxm.txt)               |\n| `ky`                | Kyrgyz                       | 18,597K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ky.txt)                |\n| `kyc`               | Kyaka                        |    220K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kyc.txt)               |\n| `kyf`               | Kouya                        |    215K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kyf.txt)               |\n| `kyg`               | Keyagana                     |    190K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kyg.txt)               |\n| `kyq`               | Kenga                        |    250K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kyq.txt)               |\n| `kyu`               | Western Kayah                |    466K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kyu.txt)               |\n| `kyz`               | Kayabí                       |    324K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kyz.txt)               |\n| `kze`               | Kosena                       |    164K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kze.txt)               |\n| `kzf`               | Da'a Kaili                   |    213K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kzf.txt)               |\n| `kzj`               | Coastal Kadazan              |    215K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/kzj.txt)               |\n| `la`                | Latin                        |     48K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/la.txt)                |\n| `laj`               | Lango                        |    175K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/laj.txt)               |\n| `las`               | Lama                         |    235K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/las.txt)               |\n| `law`               | Lauje                        |    262K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/law.txt)               |\n| `lb`                | Luxembourgish                |  5,173K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/lb.txt)                |\n| `lcm`               | Tungag                       |    239K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/lcm.txt)               |\n| `lee`               | Lyélé                        |    257K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/lee.txt)               |\n| `lef`               | Lelemi                       |    211K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/lef.txt)               |\n| `lem`               | Nomaande                     |    249K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/lem.txt)               |\n| `leu`               | Kara (Papua New Guinea)      |    255K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/leu.txt)               |\n| `lew`               | Ledo Kaili                   |    198K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/lew.txt)               |\n| `lex`               | Luang                        |    271K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/lex.txt)               |\n| `lgg`               | Lugbara                      |    188K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/lgg.txt)               |\n| `lhu`               | Lahu                         |    352K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/lhu.txt)               |\n| `lia`               | West-Central Limba           |    247K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/lia.txt)               |\n| `lid`               | Nyindrou                     |    308K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/lid.txt)               |\n| `lif`               | Limbu                        |    138K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/lif.txt)               |\n| `lip`               | Sekpele                      |    214K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/lip.txt)               |\n| `lis`               | Lisu                         |    304K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/lis.txt)               |\n| `ljp`               | Lampung Api                  |    188K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ljp.txt)               |\n| `lln`               | Lele                         |    291K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/lln.txt)               |\n| `lme`               | Pévé                         |    245K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/lme.txt)               |\n| `lmk`               | Lamkang                      |    217K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/lmk.txt)               |\n| `lnd`               | Lundayeh                     |    670K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/lnd.txt)               |\n| `lo`                | Lao                          |  4,384K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/lo.txt)                |\n| `lob`               | Lobi                         |    192K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/lob.txt)               |\n| `loe`               | Saluan                       |    220K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/loe.txt)               |\n| `lok`               | Loko                         |    264K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/lok.txt)               |\n| `lon`               | Malawi Lomwe                 |    137K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/lon.txt)               |\n| `lsi`               | Lashi                        |  1,077K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/lsi.txt)               |\n| `lsm`               | Saamia                       |    156K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/lsm.txt)               |\n| `lt`                | Lithuanian                   | 39,575K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/lt.txt)                |\n| `luc`               | Aringa                       |    242K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/luc.txt)               |\n| `lus`               | Lushai                       |    204K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/lus.txt)               |\n| `lv`                | Latvian                      |  1,020K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/lv.txt)                |\n| `lwo`               | Luwo                         |    255K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/lwo.txt)               |\n| `maa`               | San Jerónimo Tecóatl Mazatec |    487K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/maa.txt)               |\n| `mad`               | Madurese                     |    706K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mad.txt)               |\n| `mag`               | Magahi                       |    193K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mag.txt)               |\n| `mai`               | Maithili                     |    211K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mai.txt)               |\n| `maj`               | Jalapa De Díaz Mazatec       |    188K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/maj.txt)               |\n| `mak`               | Makasar                      |    179K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mak.txt)               |\n| `mam`               | Mam                          |    834K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mam.txt)               |\n| `maw`               | Mampruli                     |    251K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/maw.txt)               |\n| `maz`               | Central Mazahua              |    286K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/maz.txt)               |\n| `mbb`               | Western Bukidnon Manobo      |    278K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mbb.txt)               |\n| `mbc`               | Macushi                      |    221K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mbc.txt)               |\n| `mbh`               | Mangseng                     |    321K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mbh.txt)               |\n| `mbt`               | Matigsalug Manobo            |    226K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mbt.txt)               |\n| `mca`               | Maca                         |    208K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mca.txt)               |\n| `mcb`               | Machiguenga                  |    132K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mcb.txt)               |\n| `mcd`               | Sharanahua                   |    200K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mcd.txt)               |\n| `mco`               | Coatlán Mixe                 |    217K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mco.txt)               |\n| `mcp`               | Makaa                        |    237K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mcp.txt)               |\n| `mcq`               | Ese                          |    158K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mcq.txt)               |\n| `mcu`               | Cameroon Mambila             |    260K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mcu.txt)               |\n| `mda`               | Mada                         |    312K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mda.txt)               |\n| `mdy`               | Male                         |    589K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mdy.txt)               |\n| `med`               | Melpa                        |    283K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/med.txt)               |\n| `mee`               | Mengen                       |    301K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mee.txt)               |\n| `mej`               | Meyah                        |    323K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mej.txt)               |\n| `mek`               | Mekeo                        |    234K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mek.txt)               |\n| `men`               | Mende                        |    210K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/men.txt)               |\n| `meq`               | Merey                        |    291K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/meq.txt)               |\n| `meu`               | Motu                         |    175K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/meu.txt)               |\n| `mfe`               | Morisyen                     |    172K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mfe.txt)               |\n| `mfh`               | Matal                        |    238K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mfh.txt)               |\n| `mfi`               | Wandala                      |    265K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mfi.txt)               |\n| `mfk`               | North Mofu                   |    248K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mfk.txt)               |\n| `mfq`               | Moba                         |    232K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mfq.txt)               |\n| `mfy`               | Mayo                         |    167K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mfy.txt)               |\n| `mfz`               | Mabaan                       |    237K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mfz.txt)               |\n| `mg`                | Malagasy                     |  1,623K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mg.txt)                |\n| `mgd`               | Moru                         |    192K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mgd.txt)               |\n| `mgh`               | Makhuwa-Meetto               |    150K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mgh.txt)               |\n| `mgo`               | Meta'                        |    251K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mgo.txt)               |\n| `mh`                | Marshallese                  |    750K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mh.txt)                |\n| `mhi`               | Ma'di                        |    192K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mhi.txt)               |\n| `mhl`               | Mauwake                      |    235K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mhl.txt)               |\n| `mhx`               | Maru                         |    291K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mhx.txt)               |\n| `mhy`               | Ma'anyan                     |    190K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mhy.txt)               |\n| `mi`                | Maori                        |  1,504K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mi.txt)                |\n| `mib`               | Atatláhuca Mixtec            |    263K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mib.txt)               |\n| `mif`               | Mofu-Gudur                   |    283K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mif.txt)               |\n| `mil`               | Peñoles Mixtec               |    365K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mil.txt)               |\n| `min`               | Minangkabau                  |    242K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/min.txt)               |\n| `mio`               | Pinotepa Nacional Mixtec     |    288K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mio.txt)               |\n| `miq`               | Mískito                      |    214K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/miq.txt)               |\n| `mit`               | Southern Puebla Mixtec       |    273K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mit.txt)               |\n| `mk`                | Macedonian                   | 10,422K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mk.txt)                |\n| `mkl`               | Mokole                       |    230K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mkl.txt)               |\n| `ml`                | Malayalam                    |    118K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ml.txt)                |\n| `mlh`               | Mape                         |    235K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mlh.txt)               |\n| `mlp`               | Bargam                       |    297K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mlp.txt)               |\n| `mmo`               | Mangga Buang                 |    269K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mmo.txt)               |\n| `mmx`               | Madak                        |    271K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mmx.txt)               |\n| `mna`               | Mbula                        |    257K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mna.txt)               |\n| `mnb`               | Muna                         |    151K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mnb.txt)               |\n| `mnf`               | Mundani                      |    241K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mnf.txt)               |\n| `mnw`               | Mon                          |  1,836K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mnw.txt)               |\n| `moa`               | Mwan                         |    308K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/moa.txt)               |\n| `mog`               | Mongondow                    |    220K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mog.txt)               |\n| `mop`               | Mopán Maya                   |    296K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mop.txt)               |\n| `mor`               | Moro                         |    152K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mor.txt)               |\n| `mox`               | Molima                       |    222K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mox.txt)               |\n| `mpg`               | Marba                        |    210K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mpg.txt)               |\n| `mpm`               | Yosondúa Mixtec              |    336K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mpm.txt)               |\n| `mps`               | Dadibi                       |  1,270K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mps.txt)               |\n| `mpt`               | Mian                         |    256K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mpt.txt)               |\n| `mpx`               | Misima-Panaeati              |    227K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mpx.txt)               |\n| `mqb`               | Mbuko                        |    302K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mqb.txt)               |\n| `mqj`               | Mamasa                       |    164K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mqj.txt)               |\n| `mqn`               | Moronene                     |    164K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mqn.txt)               |\n| `mr`                | Marathi                      | 16,594K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mr.txt)                |\n| `mrw`               | Maranao                      |    912K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mrw.txt)               |\n| `ms`                | Malay                        |    659K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ms.txt)                |\n| `msm`               | Agusan Manobo                |    225K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/msm.txt)               |\n| `msy`               | Aruamu                       |    229K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/msy.txt)               |\n| `mt`                | Maltese                      |  3,331K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mt.txt)                |\n| `mta`               | Cotabato Manobo              |    262K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mta.txt)               |\n| `mti`               | Maiwa (Papua New Guinea)     |    166K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mti.txt)               |\n| `mtj`               | Moskona                      |    321K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mtj.txt)               |\n| `mto`               | Totontepec Mixe              |    233K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mto.txt)               |\n| `mtp`               | Wichí Lhamtés Nocten         |    183K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mtp.txt)               |\n| `muh`               | Mündü                        |    392K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/muh.txt)               |\n| `mur`               | Murle                        |    210K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mur.txt)               |\n| `mux`               | Bo-Ung                       |    363K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mux.txt)               |\n| `muy`               | Muyang                       |    265K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/muy.txt)               |\n| `mva`               | Manam                        |    231K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mva.txt)               |\n| `mvp`               | Duri                         |    174K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mvp.txt)               |\n| `mwv`               | Mentawai                     |    141K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mwv.txt)               |\n| `mxb`               | Tezoatlán Mixtec             |    281K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mxb.txt)               |\n| `mxt`               | Jamiltepec Mixtec            |    267K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mxt.txt)               |\n| `my`                | Burmese                      |  1,007K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/my.txt)                |\n| `my-t-d0-zawgyi`    | Burmese (Zawgyi encoding)    |    593K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/my-t-d0-zawgyi.txt)    |\n| `myb`               | Mbay                         |    192K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/myb.txt)               |\n| `myk`               | Mamara Senoufo               |    272K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/myk.txt)               |\n| `myv`               | Erzya                        |    143K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/myv.txt)               |\n| `myw`               | Muyuw                        |    150K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/myw.txt)               |\n| `myx`               | Masaaba                      |    164K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/myx.txt)               |\n| `myy`               | Macuna                       |    245K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/myy.txt)               |\n| `mza`               | Santa María Zacatepec Mixtec |    316K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mza.txt)               |\n| `mzi`               | Ixcatlán Mazatec             |    190K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mzi.txt)               |\n| `mzk`               | Nigeria Mambila              |    283K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mzk.txt)               |\n| `mzm`               | Mumuye                       |    265K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/mzm.txt)               |\n| `naf`               | Nabak                        |    220K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/naf.txt)               |\n| `nak`               | Nakanai                      |    333K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/nak.txt)               |\n| `nan-Latn`          | Min Nan Chinese (Latin)      |    231K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/nan-Latn.txt)          |\n| `nas`               | Naasioi                      |    168K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/nas.txt)               |\n| `nca`               | Iyo                          |    203K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/nca.txt)               |\n| `nch`               | Central Huasteca Nahuatl     |    195K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/nch.txt)               |\n| `ncj`               | Northern Puebla Nahuatl      |    164K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ncj.txt)               |\n| `ncu`               | Chumburung                   |    312K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ncu.txt)               |\n| `ndj`               | Ndamba                       |    141K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ndj.txt)               |\n| `ndy`               | Lutos                        |    216K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ndy.txt)               |\n| `ndz`               | Ndogo                        |    350K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ndz.txt)               |\n| `neb`               | Toura                        |    326K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/neb.txt)               |\n| `new`               | Newari                       |    150K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/new.txt)               |\n| `nfr`               | Nafaanra                     |    233K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/nfr.txt)               |\n| `ngp`               | Ngulu                        |    149K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ngp.txt)               |\n| `nho`               | Takuu                        |    309K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/nho.txt)               |\n| `nhu`               | Noone                        |    270K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/nhu.txt)               |\n| `nhw`               | Western Huasteca Nahuatl     |    194K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/nhw.txt)               |\n| `nhy`               | Northern Oaxaca Nahuatl      |    185K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/nhy.txt)               |\n| `nia`               | Nias                         |    182K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/nia.txt)               |\n| `nii`               | Nii                          |    316K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/nii.txt)               |\n| `nij`               | Ngaju                        |    194K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/nij.txt)               |\n| `nim`               | Nilamba                      |    117K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/nim.txt)               |\n| `nin`               | Ninzo                        |    267K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/nin.txt)               |\n| `nkf`               | Inpui Naga                   |    197K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/nkf.txt)               |\n| `nko`               | Nkonya                       |    168K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/nko.txt)               |\n| `nl`                | Dutch                        | 58,357K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/nl.txt)                |\n| `nlc`               | Nalca                        |    241K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/nlc.txt)               |\n| `nmz`               | Nawdm                        |    209K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/nmz.txt)               |\n| `nnb`               | Nande                        |    127K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/nnb.txt)               |\n| `nnq`               | Ngindo                       |    137K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/nnq.txt)               |\n| `nnw`               | Southern Nuni                |    291K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/nnw.txt)               |\n| `noa`               | Woun Meu                     |    275K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/noa.txt)               |\n| `nog`               | Nogai                        |    104K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/nog.txt)               |\n| `nop`               | Numanggang                   |    183K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/nop.txt)               |\n| `not`               | Nomatsiguenga                |    141K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/not.txt)               |\n| `nou`               | Ewage-Notu                   |    266K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/nou.txt)               |\n| `npl`               | Southeastern Puebla Nahuatl  |    148K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/npl.txt)               |\n| `npy`               | Napu                         |    192K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/npy.txt)               |\n| `nsn`               | Nehan                        |    248K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/nsn.txt)               |\n| `nsu`               | Sierra Negra Nahuatl         |    170K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/nsu.txt)               |\n| `ntm`               | Nateni                       |    229K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ntm.txt)               |\n| `ntp`               | Northern Tepehuan            |    173K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ntp.txt)               |\n| `ntr`               | Delo                         |    272K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ntr.txt)               |\n| `nuj`               | Nyole                        |    151K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/nuj.txt)               |\n| `nus`               | Nuer                         |    195K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/nus.txt)               |\n| `nvm`               | Namiae                       |    290K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/nvm.txt)               |\n| `nwb`               | Nyabwa                       |    316K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/nwb.txt)               |\n| `nwi`               | Southwest Tanna              |    230K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/nwi.txt)               |\n| `ny`                | Nyanja                       |    356K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/ny.txt)                |\n| `nyf`               | Giryama                      |    169K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/nyf.txt)               |\n| `nyn`               | Nyankole                     |    120K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/nyn.txt)               |\n| `nyo`               | Nyoro                        |    120K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/nyo.txt)               |\n| `nyy`               | Nyakyusa-Ngonde              |    138K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/nyy.txt)               |\n| `nzi`               | Nzima                        |    201K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/nzi.txt)               |\n| `obo`               | Obo Manobo                   |    266K  [💾](http://www.gstatic.com/i18n/corpora/wordcounts/obo.txt)               |\n| `o","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fgoogle%2Fcorpuscrawler","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fgoogle%2Fcorpuscrawler","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fgoogle%2Fcorpuscrawler/lists"}