{"id":13558877,"url":"https://github.com/datasets/language-codes","last_synced_at":"2025-04-04T20:14:18.218Z","repository":{"id":25719276,"uuid":"29156207","full_name":"datasets/language-codes","owner":"datasets","description":"ISO Language Codes (639-1 and 639-2)","archived":false,"fork":false,"pushed_at":"2024-10-25T14:25:01.000Z","size":59012,"stargazers_count":101,"open_issues_count":1,"forks_count":61,"subscribers_count":22,"default_branch":"main","last_synced_at":"2025-03-28T19:09:45.444Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":"https://datahub.io/core/language-codes","language":"Shell","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/datasets.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2015-01-12T20:49:43.000Z","updated_at":"2025-03-27T17:32:26.000Z","dependencies_parsed_at":"2024-11-04T10:42:04.901Z","dependency_job_id":null,"html_url":"https://github.com/datasets/language-codes","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/datasets%2Flanguage-codes","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/datasets%2Flanguage-codes/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/datasets%2Flanguage-codes/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/datasets%2Flanguage-codes/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/datasets","download_url":"https://codeload.github.com/datasets/language-codes/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":247242680,"owners_count":20907134,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-08-01T12:05:12.583Z","updated_at":"2025-04-04T20:14:18.193Z","avatar_url":"https://github.com/datasets.png","language":"Shell","funding_links":[],"categories":["Shell","others"],"sub_categories":[],"readme":"\u003ca className=\"gh-badge\" href=\"https://datahub.io/core/language-codes\"\u003e\u003cimg src=\"https://badgen.net/badge/icon/View%20on%20datahub.io/orange?icon=https://datahub.io/datahub-cube-badge-icon.svg\u0026label\u0026scale=1.25\" alt=\"badge\" /\u003e\u003c/a\u003e\n\n## Description\n\nComprehensive language code information, consisting of ISO 639-1, ISO 639-2 and IETF language types.\n\n## Data\n\nData is taken from the [Library of Congress](http://www.loc.gov/standards/iso639-2/iso639-2ra.html) as the ISO 639-2 Registration Authority, and from the [Unicode Common Locale Data Repository](http://cldr.unicode.org/).\n\n### data/language-codes.csv \n\nThis file contains the 184 languages with __ISO 639-1__ (alpha 2 / two letter) codes and their English names.\n\n### data/language-codes-3b2.csv \n\nThis file contains the 184 languages with both __ISO 639-2__ (alpha 3 / three letter) bibliographic codes and ISO 639-1 codes, and their English names.\n\n### data/language-codes-full.csv\n\nThis file is more exhaustive.\n\nIt contains all languages with __ISO 639-2__ (alpha 3 / three letter) codes, the respective ISO 639-1 codes (if present), as well as the English and French name of each language.\n\nThere are two versions of the three letter codes: bibliographic and terminologic. Each language has a bibliographic code but only a few languages have terminologic codes. Terminologic codes are chosen to be similar to the corresponding ISO 639-1 two letter codes.\n\nExample from [Wikipedia](https://en.wikipedia.org/wiki/ISO_639#Relations_between_the_parts):\n\u003e [...] the German language (Part 1: `de`) has two codes in Part 2: `ger` (T code) and `deu` (B code), whereas there is only one code in Part 2, `eng`, for the English language.\n\nThere are four special codes: *mul*, *und*, *mis*, *zxx*; and a reserved range *qaa-qtz*.\n\n### data/ietf-language-tags.csv\n\nThis file lists all IETF language tags of the official resource indicated by http://www.iana.org/assignments/language-tag-extensions-registry \nthat into the `/main` folder of http://www.unicode.org/Public/cldr/latest/core.zip (project [cldr.unicode.org](http://cldr.unicode.org)).\n\n## Preparation\n\nThis dataset is automatically updated using Github Workflows using scripts to gather `ietf-language-tags.csv` and different `language-codes` data.\n\n## License\n\nThis material is licensed by its maintainers under the [Public Domain Dedication and License (PDDL)](http://opendatacommons.org/licenses/pddl/1.0/).\n\nNevertheless, it should be noted that this material is ultimately sourced from the Library of Congress as a Registration Authority for ISO and their licensing policies are somewhat unclear. As this is a short, simple database of facts, there is a strong argument that no rights can subsist in this collection.\n\nHowever, if you intended to use these data in a public or commercial product, please check the original sources for any specific restrictions.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdatasets%2Flanguage-codes","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fdatasets%2Flanguage-codes","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdatasets%2Flanguage-codes/lists"}