{"id":48794768,"url":"https://github.com/andrehora/gh-langs","last_synced_at":"2026-04-13T22:35:35.253Z","repository":{"id":345771036,"uuid":"1187297352","full_name":"andrehora/gh-langs","owner":"andrehora","description":"Languages and extensions known to GitHub. Extracted from Linguist.","archived":false,"fork":false,"pushed_at":"2026-04-08T18:11:36.000Z","size":244,"stargazers_count":0,"open_issues_count":1,"forks_count":0,"subscribers_count":0,"default_branch":"main","last_synced_at":"2026-04-08T20:14:41.116Z","etag":null,"topics":["dataset","file-extensions","github","language-statistics","languages","linguist"],"latest_commit_sha":null,"homepage":"https://andrehora.github.io/gh-langs/","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/andrehora.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null,"notice":null,"maintainers":null,"copyright":null,"agents":null,"dco":null,"cla":null}},"created_at":"2026-03-20T15:06:42.000Z","updated_at":"2026-03-25T01:14:30.000Z","dependencies_parsed_at":null,"dependency_job_id":null,"html_url":"https://github.com/andrehora/gh-langs","commit_stats":null,"previous_names":["andrehora/gh-languages-and-extensions","andrehora/gh-langs"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/andrehora/gh-langs","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/andrehora%2Fgh-langs","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/andrehora%2Fgh-langs/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/andrehora%2Fgh-langs/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/andrehora%2Fgh-langs/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/andrehora","download_url":"https://codeload.github.com/andrehora/gh-langs/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/andrehora%2Fgh-langs/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":31774284,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-04-13T20:17:16.280Z","status":"ssl_error","status_checked_at":"2026-04-13T20:17:08.216Z","response_time":93,"last_error":"SSL_read: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["dataset","file-extensions","github","language-statistics","languages","linguist"],"created_at":"2026-04-13T22:35:31.279Z","updated_at":"2026-04-13T22:35:35.245Z","avatar_url":"https://github.com/andrehora.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# gh-langs\n\nComprehensive dataset of language names, extensions, aliases, and filenames extracted from [GitHub Linguist](https://github.com/github-linguist/linguist).\nData is extracted from [languages.yml](https://github.com/github-linguist/linguist/blob/main/lib/linguist/languages.yml).\n\nExplore the data: https://andrehora.github.io/gh-langs/\n\n\u003c!-- updated --\u003e\nUpdated: 2026-03-24\n\u003c!-- /updated --\u003e\n\n## Data\n\n### Basic\n\n\u003c!-- gh:start --\u003e\n| File | Count | Description |\n|------|-------|-------------|\n| [`gh_languages.txt`](data/gh_languages.txt) | 804 | Languages known to GitHub |\n| [`gh_extensions.txt`](data/gh_extensions.txt) | 1448 | Language extensions known to GitHub |\n| [`gh_aliases.txt`](data/gh_aliases.txt) | 413 | Language aliases known to GitHub |\n| [`gh_filenames.txt`](data/gh_filenames.txt) | 399 | Language filenames known to GitHub |\n\u003c!-- gh:end --\u003e\n\n### Languages\n\nPopular GitHub languages comes from GitHub Linguist [popular.yml](https://github.com/github-linguist/linguist/blob/main/lib/linguist/popular.yml).\n\n\u003c!-- summary:start --\u003e\n| File | Count | Description |\n|------|-------|-------------|\n| [`languages.json`](data/languages.json) / [`csv`](data/languages.csv) | 804 | All languages |\n| [`languages_popular.json`](data/languages_popular.json) / [`csv`](data/languages_popular.csv) | 25 | Popular languages |\n\u003c!-- summary:end --\u003e\n\n### Languages by Type\n\n\u003c!-- types:start --\u003e\n| File | Count | Description |\n|------|-------|-------------|\n| [`languages_programming.json`](data/languages_programming.json) / [`csv`](data/languages_programming.csv) | 539 | Programming languages |\n| [`languages_data.json`](data/languages_data.json) / [`csv`](data/languages_data.csv) | 178 | Data languages |\n| [`languages_markup.json`](data/languages_markup.json) / [`csv`](data/languages_markup.csv) | 69 | Markup languages |\n| [`languages_prose.json`](data/languages_prose.json) / [`csv`](data/languages_prose.csv) | 18 | Prose languages |\n\u003c!-- types:end --\u003e\n\n## Fields\n\n- `type`: programming (e.g., Python), data (e.g., JSON), markup (e.g., HTML) or prose (e.g., Markdown).\n- `aliases`: List of additional aliases (optional).\n- `filenames`: List of associated filenames. May be omitted if the extensions field is present (and vice versa).\n- `extensions`: List of associated file extensions.\n\n## Fun Facts\n\n### Languages with the most extensions\n\nSee: [`stats_languages_by_extensions.csv`](data/stats_languages_by_extensions.csv)\n\n| Language | Type | Extensions |\n|----------|------|------------|\n| XML | data | 110 |\n| Roff | markup | 28 |\n| JavaScript | programming | 25 |\n| GLSL | programming | 23 |\n| Ruby | programming | 22 |\n| JSON | data | 21 |\n| C++ | programming | 20 |\n| Roff Manpage | markup | 20 |\n| JSON with Comments | data | 18 |\n| Python | programming | 17 |\n\n### Languages with the most filenames\n\nSee: [`stats_languages_by_filenames.csv`](data/stats_languages_by_filenames.csv)\n\n| Language | Type | Filenames |\n|----------|------|-----------|\n| Shell | programming | 41 |\n| Text | prose | 25 |\n| Ruby | programming | 23 |\n| Ignore List | data | 20 |\n| JSON | data | 18 |\n| Dotenv | data | 14 |\n| JSON with Comments | data | 14 |\n| Makefile | programming | 13 |\n| Emacs Lisp | programming | 11 |\n| XML | data | 10 |\n\n### Languages with the most aliases\n\nSee: [`stats_languages_by_aliases.csv`](data/stats_languages_by_aliases.csv)\n\n| Language | Type | Aliases |\n|----------|------|----------|\n| Roff | markup | 8 |\n| Checksums | data | 5 |\n| QuickBASIC | programming | 5 |\n| Ruby | programming | 5 |\n| Shell | programming | 5 |\n| Visual Basic 6.0 | programming | 5 |\n| Wolfram Language | programming | 5 |\n| Adblock Filter List | data | 4 |\n| Adobe Font Metrics | data | 4 |\n| Batchfile | programming | 4 |\n\n## JSON Examples\n\n### type: programming\n\n```json\n{\n  \"Python\": {\n    \"type\": \"programming\",\n    \"aliases\": [\"py\", \"py3\", \"python3\", \"rusthon\"],\n    \"extensions\": [\".py\", \".cgi\", \".fcgi\", \".gyp\", \".gypi\", \".lmi\", \".py3\", \".pyde\", \".pyi\", \".pyp\", \".pyt\", \".pyw\", \".rpy\", \".spec\", \".tac\", \".wsgi\", \".xpy\"],\n    \"filenames\": [\".gclient\", \"DEPS\", \"SConscript\", \"SConstruct\", \"wscript\"]\n  }\n}\n```\n\n### type: data\n\n```json\n{\n  \"JSON\": {\n    \"type\": \"data\",\n    \"aliases\": [\"geojson\", \"jsonl\", \"sarif\", \"topojson\"],\n    \"extensions\": [\".json\", \".4DForm\", \".4DProject\", \".avsc\", \".geojson\", \".gltf\", \".har\", \".ice\", \".JSON-tmLanguage\", \".json.example\", \".jsonl\", \".mcmeta\", \".sarif\", \".tfstate\", \".tfstate.backup\", \".topojson\", \".webapp\", \".webmanifest\", \".yy\", \".yyp\"],\n    \"filenames\": [\".all-contributorsrc\", \".arcconfig\", \".auto-changelog\", \".c8rc\", \".htmlhintrc\", \".imgbotconfig\", \".nycrc\", \".tern-config\", \".tern-project\", \".watchmanconfig\", \"MODULE.bazel.lock\", \"Package.resolved\", \"Pipfile.lock\", \"composer.lock\", \"deno.lock\", \"flake.lock\", \"mcmod.info\"]\n  }\n}\n```\n\n### type: markup\n\n```json\n{\n  \"HTML\": {\n    \"type\": \"markup\",\n    \"aliases\": [\"xhtml\"],\n    \"extensions\": [\".html\", \".hta\", \".htm\", \".html.hl\", \".inc\", \".xht\", \".xhtml\"]\n  }\n}\n```\n\n### type: prose\n\n```json\n{\n  \"Markdown\": {\n    \"type\": \"prose\",\n    \"aliases\": [\"md\", \"pandoc\"],\n    \"extensions\": [\".md\", \".livemd\", \".markdown\", \".mdown\", \".mdwn\", \".mkd\", \".mkdn\", \".mkdown\", \".ronn\", \".scd\", \".workbook\"],\n    \"filenames\": [\"contents.lr\"]\n  }\n}\n```\n\n## Generating the Dataset\n\nTo regenerate the files from the source `languages.yml`:\n\n```bash\npython parse_languages.py\n```\n\nThis parses [languages.yml](https://github.com/github-linguist/linguist/blob/main/lib/linguist/languages.yml) (from GitHub Linguist) and writes output files to the `data/` folder.","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fandrehora%2Fgh-langs","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fandrehora%2Fgh-langs","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fandrehora%2Fgh-langs/lists"}