{"id":19068718,"url":"https://github.com/machine-learning-tokyo/nlp","last_synced_at":"2026-01-31T10:02:07.909Z","repository":{"id":96775808,"uuid":"291136595","full_name":"Machine-Learning-Tokyo/NLP","owner":"Machine-Learning-Tokyo","description":null,"archived":false,"fork":false,"pushed_at":"2020-08-29T07:47:57.000Z","size":732,"stargazers_count":13,"open_issues_count":0,"forks_count":2,"subscribers_count":3,"default_branch":"master","last_synced_at":"2025-06-08T07:04:57.870Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":null,"has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/Machine-Learning-Tokyo.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2020-08-28T20:08:10.000Z","updated_at":"2022-08-11T13:41:26.000Z","dependencies_parsed_at":null,"dependency_job_id":"84995481-742e-4832-b7d2-bd54111121fa","html_url":"https://github.com/Machine-Learning-Tokyo/NLP","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/Machine-Learning-Tokyo/NLP","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Machine-Learning-Tokyo%2FNLP","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Machine-Learning-Tokyo%2FNLP/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Machine-Learning-Tokyo%2FNLP/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Machine-Learning-Tokyo%2FNLP/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/Machine-Learning-Tokyo","download_url":"https://codeload.github.com/Machine-Learning-Tokyo/NLP/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Machine-Learning-Tokyo%2FNLP/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":28937808,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-01-31T08:53:31.997Z","status":"ssl_error","status_checked_at":"2026-01-31T08:51:38.521Z","response_time":128,"last_error":"SSL_connect returned=1 errno=0 peeraddr=140.82.121.5:443 state=error: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-11-09T01:11:32.185Z","updated_at":"2026-01-31T10:02:07.878Z","avatar_url":"https://github.com/Machine-Learning-Tokyo.png","language":null,"readme":"# NLP resources\n\nLearn about Natural Language Processing, find resources, models, repos, datasets.\n\n## Content\n\n- [A Brief Survey](#a-brief-survey)\n- [NLP Progress](#nlp-progress)\n- [Natural Language Processing Specialization](#natural-language-processing-specialization)\n\n\n## A Brief Survey\n\n### NLP for Supervised Learning - A Brief Survey by Eugene Yan\n\n- Sequential models: RNN (1985), LSTM (1997), GRU (2014)\n- Word embeddings: Word2vec (2013), GloVe (2014), FastText (2016)\n- Word embeddings with context: ELMo (2018)\n- Attention: Transformer (2017)\n- Pre-training: ULMFiT (2017), GPT (2017)\n- Combining the above: BERT (2018)\n- Improving BERT: DistilBERT, ALBERT, RoBERTa, XLNet (2019); Big Bird, Multilingual embeddings (2020)\n- Everything is text-to-text: T5 (2019)\n\nSource: [NLP for Supervised Learning - A Brief Survey](https://eugeneyan.com/writing/nlp-supervised-learning-survey/)\n\n[\u003cp align=\"center\"\u003e\u003cimg src=\"https://github.com/Machine-Learning-Tokyo/NLP/blob/master/images/attention.png\" width=\"800\"\u003e\u003c/p\u003e](https://eugeneyan.com/writing/nlp-supervised-learning-survey/)\n\n\n## NLP Progress\n\n### Tracking Progress in Natural Language Processing by Sebastian Ruder\n\n\"This document aims to track the progress in Natural Language Processing (NLP) and give an overview of the state-of-the-art (SOTA) across the most common NLP tasks and their corresponding datasets.\n\nIt aims to cover both traditional and core NLP tasks such as dependency parsing and part-of-speech tagging as well as more recent ones such as reading comprehension and natural language inference. The main objective is to provide the reader with a quick overview of benchmark datasets and the state-of-the-art for their task of interest, which serves as a stepping stone for further research. To this end, if there is a place where results for a task are already published and regularly maintained, such as a public leaderboard, the reader will be pointed there.\"\n\nSource: [NLP Progress](https://nlpprogress.com/)\n\n[\u003cp align=\"center\"\u003e\u003cimg src=\"https://github.com/Machine-Learning-Tokyo/NLP/blob/master/images/nlpprogress.png\" width=\"800\"\u003e\u003c/p\u003e](https://nlpprogress.com/)\n\n\n## Natural Language Processing Specialization\n\n### Coursera video lectures\n\n\"Natural Language Processing (NLP) uses algorithms to understand and manipulate human language. This technology is one of the most broadly applied areas of machine learning. As AI continues to expand, so will the demand for professionals skilled at building models that analyze speech and language, uncover contextual patterns, and produce insights from text and audio.\"\n\nSource: [Coursera](https://www.coursera.org/specializations/natural-language-processing)\n\n[\u003cp align=\"center\"\u003e\u003cimg src=\"https://github.com/Machine-Learning-Tokyo/NLP/blob/master/images/coursera_nlp.png\" width=\"800\"\u003e\u003c/p\u003e](https://nlpprogress.com/)\n\n","funding_links":[],"categories":[],"sub_categories":[],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmachine-learning-tokyo%2Fnlp","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fmachine-learning-tokyo%2Fnlp","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmachine-learning-tokyo%2Fnlp/lists"}