{"id":13754403,"url":"https://github.com/destwang/CTCResources","last_synced_at":"2025-05-09T22:32:20.126Z","repository":{"id":47122374,"uuid":"388299774","full_name":"destwang/CTCResources","owner":"destwang","description":null,"archived":false,"fork":false,"pushed_at":"2024-07-26T02:41:27.000Z","size":362,"stargazers_count":258,"open_issues_count":2,"forks_count":34,"subscribers_count":9,"default_branch":"main","last_synced_at":"2024-11-16T07:33:34.617Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":null,"has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"apache-2.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/destwang.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2021-07-22T02:06:52.000Z","updated_at":"2024-11-02T10:42:20.000Z","dependencies_parsed_at":"2024-08-03T09:17:22.795Z","dependency_job_id":null,"html_url":"https://github.com/destwang/CTCResources","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/destwang%2FCTCResources","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/destwang%2FCTCResources/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/destwang%2FCTCResources/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/destwang%2FCTCResources/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/destwang","download_url":"https://codeload.github.com/destwang/CTCResources/tar.gz/refs/heads/main","host":{"name":"Git
Hub","url":"https://github.com","kind":"github","repositories_count":253336051,"owners_count":21892780,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-08-03T09:01:58.417Z","updated_at":"2025-05-09T22:32:15.113Z","avatar_url":"https://github.com/destwang.png","language":null,"funding_links":[],"categories":["其他_NLP自然语言处理"],"sub_categories":["其他_文本生成、文本对话"],"readme":"[**English**](https://github.com/destwang/CTCResources) | [**中文**](https://github.com/destwang/CTCResources/blob/main/README_ZH.md)\n\n# CTCResources\n\nResources for Chinese text correction (CTC). The resource list is mainly maintained by Baoxin Wang and Honghong Zhao from HFL (哈工大讯飞联合实验室).\n\n## Contents\n\n- [CTCResources](#ctcresources)\n  - [Contents](#contents)\n  - [Definition](#definition)\n    - [Chinese Spelling Check (CSC)](#chinese-spelling-check-csc)\n    - [Grammatical Error Correction (GEC)](#grammatical-error-correction-gec)\n  - [Competition \u0026 Shared Task](#competition--shared-task)\n    - [CAIL 2022 Legal Instrument Correction（2022.09.01-2022.11.15）!](#cail-2022-legal-instrument-correction20220901-20221115)\n    - [CCL2022 Chinese Learner Text Correction（2022.06.05-2022.09.10）](#ccl2022-chinese-learner-text-correction20220605-20220910)\n  - [Experimental Results on Public Datasets](#experimental-results-on-public-datasets)\n    - [SIGHAN 2015](#sighan-2015)\n  - [Papers](#papers)\n    - [CSC Papers](#csc-papers)\n    - [GEC Papers](#gec-papers)\n  - [Datasets](#datasets)\n    - [CTC 2021 :](#ctc-2021-)\n    - [Others:](#others)\n  - [Systems \u0026 API](#systems--api)\n  - [Other Resources](#other-resources)\n    - [Related Articles](#related-articles)\n    - [Shared Task](#shared-task)\n  - [* CTC 2021](#-ctc-2021)\n\n\n## Definition\n### Chinese Spelling Check (CSC)\nChinese spelling check (CSC) is the task of detecting and correcting spelling errors in Chinese text.\n\n### Grammatical Error Correction (GEC)\nGrammatical Error Correction (GEC) is the task of correcting different kinds of errors in text such as spelling, punctuation, grammatical, and word choice errors.\n\n\n## Competition \u0026 Shared Task\n### CAIL 2022 Legal Instrument Correction（2022.09.01-2022.11.15）![](https://img.shields.io/badge/hot-red.svg)\n[Legal Instrument Correction](http://cail.cipsc.org.cn/task2.html?raceID=2\u0026cail_tag=2022) aims to help judicial personnel automatically detect and correct errors in legal documents through machine learning. This task covers four types of errors in legal documents: spelling errors, redundant errors, missing errors and word order errors.\n\n### CCL2022 Chinese Learner Text Correction（2022.06.05-2022.09.10）\n\n[CLTC](https://github.com/blcuicall/CCL2022-CLTC)（Chinese Learner Text Correction）aims to automatically detect and correct punctuation, spelling, grammatical, semantic, and other errors in Chinese learners' texts, so as to obtain correct sentences.\n\n## Experimental Results on Public Datasets\n### SIGHAN 2015\n*We have not collected the character-level results because of bugs in the commonly used character-level evaluation script. 
It is also recommended that researchers **do not** use character-level evaluation script or fix the evaluation script before using it.*\n\n* Without Pretraining \n\n| Model | D-P | D-R | D-F | C-P | C-R | C-F |\n| - | - | - | - | - | - | - |\n| [FASPell](https://github.com/iqiyi/FASPell) | 67.6 | 60.0 | 63.5 | 66.6 | 59.1 | 62.6 |\n| BERT | 73.7 | 78.2 | 75.9 | 70.9 | 75.2 | 73.0 |\n| RoBERTa | 74.7 | 77.3 | 76.0 | 72.1 | 74.5 | 73.3 |\n| [SpellGCN](https://github.com/ACL2020SpellGCN/SpellGCN) | 74.8 | **80.7** | 77.7 | 72.1 | **77.7** | 74.8 (75.9) |\n| [DCN](https://github.com/destwang/DCN) | **76.6** | 79.8 | **78.2** | **74.2** | 77.3 | **75.7** |\n\n* With Pretraining\n\n| Model | D-P | D-R | D-F | C-P | C-R | C-F |\n| - | - | - | - | - | - | - |\n| [BERT_CRS + GAD](https://aclanthology.org/2021.findings-acl.122.pdf) | 75.6 | 80.4 | 77.9 | 73.2 | 77.8 | 75.4 |\n| [DCN-pretrain](https://github.com/destwang/DCN) | 77.1 | 80.9 | 79.0 | 74.5 | 78.2 | 76.3 |\n| [REALISE](https://github.com/DaDaMrX/ReaLiSe) | 77.3 | 81.3 | 79.3 | 75.9 | 79.9 | 77.8 |\n| [PLOME](https://github.com/liushulinle/PLOME) | 77.4 | 81.5 | 79.4 | 75.3 | 79.3 | 77.2 |\n| [Soft-Masked BERT](https://aclanthology.org/2020.acl-main.82.pdf) | 73.7 | 73.2 | 73.5 | 66.7 | 66.2 | 66.4 |\n| [Soft-Masked BERT_SSCL](https://aclanthology.org/2021.emnlp-main.281.pdf) | **86.3** | 72.5 | 78.8 | **85.2** | 66.0 | 74.4 |\n| [MLM-phonetics](https://aclanthology.org/2021.findings-acl.198.pdf) | 77.5 | **83.1** | 80.2 | 74.9 | 80.2 | 77.5 |\n| [MDCSpell](https://aclanthology.org/2022.findings-acl.98.pdf) | 80.8 | 80.6 | **80.7** | 78.4 | 78.2 | 78.3 |\n| [ECOPO(BERT)](https://aclanthology.org/2022.findings-acl.252.pdf) | 78.2 | 82.3 | 80.2 | 76.6 | 80.4 | 78.4 |\n| [ECOPO(REALISE)](https://aclanthology.org/2022.findings-acl.252.pdf) | 77.5 | 82.6 | 80.0 | 76.1 | **81.2** | **78.5** |\n\n## Papers\n### CSC Papers\n\u003e ### 2022\n**MDCSpell: A Multi-task Detector-Corrector Framework for Chinese Spelling 
Correction**. Findings of ACL 2022.  \nChenxi Zhu, Ziqiang Ying, Boyu Zhang, Feng Mao. [[pdf](https://aclanthology.org/2022.findings-acl.98.pdf)]\n\n**CRASpell: A Contextual Typo Robust Approach to Improve Chinese Spelling Correction**. Findings of ACL 2022.  \nShulin Liu, Shengkang Song, Tianchi Yue, Tao Yang, Huihui Cai, TingHao Yu, Shengli Sun. [[pdf](https://aclanthology.org/2022.findings-acl.237.pdf)]\n\n**The Past Mistake is the Future Wisdom: Error-driven Contrastive Probability Optimization for Chinese Spell Checking**.  Findings of ACL 2022.  \nYinghui Li, Qingyu Zhou, Yangning Li, Zhongli Li, Ruiyang Liu, Rongyi Sun, Zizhen Wang, Chao Li, Yunbo Cao, Hai-Tao Zheng. [[pdf](https://aclanthology.org/2022.findings-acl.252.pdf)]\n\n\u003e ### 2021\n**PLOME: Pre-training with Misspelled Knowledge for Chinese Spelling Correction**. ACL 2021.  \nShulin Liu, Tao Yang, Tianchi Yue, Feng Zhang and Di Wang. [[pdf](https://aclanthology.org/2021.acl-long.233.pdf)], [[code](https://github.com/liushulinle/PLOME)].\n\n**PHMOSpell: Phonological and Morphological Knowledge Guided Chinese Spelling Check**. ACL 2021.  \nLi Huang, Junjie Li, Weiwei Jiang, Zhiyu Zhang, Minchuan Chen, Shaojun Wang and Jing Xiao. [[pdf](https://aclanthology.org/2021.acl-long.464.pdf)].\n\n**Exploration and Exploitation: Two Ways to Improve Chinese Spelling Correction Models**. ACL 2021.  \nChong Li, Cenyuan Zhang, Xiaoqing Zheng and Xuanjing Huang. [[pdf](https://arxiv.org/pdf/2105.14813.pdf)], [[code](https://github.com/FDChongli/TwoWaysToImproveCSC)].\n\n**Dynamic Connected Networks for Chinese Spelling Check**. Findings of ACL 2021.  \nBaoxin Wang, Wanxiang Che, Dayong Wu, Shijin Wang, Guoping Hu and Ting Liu. [[pdf](https://aclanthology.org/2021.findings-acl.216.pdf)], [[code](https://github.com/destwang/DCN)].\n\n**Read, Listen, and See: Leveraging Multimodal Information Helps Chinese Spell Checking**. Findings of ACL 2021.  
\nHeng-Da Xu, Zhongli Li, Qingyu Zhou, Chao Li, Zizhen Wang, Yunbo Cao, Heyan Huang and Xian-Ling Mao. [[pdf](https://arxiv.org/pdf/2105.12306.pdf)], [[code](https://github.com/DaDaMrX/ReaLiSe)].\n\n**Global Attention Decoder for Chinese Spelling Error Correction**. Findings of ACL 2021.  \nZhao Guo, Yuan Ni, Keqiang Wang, Wei Zhu and Guotong Xie. [[pdf](https://aclanthology.org/2021.findings-acl.122.pdf)].\n\n**Correcting Chinese Spelling Errors with Phonetic Pre-training**. Findings of ACL 2021.  \nRuiqing Zhang, Chao Pang, Chuanqiang Zhang, Shuohuan Wang, Zhongjun He, Yu Sun, Hua Wu and Haifeng Wang. [[pdf](https://aclanthology.org/2021.findings-acl.198.pdf)].\n\n**DCSpell: A Detector-Corrector Framework for Chinese Spelling Error Correction**. SIGIR 2021.  \nJing Li, Gaosheng Wu, Dafei Yin, Haozhao Wang, Yonggang Wang. [[pdf](https://dl.acm.org/doi/pdf/10.1145/3404835.3463050)].\n\n**SpellBERT: A Lightweight Pretrained Model for Chinese Spelling Check**. EMNLP 2021.  \nTuo Ji, Hang Yan, Xipeng Qiu. [[pdf](https://aclanthology.org/2021.emnlp-main.287.pdf)].\n\n**Self-Supervised Curriculum Learning for Spelling Error Correction**. EMNLP 2021.  \nZifa Gan, Hongfei Xu, Hongying Zan. [[pdf](https://aclanthology.org/2021.emnlp-main.281.pdf)].\n\n**An Alignment-Agnostic Model for Chinese Text Error Correction**. Findings of EMNLP 2021.  \nLiying Zheng, Yue Deng, Weishun Song, Liang Xu and Jing Xiao. [[pdf](https://aclanthology.org/2021.findings-emnlp.30.pdf)].\n\n**Domain-Shift Conditioning Using Adaptable Filtering Via Hierarchical Embeddings for Robust Chinese Spell Check**. TASLP 2021.  \nMinh Nguyen, Gia H. Ngo, and Nancy F. Chen. [[pdf](https://ieeexplore.ieee.org/abstract/document/9439969)].\n\n\u003e ### 2020\n**Chunk-based Chinese Spelling Check with Global Optimization**. Findings of EMNLP 2020.  \nZuyi Bao, Chen Li and Rui Wang. 
[[pdf](https://aclanthology.org/2020.findings-emnlp.184.pdf)].\n\n**SpellGCN: Incorporating Phonological and Visual Similarities into Language Models for Chinese Spelling Check**. ACL 2020.  \nXingyi Cheng, Weidi Xu, Kunlong Chen, Shaohua Jiang, Feng Wang, Taifeng Wang, Wei Chu and Yuan Qi. [[pdf](https://aclanthology.org/2020.acl-main.81.pdf)], [[code](https://github.com/ACL2020SpellGCN/SpellGCN)].\n\n**Spelling Error Correction with Soft-Masked BERT**. ACL 2020.  \nShaohua Zhang, Haoran Huang, Jicong Liu and Hang Li. [[pdf](https://aclanthology.org/2020.acl-main.82.pdf)].\n\n\u003e ### 2019\n**FASPell: A Fast, Adaptable, Simple, Powerful Chinese Spell Checker Based On DAE-Decoder Paradigm**. EMNLP 2019 Workshop W-NUT.  \nYuzhong Hong, Xianguo Yu, Neng He, Nan Liu, Junhui Liu. [[pdf](https://aclanthology.org/D19-5522.pdf)], [[code](https://github.com/iqiyi/FASPell)].\n\n**Confusionset-guided Pointer Networks for Chinese Spelling Check**. ACL 2019.  \nDingmin Wang, Yi Tay, Li Zhong. [[pdf](https://aclanthology.org/P19-1578.pdf)], [[code](https://github.com/sunnyqiny/Confusionset-guided-Pointer-Networks-for-Chinese-Spelling-Check)].\n\n### GEC Papers\n\u003e ### 2022\n\n**Ensembling and Knowledge Distilling of Large Sequence Taggers for Grammatical Error Correction**. ACL 2022.  \nMaksym Tarnavskyi, Artem Chernodub, Kostiantyn Omelianchuk. [[pdf](https://aclanthology.org/2022.acl-long.266/)]\n\n**Interpretability for Language Learners Using Example-Based Grammatical Error Correction**. ACL 2022.  \nMasahiro Kaneko, Sho Takase, Ayana Niwa, Naoaki Okazaki. [[pdf](https://aclanthology.org/2022.acl-long.496/)]\n\n**Adjusting the Precision-Recall Trade-Off with Align-and-Predict Decoding for Grammatical Error Correction**. ACL 2022 short.  \nXin Sun, Houfeng Wang. [[pdf](https://aclanthology.org/2022.acl-short.77/)]\n\n**“Is Whole Word Masking Always Better for Chinese BERT?”: Probing on Chinese Grammatical Error Correction**. Findings of ACL 2022.  
\nYong Dai, Linyang Li, Cong Zhou, Zhangyin Feng, Enbo Zhao, Xipeng Qiu, Piji Li, Duyu Tang. [[pdf](https://aclanthology.org/2022.findings-acl.1/)]\n\n**Type-Driven Multi-Turn Corrections for Grammatical Error Correction**. Findings of ACL 2022.  \nShaopeng Lai, Qingyu Zhou, Jiali Zeng, Zhongli Li, Chao Li, Yunbo Cao, Jinsong Su. [[pdf](https://aclanthology.org/2022.findings-acl.254/)]\n\n**Reusing a Multi-lingual Setup to Bootstrap a Grammar Checker for a Very Low Resource Language without Data**. ComputEL 2022 Workshop.  \nInga Lill Sigga Mikkelsen, Linda Wiechetek, Flammie A Pirinen. [[pdf](https://aclanthology.org/2022.computel-1.19/)]\n\n\u003e ### 2021\n\n**Instantaneous Grammatical Error Correction with Shallow Aggressive Decoding**. ACL 2021.  \nXin Sun, Tao Ge, Furu Wei, Houfeng Wang.[[pdf](https://aclanthology.org/2021.acl-long.462/)].[[code](https://github.com/AutoTemp/Shallow-Aggressive-Decoding)].\n\n**Tail-to-Tail Non-Autoregressive Sequence Prediction for Chinese Grammatical Error Correction**. ACL 2021.  \nPiji Li, Shuming Shi.[[pdf](https://aclanthology.org/2021.acl-long.385/)].[[code](https://github.com/lipiji/TtT)].\n\n**A Simple Recipe for Multilingual Grammatical Error Correction**. ACL 2021 short.  \nSascha Rothe, Jonathan Mallinson, Eric Malmi, Sebastian Krause, Aliaksei Severyn.[[pdf](https://aclanthology.org/2021.acl-short.89/)].\n\n**Synthetic Data Generation for Grammatical Error Correction with Tagged Corruption Models**. EACL-BEA 2021.  \nFelix Stahlberg, Shankar Kumar.[[pdf](https://aclanthology.org/2021.bea-1.4/)].\n\n**Document-level grammatical error correction**. EACL-BEA 2021.  \nZheng Yuan, Christopher Bryant.[[pdf](https://aclanthology.org/2021.bea-1.8/)].\n\n**Data Strategies for Low-Resource Grammatical Error Correction**. EACL-BEA 2021.  \nSimon Flachs, Felix Stahlberg, Shankar Kumar.[[pdf](https://aclanthology.org/2021.bea-1.12/)].\n\n**Assessing Grammatical Correctness in Language Learning**. EACL-BEA 2021.  
\nAnisia Katinskaia, Roman Yangarber.[[pdf](https://aclanthology.org/2021.bea-1.15/)].\n\n**Neural Quality Estimation with Multiple Hypotheses for Grammatical Error Correction**. NAACL2021.  \nZhenghao Liu, Xiaoyuan Yi, Maosong Sun, Liner Yang, Tat-Seng Chua.[[pdf](https://aclanthology.org/2021.naacl-main.429/)].\n\n**Comparison of Grammatical Error Correction Using Back-Translation Models**. NAACL2021 workshop.  \nAomi Koyama, Kengo Hotate, Masahiro Kaneko, Mamoru Komachi.[[pdf](https://aclanthology.org/2021.naacl-srw.16/)].\n\n**LM-Critic: Language Models for Unsupervised Grammatical Error Correction**. EMNLP 2021.  \nMichihiro Yasunaga, Jure Leskovec and Percy Liang.[[pdf](https://arxiv.org/abs/2109.06822)].\n\n**Multi-Class Grammatical Error Detection for Correction: A Tale of Two Systems**. EMNLP 2021.  \nZheng Yuan, Shiva Taslimipoor, Christopher Davis and Christopher Bryant.\n\n\u003e ### 2020\n\n**On the Robustness of Language Encoders against Grammatical Errors**. ACL 2020.  \nFan Yin, Quanyu Long, Tao Meng, Kai-Wei Chang.[[pdf](https://aclanthology.org/2020.acl-main.310/)].\n\n**Encoder-Decoder Models Can Benefit from Pre-trained Masked Language Models in Grammatical Error Correction**. ACL 2020.  \nMasahiro Kaneko, Masato Mita, Shun Kiyono, Jun Suzuki, Kentaro Inui.[[pdf](https://aclanthology.org/2020.acl-main.391/)].\n\n**Grammatical Error Correction Using Pseudo Learner Corpus Considering Learner’s Error Tendency**. ACL 2020 workshop.  \nYujin Takahashi, Satoru Katsumata, Mamoru Komachi.[[pdf](https://aclanthology.org/2020.acl-srw.5/)].\n\n**GECToR – Grammatical Error Correction: Tag, Not Rewrite**. ACL-BEA 2020.  \nKostiantyn Omelianchuk, Vitaliy Atrasevych, Artem Chernodub, Oleksandr Skurzhanskyi.[[pdf](https://aclanthology.org/2020.bea-1.16/)].\n\n**A Comparative Study of Synthetic Data Generation Methods for Grammatical Error Correction**. ACL-BEA 2020.  
\nMax White, Alla Rozovskaya.[[pdf](https://aclanthology.org/2020.bea-1.21/)].\n\n**Improving Grammatical Error Correction Models with Purpose-Built Adversarial Examples**. EMNLP 2020.  \nLihao Wang, Xiaoqing Zheng.[[pdf](https://aclanthology.org/2020.emnlp-main.228/)].\n\n**Improving the Efficiency of Grammatical Error Correction with Erroneous Span Detection and Correction**. EMNLP 2020.  \nMengyun Chen, Tao Ge, Xingxing Zhang, Furu Wei, Ming Zhou.[[pdf](https://aclanthology.org/2020.emnlp-main.581/)].\n\n**Grammatical Error Correction in Low Error Density Domains: A New Benchmark and Analyses**. EMNLP 2020.  \nSimon Flachs, Ophélie Lacroix, Helen Yannakoudakis, Marek Rei, Anders Søgaard.[[pdf](https://aclanthology.org/2020.emnlp-main.680/)].\n\n**Adversarial Grammatical Error Correction**. Findings of EMNLP 2020.  \nVipul Raheja, Dimitris Alikaniotis.[[pdf](https://aclanthology.org/2020.findings-emnlp.275/)].\n\n**A Self-Refinement Strategy for Noise Reduction in Grammatical Error Correction**. Findings of EMNLP 2020.  \nMasato Mita, Shun Kiyono, Masahiro Kaneko, Jun Suzuki, Kentaro Inui.[[pdf](https://aclanthology.org/2020.findings-emnlp.26/)].\n\n**Improving Grammatical Error Correction with Machine Translation Pairs**. Findings of EMNLP 2020.  \nWangchunshu Zhou, Tao Ge, Chang Mu, Ke Xu, Furu Wei, Ming Zhou.[[pdf](https://aclanthology.org/2020.findings-emnlp.30/)].\n\n**Chinese Grammatical Correction Using BERT-based Pre-trained Model**. AACL 2020.  \nHongfei Wang, Michiki Kurosawa, Satoru Katsumata, Mamoru Komachi.[[pdf](https://aclanthology.org/2020.aacl-main.20/)].\n\n**Stronger Baselines for Grammatical Error Correction Using a Pretrained Encoder-Decoder Model**. AACL 2020.  \nSatoru Katsumata, Mamoru Komachi.[[pdf](https://aclanthology.org/2020.aacl-main.83/)].\n\n**Generating Diverse Corrections with Local Beam Search for Grammatical Error Correction**. COLING 2020.  
\nKengo Hotate, Masahiro Kaneko, Mamoru Komachi.[[pdf](https://aclanthology.org/2020.coling-main.193/)].\n\n**Heterogeneous Recycle Generation for Chinese Grammatical Error Correction**. COLING 2020.  \nCharles Hinson, Hen-Hsen Huang, Hsin-Hsi Chen.[[pdf](https://aclanthology.org/2020.coling-main.199/)].\n\n**Improving Grammatical Error Correction with Data Augmentation by Editing Latent Representation**. COLING 2020.  \nZhaohong Wan, Xiaojun Wan, Wenguang Wang.[[pdf](https://aclanthology.org/2020.coling-main.200/)].\n\n**Cross-lingual Transfer Learning for Grammatical Error Correction**. COLING 2020.  \nIkumi Yamashita, Satoru Katsumata, Masahiro Kaneko, Aizhan Imankulova,Mamoru Komachi.[[pdf](https://aclanthology.org/2020.coling-main.415/)].\n\n\u003e ### 2019\n\n**Cross-Sentence Grammatical Error Correction**. ACL 2019.  \nShamil Chollampatt, Weiqi Wang, Hwee Tou Ng.[[pdf](https://aclanthology.org/P19-1042/)].\n\n**Automatic Grammatical Error Correction for Sequence-to-sequence Text Generation: An Empirical Study**. ACL 2019.  \nTao Ge, Xingxing Zhang, Furu Wei, Ming Zhou.[[pdf](https://aclanthology.org/P19-1609/)].\n\n**Controlling Grammatical Error Correction Using Word Edit Rate**. ACL 2019.  \nKengo Hotate, Masahiro Kaneko, Satoru Katsumata, Mamoru Komachi.[[pdf](https://aclanthology.org/P19-2020/)].\n\n**Context is Key: Grammatical Error Detection with Contextual Word Representations**. ACL-BEA 2019.  \nSamuel Bell, Helen Yannakoudakis, Marek Rei.[[pdf](https://aclanthology.org/W19-4410/)].\n\n**The Unreasonable Effectiveness of Transformer Language Models in Grammatical Error Correction**. ACL-BEA 2019.  \nDimitris Alikaniotis, Vipul Raheja.[[pdf](https://aclanthology.org/W19-4412/)].\n\n**(Almost) Unsupervised Grammatical Error Correction using Synthetic Comparable Corpus**. ACL-BEA 2019.  \nSatoru Katsumata, Mamoru Komachi.[[pdf](https://aclanthology.org/W19-4413/)].\n\n**Learning to combine Grammatical Error Corrections**. ACL-BEA 2019.  
\nYoav Kantor, Yoav Katz, Leshem Choshen, Edo Cohen-Karlik, Naftali Liberman, Assaf Toledo, Amir Menczel, Noam Slonim.[[pdf](https://aclanthology.org/W19-4414/)].\n\n**Erroneous data generation for Grammatical Error Correction**. ACL-BEA 2019.  \nShuyao Xu, Jiehao Zhang, Jin Chen, Long Qin.[[pdf](https://aclanthology.org/W19-4415/)].\n\n**The CUED’s Grammatical Error Correction Systems for BEA-2019**. ACL-BEA 2019.  \nFelix Stahlberg, Bill Byrne.[[pdf](https://aclanthology.org/W19-4417/)].\n\n**CUNI System for the Building Educational Applications 2019 Shared Task: Grammatical Error Correction**. ACL-BEA 2019.  \nJakub Náplava, Milan Straka.[[pdf](https://aclanthology.org/W19-4419/)].\n\n**Noisy Channel for Low Resource Grammatical Error Correction**. ACL-BEA 2019.  \nSimon Flachs, Ophélie Lacroix, Anders Søgaard.[[pdf](https://aclanthology.org/W19-4420/)].\n\n**TMU Transformer System Using BERT for Re-ranking at BEA 2019 Grammatical Error Correction on Restricted Track**. ACL-BEA 2019.  \nMasahiro Kaneko, Kengo Hotate, Satoru Katsumata, Mamoru Komachi.[[pdf](https://aclanthology.org/W19-4422/)].\n\n**A Neural Grammatical Error Correction System Built On Better Pre-training and Sequential Transfer Learning**. ACL-BEA 2019.  \nYo Joong Choe, Jiyeon Ham, Kyubyong Park, Yeoil Yoon.[[pdf](https://aclanthology.org/W19-4423/)].\n\n**Neural and FST-based approaches to grammatical error correction**. ACL-BEA 2019.  \nZheng Yuan, Felix Stahlberg, Marek Rei, Bill Byrne, Helen Yannakoudakis.[[pdf](https://aclanthology.org/W19-4424/)].\n\n**Improving Precision of Grammatical Error Correction with a Cheat Sheet**. ACL-BEA 2019.  \nMengyang Qiu, Xuejiao Chen, Maggie Liu, Krishna Parvathala, Apurva Patil, Jungyeul Park.[[pdf](https://aclanthology.org/W19-4425/)].\n\n**Multi-headed Architecture Based on BERT for Grammatical Errors Correction**. ACL-BEA 2019.  
\nBohdan Didenko, Julia Shaptala.[[pdf](https://aclanthology.org/W19-4426/)].\n\n**Neural Grammatical Error Correction Systems with Unsupervised Pre-training on Synthetic Data**. ACL-BEA 2019.  \nRoman Grundkiewicz, Marcin Junczys-Dowmunt, Kenneth Heafield.[[pdf](https://aclanthology.org/W19-4427/)].\n\n**The Unbearable Weight of Generating Artificial Errors for Grammatical Error Correction**. ACL-BEA 2019.  \nPhu Mon Htut, Joel Tetreault.[[pdf](https://aclanthology.org/W19-4449/)].\n\n**An Empirical Study of Incorporating Pseudo Data into Grammatical Error Correction**. EMNLP 2019.  \nShun Kiyono, Jun Suzuki, Masato Mita, Tomoya Mizumoto, Kentaro Inui.[[pdf](https://arxiv.org/abs/1909.00502)].\n\n**Encode, Tag, Realize: High-Precision Text Editing**. EMNLP 2019.  \nEric Malmi, Sebastian Krause, Sascha Rothe, Daniil Mirylenka, Aliaksei Severyn.[[pdf](https://aclanthology.org/D19-1510/)].\n\n**Personalizing Grammatical Error Correction: Adaptation to Proficiency Level and L1**. EMNLP 2019.  \nMaria Nadejde, Joel Tetreault.[[pdf](https://aclanthology.org/D19-5504/)].\n\n**Grammatical Error Correction in Low-Resource Scenarios**. EMNLP 2019.  \nJakub Náplava, Milan Straka.[[pdf](https://aclanthology.org/D19-5545/)].\n\n**Minimally-Augmented Grammatical Error Correction**. EMNLP 2019.  \nRoman Grundkiewicz, Marcin Junczys-Dowmunt.[[pdf](https://aclanthology.org/D19-5546/)].\n\n**Parallel Iterative Edit Models for Local Sequence Transduction**. EMNLP 2019.  \nAbhijeet Awasthi, Sunita Sarawagi, Rasna Goyal, Sabyasachi Ghosh, Vihari Piratla.[[pdf](https://aclanthology.org/D19-1435/)].\n\n**Learning to Copy for Automatic Post-Editing**. EMNLP 2019.  \nXuancheng Huang, Yang Liu, Huanbo Luan, Jingfang Xu, Maosong Sun.[[pdf](https://aclanthology.org/D19-1634/)].\n\n**Improving Grammatical Error Correction via Pre-Training a Copy-Augmented Architecture with Unlabeled Data**. NAACL 2019.  
\nWei Zhao, Liang Wang, Kewei Shen, Ruoyu Jia, Jingming Liu.[[pdf](https://aclanthology.org/N19-1014/)].\n\n**Cross-Corpora Evaluation and Analysis of Grammatical Error Correction Models — Is Single-Corpus Evaluation Enough?**. NAACL 2019.  \nMasato Mita, Tomoya Mizumoto, Masahiro Kaneko, Ryo Nagata, Kentaro Inui.[[pdf](https://aclanthology.org/N19-1132/)].\n\n**Corpora Generation for Grammatical Error Correction**. NAACL 2019.  \nJared Lichtarge, Chris Alberti, Shankar Kumar, Noam Shazeer, Niki Parmar, Simon Tong.[[pdf](https://aclanthology.org/N19-1333/)].\n\n**Neural Grammatical Error Correction with Finite State Transducers**. NAACL 2019.  \nFelix Stahlberg, Christopher Bryant, Bill Byrne.[[pdf](https://aclanthology.org/N19-1406/)].\n\n\u003e ### 2018\n\n**Inherent Biases in Reference-based Evaluation for Grammatical Error Correction**. ACL 2018.  \nLeshem Choshen, Omri Abend.[[pdf](https://aclanthology.org/P18-1059/)].\n\n**Fluency Boost Learning and Inference for Neural Grammatical Error Correction**. ACL 2018.  \nTao Ge, Furu Wei, Ming Zhou.[[pdf](https://aclanthology.org/P18-1097/)].\n\n**Automatic Metric Validation for Grammatical Error Correction**. ACL 2018.  \nLeshem Choshen, Omri Abend.[[pdf](https://aclanthology.org/P18-1127/)].\n\n**Overview of NLPTEA-2018 Share Task Chinese Grammatical Error Diagnosis**. ACL 2018 NLPTEA.  \nGaoqi Rao, Qi Gong, Baolin Zhang, Endong Xun.[[pdf](https://aclanthology.org/W18-3706/)].\n\n**Approaching Neural Grammatical Error Correction as a Low-Resource Machine Translation Task**. NAACL 2018.  \nMarcin Junczys-Dowmunt, Roman Grundkiewicz, Shubha Guha, Kenneth Heafield.[[pdf](https://aclanthology.org/N18-1055/)].\n\n**Noising and Denoising Natural Language: Diverse Backtranslation for Grammar Correction**. NAACL 2018.  \nZiang Xie, Guillaume Genthial, Stanley Xie, Andrew Ng, Dan Jurafsky.[[pdf](https://aclanthology.org/N18-1057/)].\n\n**Reference-less Measure of Faithfulness for Grammatical Error Correction**. 
NAACL 2018 short.  \nLeshem Choshen, Omri Abend.[[pdf](https://aclanthology.org/N18-2020/)].\n\n**Near Human-Level Performance in Grammatical Error Correction with Hybrid Machine Translation**. NAACL 2018 short.  \nRoman Grundkiewicz, Marcin Junczys-Dowmunt.[[pdf](https://aclanthology.org/N18-2046/)].\n\n**Language Model Based Grammatical Error Correction without Annotated Training Data**. NAACL 2018 BEA.  \nChristopher Bryant, Ted Briscoe.[[pdf](https://aclanthology.org/W18-0529/)].\n\n**A Multilayer Convolutional Encoder-Decoder Neural Network for Grammatical Error Correction**. AAAI 2018.  \nShamil Chollampatt, Hwee Tou Ng.[[pdf](https://www.aaai.org/ocs/index.php/AAAI/AAAI18/paper/viewFile/17308/16137)].\n\n**Neural Quality Estimation of Grammatical Error Correction**. EMNLP 2018.  \nShamil Chollampatt, Hwee Tou Ng.[[pdf](https://aclanthology.org/D18-1274/)].\n\n**Wronging a Right: Generating Better Errors to Improve Grammatical Error Detection**. EMNLP 2018.  \nSudhanshu Kasewa, Pontus Stenetorp, Sebastian Riedel.[[pdf](https://aclanthology.org/D18-1541/)].\n\n**Using Wikipedia Edits in Low Resource Grammatical Error Correction**. EMNLP 2018.  \nAdriane Boyd.[[pdf](https://aclanthology.org/W18-6111/)].\n\n**Cool English: a Grammatical Error Correction System Based on Large Learner Corpora**. COLING 2018.  \nYu-Chun Lo, Jhih-Jie Chen, Chingyu Yang, Jason Chang.[[pdf](https://aclanthology.org/C18-2018/)].\n\n**A Reassessment of Reference-Based Grammatical Error Correction Metrics**. COLING 2018.  \nShamil Chollampatt, Hwee Tou Ng.[[pdf](https://aclanthology.org/C18-1231/)].\n\n\u003e ### earlier\n\n**A Nested Attention Neural Hybrid Model for Grammatical Error Correction**. ACL 2017.  \nJianshu Ji, Qinlong Wang, Kristina Toutanova, Yongen Gong, Steven Truong, Jianfeng Gao.[[pdf](https://aclanthology.org/P17-1070/)].\n\n**Automatic Annotation and Evaluation of Error Types for Grammatical Error Correction**. ACL 2017.  
\nChristopher Bryant, Mariano Felice, Ted Briscoe.[[pdf](https://aclanthology.org/P17-1074/)].\n\n**Neural Sequence-Labelling Models for Grammatical Error Correction**. EMNLP 2017.  \nHelen Yannakoudakis, Marek Rei, Øistein E. Andersen, Zheng Yuan.[[pdf](https://aclanthology.org/D17-1297/)].\n\n**JFLEG: A Fluency Corpus and Benchmark for Grammatical Error Correction**. EACL 2017.  \nCourtney Napoles, Keisuke Sakaguchi, Joel Tetreault.[[pdf](https://aclanthology.org/E17-2037/)].\n\n**Grammatical Error Detection Using Error- and Grammaticality-Specific Word Embeddings**. IJCNLP 2017.  \nMasahiro Kaneko, Yuya Sakaizawa, Mamoru Komachi.[[pdf](https://aclanthology.org/I17-1005/)].\n\n**Reference-based Metrics can be Replaced with Reference-less Metrics in Evaluating Grammatical Error Correction Systems**. IJCNLP 2017.  \nHiroki Asano, Tomoya Mizumoto, Kentaro Inui.[[pdf](https://aclanthology.org/I17-2058/)].\n\n**Grammatical Error Correction with Neural Reinforcement Learning**. IJCNLP 2017.  \nKeisuke Sakaguchi, Matt Post, Benjamin Van Durme.[[pdf](https://aclanthology.org/I17-2062/)].\n\n**Grammatical Error Correction: Machine Translation and Classifiers**. ACL 2016.  \nAlla Rozovskaya, Dan Roth.[[pdf](https://aclanthology.org/P16-1208/)].\n\n**Compositional Sequence Labeling Models for Error Detection in Learner Writing**. ACL 2016.  \nMarek Rei, Helen Yannakoudakis.[[pdf](https://aclanthology.org/P16-1112/)].\n\n**Grammatical error correction using neural machine translation**. NAACL 2016 short.  \nZheng Yuan, Ted Briscoe.[[pdf](https://aclanthology.org/N16-1042/)].\n\n**Discriminative Reranking for Grammatical Error Correction with Statistical Machine Translation**. NAACL 2016 short.  \nTomoya Mizumoto, Yuji Matsumoto.[[pdf](https://aclanthology.org/N16-1133/)].\n\n**Phrase-based Machine Translation is State-of-the-Art for Automatic Grammatical Error Correction**. EMNLP 2016.  
\nMarcin Junczys-Dowmunt, Roman Grundkiewicz.[[pdf](https://aclanthology.org/D16-1161/)].\n\n**Adapting Grammatical Error Correction Based on the Native Language of Writers with Neural Network Joint Models**. EMNLP 2016.  \nShamil Chollampatt, Duc Tam Hoang, Hwee Tou Ng.[[pdf](https://aclanthology.org/D16-1195/)].\n\n**There’s No Comparison: Reference-less Evaluation Metrics in Grammatical Error Correction**. EMNLP 2016.  \nCourtney Napoles, Keisuke Sakaguchi, Joel Tetreault.[[pdf](https://aclanthology.org/D16-1228/)].\n\n**Chinese Preposition Selection for Grammatical Error Diagnosis**. COLING 2016.  \nHen-Hsen Huang, Yen-Chi Shao, Hsin-Hsi Chen.[[pdf](https://aclanthology.org/C16-1085/)].\n\n**Neural Network Translation Models for Grammatical Error Correction**. IJCAI 2016.  \nS Chollampatt，K Taghipour，HT Ng.[[pdf](https://arxiv.org/pdf/1606.00189.pdf)].\n\n**Exploiting N-Best Hypotheses to Improve an SMT Approach to Grammatical Error Correction**. IJCAI 2016.  \nDT Hoang，S Chollampatt，HT Ng.[[pdf](http://www.researchgate.net/publication/303749890_Exploiting_N-Best_Hypotheses_to_Improve_an_SMT_Approach_to_Grammatical_Error_Correction)].\n\n**How Far are We from Fully Automatic High Quality Grammatical Error Correction?**. ACL 2015.  \nChristopher Bryant, Hwee Tou Ng.[[pdf](https://aclanthology.org/P15-1068/)].\n\n**Ground Truth for Grammatical Error Correction Metrics**. ACL 2015 short.  \nCourtney Napoles, Keisuke Sakaguchi, Matt Post, Joel Tetreault.[[pdf](https://aclanthology.org/P15-2097/)].\n\n**Towards a standard evaluation method for grammatical error detection and correction**. NAACL 2015.  \nMariano Felice, Ted Briscoe.[[pdf](https://aclanthology.org/N15-1060/)].\n\n**Human Evaluation of Grammatical Error Correction Systems**. EMNLP 2015.  \nRoman Grundkiewicz, Marcin Junczys-Dowmunt, Edward Gillian.[[pdf](https://aclanthology.org/D15-1052/)].\n\n**Ground Truth for Grammatical Error Correction Metrics**. IJCNLP 2015.  
\nCourtney Napoles, Keisuke Sakaguchi, Matt Post, Joel Tetreault.[[pdf](https://aclanthology.org/P15-2097/)].\n\n**Go Climb a Dependency Tree and Correct the Grammatical Errors**. EMNLP 2014.  \nLongkai Zhang, Houfeng Wang.[[pdf](https://aclanthology.org/D14-1033/)].\n\n**System Combination for Grammatical Error Correction**. EMNLP 2014.  \nRaymond Hendy Susanto, Peter Phandi, Hwee Tou Ng.[[pdf](https://aclanthology.org/D14-1102/)].\n\n**Data Driven Grammatical Error Detection in Transcripts of Children’s Speech**. EMNLP 2014.  \nEric Morley, Anna Eva Hallin, Brian Roark.[[pdf](https://aclanthology.org/D14-1106/)].\n\n**Generating artificial errors for grammatical error correction**. EACL 2014.  \nMariano Felice, Zheng Yuan.[[pdf](https://aclanthology.org/E14-3013/)].\n\n**Correcting Grammatical Verb Errors**. EACL 2014.  \nAlla Rozovskaya, Dan Roth, Vivek Srikumar.[[pdf](https://aclanthology.org/E14-1038/)].\n\n**Detecting Learner Errors in the Choice of Content Words Using Compositional Distributional Semantics**. COLING 2014.  \nEkaterina Kochmar, Ted Briscoe.[[pdf](https://aclanthology.org/C14-1164/)].\n\n**A Sentence Judgment System for Grammatical Error Detection**. COLING 2014.  \nLung-Hao Lee, Liang-Chih Yu, Kuei-Ching Lee, Yuen-Hsien Tseng, Li-Ping Chang, Hsin-Hsi Chen.[[pdf](https://aclanthology.org/C14-2015/)].\n\n**Automated Grammatical Error Correction for Language Learners**. COLING 2014.  \nJoel Tetreault, Claudia Leacock.[[pdf](https://aclanthology.org/C14-3004/)].\n\n**Grammatical Error Correction Using Integer Linear Programming**. ACL 2013.  \nYuanbin Wu, Hwee Tou Ng.[[pdf](https://aclanthology.org/P13-1143/)].\n\n**Joint Learning and Inference for Grammatical Error Correction**. EMNLP 2013.  \nAlla Rozovskaya, Dan Roth.[[pdf](https://aclanthology.org/D13-1074/)].\n\n**Automated Grammar Correction Using Hierarchical Phrase-Based Statistical Machine Translation**. IJCNLP 2013.  
\nBibek Behera, Pushpak Bhattacharyya.[[pdf](https://aclanthology.org/I13-1122/)].\n\n**Grammatical Error Correction Using Feature Selection and Confidence Tuning**. IJCNLP 2013.  \nYang Xiang, Yaoyun Zhang, Xiaolong Wang, Chongqiang Wei, Wen Zheng, Xiaoqiang Zhou, Yuxiu Hu, Yang Qin.[[pdf](https://aclanthology.org/I13-1148/)].\n\n**A Meta Learning Approach to Grammatical Error Correction**. ACL 2012.  \nHongsuck Seo, Jonghoon Lee, Seokhwan Kim, Kyusong Lee, Sechun Kang, Gary Geunbae Lee.[[pdf](https://aclanthology.org/P12-2064/)].\n\n**Grammar Error Correction Using Pseudo-Error Sentences and Domain Adaptation**. ACL 2012.  \nKenji Imamura, Kuniko Saito, Kugatsu Sadamitsu, Hitoshi Nishikawa.[[pdf]()].\n\n**Better Evaluation for Grammatical Error Correction**. NAACL 2012 short.  \nDaniel Dahlmeier, Hwee Tou Ng.[[pdf](https://aclanthology.org/N12-1067/)].\n\n**A Beam-Search Decoder for Grammatical Error Correction**. EMNLP 2012.  \nDaniel Dahlmeier, Hwee Tou Ng.[[pdf](https://aclanthology.org/D12-1052/)].\n\n**Problems in Evaluating Grammatical Error Detection Systems**. COLING 2012.  \nMartin Chodorow, Markus Dickinson, Ross Israel, Joel Tetreault.[[pdf](https://aclanthology.org/C12-1038/)].\n\n**The Effect of Learner Corpus Size in Grammatical Error Correction of ESL Writings**. COLING 2012.  \nTomoya Mizumoto, Yuta Hayashibe, Mamoru Komachi, Masaaki Nagata, Yuji Matsumoto.[[pdf](https://aclanthology.org/C12-2084/)].\n\n**They Can Help: Using Crowdsourcing to Improve the Evaluation of Grammatical Error Detection Systems**. ACL 2011.  \nNitin Madnani, Martin Chodorow, Joel Tetreault, Alla Rozovskaya.[[pdf](https://aclanthology.org/P11-2089/)].\n\n**Grammatical Error Correction with Alternating Structure Optimization**. ACL 2011.  \nDaniel Dahlmeier, Hwee Tou Ng.[[pdf](https://aclanthology.org/P11-1092/)].\n\n**Automated Whole Sentence Grammar Correction Using a Noisy Channel Model**. ACL 2011.  \nY. 
Albert Park, Roger Levy.[[pdf](https://aclanthology.org/P11-1094/)].\n\n**Grammatical Error Detection for Corrective Feedback Provision in Oral Conversations**. AAAI 2011.  \nSungjin Lee, Hyungjong Noh, Kyusong Lee, Gary Geunbae Lee.[[pdf](https://www.aaai.org/ocs/index.php/AAAI/AAAI11/paper/view/3501/3954)].\n\n**Evaluating performance of grammatical error detection to maximize learning effect**. COLING 2010.  \nRyo Nagata, Kazuhide Nakatani.[[pdf](https://aclanthology.org/C10-2103/)].\n\n\n\n## Datasets\n\n### [CCTC: A Cross-Sentence Chinese Text Correction Dataset for Native Speakers](https://aclanthology.org/2022.coling-1.294.pdf)\n| dataset | download |\n|-|-|\n|CCTC |[download](https://pan.baidu.com/s/1FzWzlshVWBVqZvLt07zqiQ) (password: 45ok)|\n\n\n### [CTC 2021](https://destwang.github.io/CTC2021-explorer/) \n| dataset | download |\n|-|-|\n|training data|[download](https://pan.baidu.com/s/14lSJquTV4eZBGYlnr5Oq3Q) (password: 1yie)|\n|validation data|[download](https://pan.baidu.com/s/1dnpCxGK0m8v-R-wMpYH2kQ) (password: asrb)|\n\n### Others:\n| dataset | task | # sents | source | language |\n|-|-|-|-|-|\n| SIGHAN 2013 | CSC | 350 \u0026 974 | SIGHAN | Zh |\n| SIGHAN 2014 | CSC | 6,526 \u0026 526 | SIGHAN | Zh |\n| SIGHAN 2015 | CSC | 3,174 \u0026 550 | SIGHAN | Zh |\n|[OCR dataset](https://github.com/iqiyi/FASPell)|CSC|4575|[FASPell(iqiyi)](https://www.aclweb.org/anthology/D19-5522.pdf)|Zh|\n| [HybridSet](https://github.com/wdimmy/Automatic-Corpus-Generation) | CSC | 270K | - | Zh |\n| NLPCC 2018 GEC | GEC | - | NLPCC | Zh |\n| CGED | GED | - | HSK | Zh |\n| CoNLL 2013 | GEC | 1,381 | CONLL | En |\n| CoNLL 2014 | GEC | 1,312 | CONLL | En |\n| JFLEG | GEC | 747 | [JFLEG: A Fluency Corpus and Benchmark for Grammatical Error Correction](https://aclanthology.org/E17-2037.pdf) | En |\n| NUCLE | GEC | 57k | [Building a Large Annotated Corpus of Learner English: The NUS Corpus of Learner English](https://aclanthology.org/W13-1703.pdf) | En |\n| Lang-8 | GEC | 1M+ | 
Lang-8 | En |\n| Write\u0026Improve+LOCNESS | GEC | 63,683 \u0026 7,632 | - | En |\n|MMC+PsyTAR (medical)||512 \u0026 79| - |En|\n|birkbeck+holbrook-tagged+holbrook-missp+aspell+wikipedia|(Misspelled words)| 36133/6136 \u0026 1791/1200 \u0026 531/450 \u0026 2455/1922|[BBK](https://www.dcs.bbk.ac.uk/~ROGER/corpora.htm)|En|\n|[TOEFL-Spell](https://github.com/EducationalTestingService/toefl-spell)|-|-|[A Benchmark Corpus of English Misspellings and a Minimally-supervised Model for Spelling Correction](https://www.aclweb.org/anthology/W19-4407.pdf)|En|\n|[NUC-GEC](https://www.comp.nus.edu.sg/~nlp/corpora.html)|GEC|500 essays|[How Far are We from Fully Automatic High Quality Grammatical Error Correction?](https://www.aclweb.org/anthology/P15-1068.pdf)|En|\n| [BEA2019](https://www.cl.cam.ac.uk/research/nl/bea2019st/) | GEC | 34,308 | BEA | En |\n| PIE-synthetic | GEC | 9,000,000 | [Parallel iterative edit models for local sequence transduction](https://aclanthology.org/D19-1435/) | En |\n| [clang8](https://github.com/google-research-datasets/clang8) | GEC | 2,372,119 \u0026 114,405 \u0026 44,830 | - | En, De, Ru |\n| CTC2021 | CSC | 217,634 | - | Zh |\n\n\n## Systems \u0026 API\nFeiying System: http://check.hfl-rc.com/  \nFeiying API: https://www.xfyun.cn/services/textCorrection\n\n\n## Other Resources\n### Related Articles\n* [语法纠错的研究现状](https://mp.weixin.qq.com/s/0_qp1WsrEsjnj8ST4zQyTQ)\n* [文本语法纠错不完全调研](https://mp.weixin.qq.com/s/Dj8KIe6LbVGonV-Kk9mO2Q)\n\n### Shared Task\n* [CTC 2021](https://github.com/destwang/CTC2021)\n-----\n*The above resources are for academic research only. If any of them infringes copyright, please contact us and we will remove it.*\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdestwang%2FCTCResources","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fdestwang%2FCTCResources","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdestwang%2FCTCResources/lists"}