{"id":19839859,"url":"https://github.com/qdata/awesome-robustness-testing-for-nlp","last_synced_at":"2026-03-03T09:31:47.808Z","repository":{"id":69609542,"uuid":"290303075","full_name":"QData/Awesome-Robustness-Testing-for-NLP","owner":"QData","description":"A curated list of papers on testing NLP.","archived":false,"fork":false,"pushed_at":"2020-10-19T11:03:26.000Z","size":82,"stargazers_count":5,"open_issues_count":0,"forks_count":0,"subscribers_count":3,"default_branch":"master","last_synced_at":"2025-10-12T19:03:09.325Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":"","language":null,"has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/QData.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2020-08-25T19:19:06.000Z","updated_at":"2023-07-06T20:59:18.000Z","dependencies_parsed_at":"2023-03-11T06:34:48.824Z","dependency_job_id":null,"html_url":"https://github.com/QData/Awesome-Robustness-Testing-for-NLP","commit_stats":null,"previous_names":[],"tags_count":0,"template":true,"template_full_name":null,"purl":"pkg:github/QData/Awesome-Robustness-Testing-for-NLP","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/QData%2FAwesome-Robustness-Testing-for-NLP","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/QData%2FAwesome-Robustness-Testing-for-NLP/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/QData%2FAwesome-Robustness-Testing-for-NLP/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/QData%2FAwesome-Robustness-Test
ing-for-NLP/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/QData","download_url":"https://codeload.github.com/QData/Awesome-Robustness-Testing-for-NLP/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/QData%2FAwesome-Robustness-Testing-for-NLP/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":30039884,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-03-03T06:58:30.252Z","status":"ssl_error","status_checked_at":"2026-03-03T06:58:15.329Z","response_time":61,"last_error":"SSL_connect returned=1 errno=0 peeraddr=140.82.121.6:443 state=error: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-11-12T12:24:36.781Z","updated_at":"2026-03-03T09:31:47.756Z","avatar_url":"https://github.com/QData.png","language":null,"funding_links":[],"categories":[],"sub_categories":[],"readme":"# Awesome Papers on Automated-Robustness-Testing-for-NLP\n## Why is testing DNNs important?\nDNNs are modern software being deployed everywhere. Like other software, they must be tested for corner cases (inputs on which the software is likely to be problematic). \n## Why is testing DNNs hard?\nDNNs have too many parameters and too many neurons, so finding corner cases manually is too difficult. Automated testing is needed, i.e., automatically generating corner cases for large DNNs. \n\n## General Intro Position Papers/Blogs\n1. 
[DeepMind Medium Blog](https://medium.com/@deepmindsafetyresearch/towards-robust-and-verified-ai-specification-testing-robust-training-and-formal-verification-69bd1bc48bda)\n2. [General Survey of Testing in ML](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=\u0026arnumber=9000651)\n### Neuron Coverage Based \n  1. [GrayBox Testing: DeepTest](https://arxiv.org/pdf/1708.08559.pdf) \n  2. [White Box Gradient Based Testing](https://arxiv.org/abs/1705.06640)\n  3. [DeepCT](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=\u0026arnumber=8668044)\n  4. [Concolic Testing for Deep Neural Networks](http://qav.comlab.ox.ac.uk/papers/swr+18.pdf)\n  5. [FuzzTesting](https://www.comp.nus.edu.sg/~abhik/pdf/ICSE20_Sensei.pdf) -- Augmentation\n  6. [Testing Deep Neural Networks- Symbolic Execution](https://arxiv.org/abs/1803.04792)\n  7. [MCTS based](https://arxiv.org/abs/1710.07859)\n  \n### Fuzzing Based \n1. [FuzzTesting](https://www.comp.nus.edu.sg/~abhik/pdf/ICSE20_Sensei.pdf) \n2. [TensorFuzz](http://proceedings.mlr.press/v97/odena19a/odena19a.pdf)\n3. [DLFuzz](https://arxiv.org/pdf/1808.09413.pdf)\n4. [NeuFuzz](https://wcventure.github.io/FuzzingPaper/Paper/Access19_NeuFuzz%20.pdf)\n## Testing for NLP Deep Models \n1. [CheckList](https://homes.cs.washington.edu/~marcotcr/acl20_checklist.pdf)\n2. [Errudite](https://homes.cs.washington.edu/~marcotcr/acl19_errudite.pdf)\n3. [Semantically Equivalent Adversarial Rules for Debugging NLP Models](https://homes.cs.washington.edu/~marcotcr/acl18.pdf)\n4. [Are Red Roses Red? Evaluating Consistency of Question-Answering Models](https://homes.cs.washington.edu/~marcotcr/acl19_implication.pdf)\n5. [Robustness Verification for Transformers](https://arxiv.org/pdf/2002.06622.pdf)\n6. [Towards a Robust Deep Neural Network in Texts: A Survey](https://arxiv.org/pdf/1902.07285.pdf)\n7. 
[Certified Robustness to Adversarial Word Substitutions](https://arxiv.org/abs/1909.00986)\n\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fqdata%2Fawesome-robustness-testing-for-nlp","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fqdata%2Fawesome-robustness-testing-for-nlp","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fqdata%2Fawesome-robustness-testing-for-nlp/lists"}