{"id":13429450,"url":"https://github.com/jeffhj/LM-reasoning","last_synced_at":"2025-03-16T03:31:31.252Z","repository":{"id":65569184,"uuid":"580204419","full_name":"jeffhj/LM-reasoning","owner":"jeffhj","description":"This repository contains a collection of papers and resources on Reasoning in Large Language Models.","archived":false,"fork":false,"pushed_at":"2023-11-13T08:44:02.000Z","size":102,"stargazers_count":495,"open_issues_count":5,"forks_count":30,"subscribers_count":16,"default_branch":"main","last_synced_at":"2024-05-21T03:12:03.510Z","etag":null,"topics":["artificial-intelligence","awesome-list","chain-of-thought","chatgpt","deductive-reasoning","gpt-3","human-intelligence","in-context-learning","language-modeling","language-models","large-language-models","llm","natural-language-processing","paper-list","prompt-engineering","prompt-learning","reasoning"],"latest_commit_sha":null,"homepage":"","language":null,"has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/jeffhj.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"License","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2022-12-20T01:11:36.000Z","updated_at":"2024-06-04T11:11:43.321Z","dependencies_parsed_at":"2024-06-04T11:11:42.786Z","dependency_job_id":"4e673c41-4090-41cb-9785-709e1759a217","html_url":"https://github.com/jeffhj/LM-reasoning","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/jeffhj%2FLM-reasoning","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/jeffhj%2FLM-reasoning/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/jeffhj%2FLM-reasoning/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/jeffhj%2FLM-reasoning/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/jeffhj","download_url":"https://codeload.github.com/jeffhj/LM-reasoning/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":243822312,"owners_count":20353496,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["artificial-intelligence","awesome-list","chain-of-thought","chatgpt","deductive-reasoning","gpt-3","human-intelligence","in-context-learning","language-modeling","language-models","large-language-models","llm","natural-language-processing","paper-list","prompt-engineering","prompt-learning","reasoning"],"created_at":"2024-07-31T02:00:39.324Z","updated_at":"2025-03-16T03:31:30.970Z","avatar_url":"https://github.com/jeffhj.png","language":null,"funding_links":[],"categories":["1 Relevant Surveys and Links","Refer to Awesome Lists","Repositories","Articles","Other Lists","A01_文本生成_文本对话","Other Awesome Lists","Contributing","Others","Uncategorized"],"sub_categories":["Reason","Large Language Model (LLM)","TeX Lists","大语言对话模型及数据","2022","Uncategorized"],"readme":"# Reasoning in Large Language Models\n\n[![Awesome](https://awesome.re/badge.svg)](https://github.com/jeffhj/LM-reasoning) \n[![License: MIT](https://img.shields.io/badge/License-MIT-green.svg)](https://opensource.org/licenses/MIT)\n[![Made With Love](https://img.shields.io/badge/Made%20With-Love-red.svg)](https://github.com/chetanraj/awesome-github-badges)\n\nThis repository contains a collection of papers and resources on ***Reasoning in Large Language Models***.\n\nFor more details, please refer to [Towards Reasoning in Large Language Models: A Survey](https://arxiv.org/abs/2212.10403)\n\n\u003eFeel free to let me know the missing papers (issue or pull request).\n\n\n\nContributor: [Jie Huang](https://jeffhj.github.io/) @UIUC\n\nThank [Kevin Chen-Chuan Chang](https://cs.illinois.edu/about/people/faculty/kcchang) @UIUC, [Jason Wei](https://www.jasonwei.net/) @Google Brain, [Denny Zhou](https://dennyzhou.github.io/) @Google Brain for insightful discussions and suggestions.\n\n\n\n## Contents\n\n- [Survey](#survey)\n- [Relevant Survey \u0026 Position Paper \u0026 Blog](#relevant-survey-and-position-paper-and-blog)\n- [Technique](#technique)\n  * [Fully Supervised Finetuning](#fully-supervised-finetuning)\n  * [Prompting \u0026 In-Context Learning](#prompting-and-in-context-learning)\n  * [Hybrid Method](#hybrid-method)\n- [Evaluation \u0026 Analysis](#evaluation-and-analysis)\n\n\n\n## Survey\n\n#### [Towards Reasoning in Large Language Models: A Survey](https://arxiv.org/abs/2212.10403)\t*20 Dec 2022*\n\nJie Huang, Kevin Chen-Chuan Chang\n\n\n\n## Relevant Survey and Position Paper and Blog\n\n#### [Emergent Abilities of Large Language Models](https://arxiv.org/abs/2206.07682)\t*15 Jun 2022*\n\nJason Wei, Yi Tay, Rishi Bommasani, Colin Raffel, Barret Zoph, Sebastian Borgeaud, Dani Yogatama, Maarten Bosma, Denny Zhou, Donald Metzler, Ed H. Chi, Tatsunori Hashimoto, Oriol Vinyals, Percy Liang, Jeff Dean, William Fedus\n\n#### [Language Model Cascades](https://arxiv.org/abs/2207.10342)\t*21 Jul 2022*\n\nDavid Dohan, Winnie Xu, Aitor Lewkowycz, Jacob Austin, David Bieber, Raphael Gontijo Lopes, Yuhuai Wu, Henryk Michalewski, Rif A. Saurous, Jascha Sohl-dickstein, Kevin Murphy, Charles Sutton\n\n#### [How does GPT Obtain its Ability? Tracing Emergent Abilities of Language Models to their Sources](https://yaofu.notion.site/How-does-GPT-Obtain-its-Ability-Tracing-Emergent-Abilities-of-Language-Models-to-their-Sources-b9a57ac0fcf74f30a1ab9e3e36fa1dc1)\t *11 Dec 2022*\n\nYao Fu, Hao Peng, Tushar Shot\n\n#### [Reasoning with Language Model Prompting: A Survey](https://arxiv.org/abs/2212.09597)\t *19 Dec 2022*\n\nShuofei Qiao, Yixin Ou, Ningyu Zhang, Xiang Chen, Yunzhi Yao, Shumin Deng, Chuanqi Tan, Fei Huang, Huajun Chen\n\n#### [A Survey of Deep Learning for Mathematical Reasoning](https://arxiv.org/abs/2212.10535)\t*20 Dec 2022*\n\nPan Lu, Liang Qiu, Wenhao Yu, Sean Welleck, Kai-Wei Chang\n\n#### [A Survey for In-context Learning](https://arxiv.org/abs/2301.00234)\t*31 Dec 2022*\n\nQingxiu Dong, Lei Li, Damai Dai, Ce Zheng, Zhiyong Wu, Baobao Chang, Xu Sun, Jingjing Xu, Lei Li, Zhifang Sui\n\n#### [Logical Reasoning over Natural Language as Knowledge Representation: A Survey](https://arxiv.org/abs/2303.12023) *21 Mar 2023*\n\nZonglin Yang, Xinya Du, Rui Mao, Jinjie Ni, Erik Cambria\n\n#### [Nature Language Reasoning, A Survey](https://arxiv.org/abs/2303.14725) *26 Mar 2023*\n\nFei Yu, Hongbo Zhang, Benyou Wang\n\n\n\n## Technique\n\n### Fully Supervised Finetuning\n\n\u003e We mainly focus on techniques that are applicable to improving or eliciting \"reasoning\" in *large* language models like GPT-3 (175B)\n\n\u003e Papers in this paradigm vary a lot and are usually based on *small* models trained on specific datasets. We list several papers here for reference (that is, the list is not complete). Please refer to [our survey](https://arxiv.org/abs/2212.10403) for some discussion.\n\n#### [Explain Yourself! Leveraging Language Models for Commonsense Reasoning](https://arxiv.org/abs/1906.02361)\t*6 Jun 2019*\n\nNazneen Fatema Rajani, Bryan McCann, Caiming Xiong, Richard Socher\n\n#### [Leap-Of-Thought: Teaching Pre-Trained Models to Systematically Reason Over Implicit Knowledge](https://arxiv.org/abs/2006.06609)\t*11 Jun 2020*\n\nAlon Talmor, Oyvind Tafjord, Peter Clark, Yoav Goldberg, Jonathan Berant\n\n#### [Measuring Mathematical Problem Solving With the MATH Dataset](https://arxiv.org/abs/2103.03874)\t*5 Mar 2021*\n\nDan Hendrycks, Collin Burns, Saurav Kadavath, Akul Arora, Steven Basart, Eric Tang, Dawn Song, Jacob Steinhardt\n\n#### [Show Your Work: Scratchpads for Intermediate Computation with Language Models](https://arxiv.org/abs/2112.00114)\t*30 Nov 2021*\n\nMaxwell Nye, Anders Johan Andreassen, Guy Gur-Ari, Henryk Michalewski, Jacob Austin, David Bieber, David Dohan, Aitor Lewkowycz, Maarten Bosma, David Luan, Charles Sutton, Augustus Odena\n\n#### [FaiRR: Faithful and Robust Deductive Reasoning over Natural Language](https://arxiv.org/abs/2203.10261)\t*19 Mar 2022*\n\nSoumya Sanyal, Harman Singh, Xiang Ren\n\n#### ......\n\n\n\n### Prompting and In-Context Learning\n\n#### Chain of Thought Prompting and Its Variants/Applications\n\n#### [Chain of Thought Prompting Elicits Reasoning in Large Language Models](https://arxiv.org/abs/2201.11903)\t*28 Jan 2022*\n\nJason Wei, Xuezhi Wang, Dale Schuurmans, Maarten Bosma, Brian Ichter, Fei Xia, Ed Chi, Quoc Le, Denny Zhou\n\n#### [Iteratively Prompt Pre-trained Language Models for Chain of Thought](https://arxiv.org/abs/2203.08383)\t*16 Mar 2022*\n\nBoshi Wang, Xiang Deng, Huan Sun\n\n#### [Large Language Models are Zero-Shot Reasoners](https://arxiv.org/abs/2205.11916)\t*24 May 2022*\n\nTakeshi Kojima, Shixiang Shane Gu, Machel Reid, Yutaka Matsuo, Yusuke Iwasawa\n\n#### [Psychologically-informed chain-of-thought prompts for metaphor understanding in large language models](https://arxiv.org/abs/2209.08141)\t*16 Sep 2022*\n\nBen Prystawski, Paul Thibodeau, Noah Goodman\n\n#### [Language Models are Multilingual Chain-of-Thought Reasoners](https://arxiv.org/abs/2210.03057)\t*6 Oct 2022*\n\nFreda Shi, Mirac Suzgun, Markus Freitag, Xuezhi Wang, Suraj Srivats, Soroush Vosoughi, Hyung Won Chung, Yi Tay, Sebastian Ruder, Denny Zhou, Dipanjan Das, Jason Wei\n\n#### [Large Language Models are few(1)-shot Table Reasoners](https://arxiv.org/abs/2210.06710)\t*13 Oct 2022*\n\nWenhu Chen\n\n#### [Language Models of Code are Few-Shot Commonsense Learners](https://arxiv.org/abs/2210.07128) *13 Oct 2022*\n\nAman Madaan, Shuyan Zhou, Uri Alon, Yiming Yang, Graham Neubig\n\n#### [PaL: Program-Aided Language Model](https://arxiv.org/abs/2211.10435) *18 Nov 2022*\n\nLuyu Gao*, Aman Madaan*, Shuyan Zhou*, Uri Alon, Pengfei Liu, Yiming Yang, Jamie Callan, Graham Neubig\n\n#### [Program of Thoughts Prompting: Disentangling Computation from Reasoning for Numerical Reasoning Tasks](https://arxiv.org/abs/2211.12588)\t*22 Nov 2022*\n\nWenhu Chen, Xueguang Ma, Xinyi Wang, William W. Cohen\n\n#### [Rethinking with Retrieval: Faithful Large Language Model Inference](https://arxiv.org/abs/2301.00303)\t*31 Dec 2022*\n\nHangfeng He, Hongming Zhang, Dan Roth\n\n#### Rationale Engineering\n\n#### [Training Verifiers to Solve Math Word Problems](https://arxiv.org/abs/2110.14168)\t*27 Oct 2021*\n\nKarl Cobbe, Vineet Kosaraju, Mohammad Bavarian, Mark Chen, Heewoo Jun, Lukasz Kaiser, Matthias Plappert, Jerry Tworek, Jacob Hilton, Reiichiro Nakano, Christopher Hesse, John Schulman\n\n#### [Self-Consistency Improves Chain of Thought Reasoning in Language Models](https://arxiv.org/abs/2203.11171)\t*21 Mar 2022*\n\nXuezhi Wang, Jason Wei, Dale Schuurmans, Quoc Le, Ed Chi, Sharan Narang, Aakanksha Chowdhery, Denny Zhou\n\n#### [On the Advance of Making Language Models Better Reasoners](https://arxiv.org/abs/2206.02336)\t*6 Jun 2022*\n\nYifei Li, Zeqi Lin, Shizhuo Zhang, Qiang Fu, Bei Chen, Jian-Guang Lou, Weizhu Chen\n\n#### [Complexity-Based Prompting for Multi-Step Reasoning](https://arxiv.org/abs/2210.00720)\t*3 Oct 2022*\n\nYao Fu, Hao Peng, Ashish Sabharwal, Peter Clark, Tushar Khot\n\n#### [Automatic Chain of Thought Prompting in Large Language Models](https://arxiv.org/abs/2210.03493)\t*7 Oct 2022*\n\nZhuosheng Zhang, Aston Zhang, Mu Li, Alex Smola\n\n#### [Teaching Algorithmic Reasoning via In-context Learning](https://arxiv.org/abs/2211.09066)\t*15 Nov 2022*\n\nHattie Zhou, Azade Nova, Hugo Larochelle, Aaron Courville, Behnam Neyshabur, Hanie Sedghi\n\n#### [Large Language Models are reasoners with Self-Verification](https://arxiv.org/abs/2212.09561)\t*19 Dec 2022*\n\nYixuan Weng, Minjun Zhu, Shizhu He, Kang Liu, Jun Zhao\n\n#### Problem Decomposition\n\n#### [Least-to-Most Prompting Enables Complex Reasoning in Large Language Models](https://arxiv.org/abs/2205.10625)\t*21 May 2022*\n\nDenny Zhou, Nathanael Schärli, Le Hou, Jason Wei, Nathan Scales, Xuezhi Wang, Dale Schuurmans, Claire Cui, Olivier Bousquet, Quoc Le, Ed Chi\n\n#### [Compositional Semantic Parsing with Large Language Models](https://arxiv.org/abs/2209.15003)\t*29 Sep 2022*\n\nAndrew Drozdov, Nathanael Schärli, Ekin Akyürek, Nathan Scales, Xinying Song, Xinyun Chen, Olivier Bousquet, Denny Zhou\n\n#### [Decomposed Prompting: A Modular Approach for Solving Complex Tasks](https://arxiv.org/abs/2210.02406)\t*5 Oct 2022*\n\nTushar Khot, Harsh Trivedi, Matthew Finlayson, Yao Fu, Kyle Richardson, Peter Clark, Ashish Sabharwal\n\n#### [Measuring and Narrowing the Compositionality Gap in Language Models](https://arxiv.org/abs/2210.03350)\t*7 Oct 2022*\n\nOfir Press, Muru Zhang, Sewon Min, Ludwig Schmidt, Noah A. Smith, Mike Lewis\n\n#### [Successive Prompting for Decomposing Complex Questions](https://arxiv.org/abs/2212.04092)\t*8 Dec 2022*\n\nDheeru Dua, Shivanshu Gupta, Sameer Singh, Matt Gardner\n\n#### [Large Language Models are Versatile Decomposers: Decompose Evidence and Questions for Table-based Reasoning](https://arxiv.org/abs/2301.13808)\t*31 Jan 2023*\n\nYunhu Ye, Binyuan Hui, Min Yang, Binhua Li, Fei Huang, Yongbin Li\n\n\n\n#### Others\n\n#### [Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents](https://arxiv.org/abs/2201.07207)\t*18 Jan 2022*\n\nWenlong Huang, Pieter Abbeel, Deepak Pathak, Igor Mordatch\n\n#### [Selection-Inference: Exploiting Large Language Models for Interpretable Logical Reasoning](https://arxiv.org/abs/2205.09712)\t*19 May 2022*\n\nAntonia Creswell, Murray Shanahan, Irina Higgins\n\n#### [Maieutic Prompting: Logically Consistent Reasoning with Recursive Explanations](https://arxiv.org/abs/2205.11822)\t*24 May 2022*\n\nJaehun Jung, Lianhui Qin, Sean Welleck, Faeze Brahman, Chandra Bhagavatula, Ronan Le Bras, Yejin Choi\n\n#### [Faithful Reasoning Using Large Language Models](https://arxiv.org/abs/2208.14271)\t*30 Aug 2022*\n\nAntonia Creswell, Murray Shanahan\n\n#### [Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering](https://arxiv.org/abs/2209.09513)\t*20 Sep 2022*\n\nPan Lu, Swaroop Mishra, Tony Xia, Liang Qiu, Kai-Wei Chang, Song-Chun Zhu, Oyvind Tafjord, Peter Clark, Ashwin Kalyan\n\n#### [Explanations from Large Language Models Make Small Reasoners Better](https://arxiv.org/abs/2210.06726)\t*13 Oct 2022*\n\nShiyang Li, Jianshu Chen, Yelong Shen, Zhiyu Chen, Xinlu Zhang, Zekun Li, Hong Wang, Jing Qian, Baolin Peng, Yi Mao, Wenhu Chen, Xifeng Yan\n\n#### [Distilling Multi-Step Reasoning Capabilities of Large Language Models into Smaller Models via Semantic Decompositions](https://arxiv.org/abs/2212.00193)\t*1 Dec 2022*\n\nKumar Shridhar, Alessandro Stolfo, Mrinmaya Sachan\n\n#### [Teaching Small Language Models to Reason](https://arxiv.org/abs/2212.08410)\t*16 Dec 2022*\n\nLucie Charlotte Magister, Jonathan Mallinson, Jakub Adamek, Eric Malmi, Aliaksei Severyn\n\n#### [LAMBADA: Backward Chaining for Automated Reasoning in Natural Language](https://arxiv.org/abs/2212.13894)\t*20 Dec 2022*\n\nSeyed Mehran Kazemi, Najoung Kim, Deepti Bhatia, Xin Xu, Deepak Ramachandran\n\n#### [Reasoning with Language Model is Planning with World Model](https://arxiv.org/abs/2305.14992) *24 May 2023*\n\nShibo Hao, Yi Gu, Haodi Ma, Joshua Jiahua Hong, Zhen Wang, Daisy Zhe Wang, Zhiting Hu\n\n\n\n### Hybrid Method\n\n#### Reasoning-Enhanced Training and Prompting\n\n#### [Reasoning Like Program Executors](https://arxiv.org/abs/2201.11473)\t*27 Jan 2022*\n\nXinyu Pi, Qian Liu, Bei Chen, Morteza Ziyadi, Zeqi Lin, Qiang Fu, Yan Gao, Jian-Guang Lou, Weizhu Chen\n\n#### [Solving Quantitative Reasoning Problems with Language Models](https://arxiv.org/abs/2206.14858)\t*29 Jun 2022*\n\nAitor Lewkowycz, Anders Andreassen, David Dohan, Ethan Dyer, Henryk Michalewski, Vinay Ramasesh, Ambrose Slone, Cem Anil, Imanol Schlag, Theo Gutman-Solo, Yuhuai Wu, Behnam Neyshabur, Guy Gur-Ari, Vedant Misra\n\n#### [Exploring Length Generalization in Large Language Models](https://arxiv.org/abs/2207.04901)\t*11 Jul 2022*\n\nCem Anil, Yuhuai Wu, Anders Andreassen, Aitor Lewkowycz, Vedant Misra, Vinay Ramasesh, Ambrose Slone, Guy Gur-Ari, Ethan Dyer, Behnam Neyshabur\n\n#### [Scaling Instruction-Finetuned Language Models](https://arxiv.org/abs/2210.11416)\t*20 Oct 2022*\n\nHyung Won Chung, Le Hou, Shayne Longpre, Barret Zoph, Yi Tay, William Fedus, Yunxuan Li, Xuezhi Wang, Mostafa Dehghani, Siddhartha Brahma, Albert Webson, Shixiang Shane Gu, Zhuyun Dai, Mirac Suzgun, Xinyun Chen, Aakanksha Chowdhery, Alex Castro-Ros, Marie Pellat, Kevin Robinson, Dasha Valter, Sharan Narang, Gaurav Mishra, Adams Yu, Vincent Zhao, Yanping Huang, Andrew Dai, Hongkun Yu, Slav Petrov, Ed H. Chi, Jeff Dean, Jacob Devlin, Adam Roberts, Denny Zhou, Quoc V. Le, Jason Wei\n\n#### [Galactica: A Large Language Model for Science](https://arxiv.org/abs/2211.09085)\t*16 Nov 2022*\n\nRoss Taylor, Marcin Kardas, Guillem Cucurull, Thomas Scialom, Anthony Hartshorn, Elvis Saravia, Andrew Poulton, Viktor Kerkez, Robert Stojnic\n\n#### [ALERT: Adapting Language Models to Reasoning Tasks](https://arxiv.org/abs/2212.08286)\t*16 Dec 2022*\n\nPing Yu, Tianlu Wang, Olga Golovneva, Badr Alkhamissy, Gargi Ghosh, Mona Diab, Asli Celikyilmaz\n\n#### Bootstrapping and Self-Improving\n\n#### [STaR: Bootstrapping Reasoning With Reasoning](https://arxiv.org/abs/2203.14465)\t*28 Mar 2022*\n\nEric Zelikman, Yuhuai Wu, Jesse Mu, Noah D. Goodman\n\n#### [Language Models Can Teach Themselves to Program Better](https://arxiv.org/abs/2207.14502)\t*29 Jul 2022*\n\nPatrick Haluptzok, Matthew Bowers, Adam Tauman Kalai\n\n#### [Large Language Models Can Self-Improve](https://arxiv.org/abs/2210.11610)\t*20 Oct 2022*\n\nJiaxin Huang, Shixiang Shane Gu, Le Hou, Yuexin Wu, Xuezhi Wang, Hongkun Yu, Jiawei Han\n\n\n\n## Evaluation and Analysis\n\n#### [Are NLP Models really able to Solve Simple Math Word Problems?](https://arxiv.org/abs/2103.07191)\t*12 Mar 2021*\n\nArkil Patel, Satwik Bhattamishra, Navin Goyal\n\n#### [Impact of Pretraining Term Frequencies on Few-Shot Reasoning](https://arxiv.org/abs/2202.07206)\t*15 Feb 2022*\n\nYasaman Razeghi, Robert L. Logan IV, Matt Gardner, Sameer Singh\n\n#### [Are Large Pre-Trained Language Models Leaking Your Personal Information?](https://arxiv.org/abs/2205.12628)\t*25 May 2022*\n\nJie Huang, Hanyin Shao, Kevin Chen-Chuan Chang\n\n#### [Large Language Models Still Can't Plan (A Benchmark for LLMs on Planning and Reasoning about Change)](https://arxiv.org/abs/2206.10498)\t*21 Jun 2022*\n\nKarthik Valmeekam, Alberto Olmo, Sarath Sreedharan, Subbarao Kambhampati\n\n#### [Exploring Length Generalization in Large Language Models](https://arxiv.org/abs/2207.04901)\t*11 Jul 2022*\n\nCem Anil, Yuhuai Wu, Anders Andreassen, Aitor Lewkowycz, Vedant Misra, Vinay Ramasesh, Ambrose Slone, Guy Gur-Ari, Ethan Dyer, Behnam Neyshabur\n\n#### [Language models show human-like content effects on reasoning](https://arxiv.org/abs/2207.07051)\t*14 Jul 2022*\n\nIshita Dasgupta, Andrew K. Lampinen, Stephanie C. Y. Chan, Antonia Creswell, Dharshan Kumaran, James L. McClelland, Felix Hill\n\n#### [FOLIO: Natural Language Reasoning with First-Order Logic](https://arxiv.org/abs/2209.00840)\t*2 Sep 2022*\n\nSimeng Han, Hailey Schoelkopf, Yilun Zhao, Zhenting Qi, Martin Riddell, Luke Benson, Lucy Sun, Ekaterina Zubova, Yujie Qiao, Matthew Burtell, David Peng, Jonathan Fan, Yixin Liu, Brian Wong, Malcolm Sailor, Ansong Ni, Linyong Nan, Jungo Kasai, Tao Yu, Rui Zhang, Shafiq Joty, Alexander R. Fabbri, Wojciech Kryscinski, Xi Victoria Lin, Caiming Xiong, Dragomir Radev\n\n#### [Language Models Are Greedy Reasoners: A Systematic Formal Analysis of Chain-of-Thought](https://arxiv.org/abs/2210.01240)\t*3 Oct 2022*\n\nAbulhair Saparov, He He\n\n#### [Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them](https://arxiv.org/abs/2210.09261)\t*17 Oct 2022*\n\nMirac Suzgun, Nathan Scales, Nathanael Schärli, Sebastian Gehrmann, Yi Tay, Hyung Won Chung, Aakanksha Chowdhery, Quoc V. Le, Ed H. Chi, Denny Zhou, Jason Wei\n\n#### [Large language models are not zero-shot communicators](https://arxiv.org/abs/2210.14986)\t*26 Oct 2022*\n\nLaura Ruis, Akbir Khan, Stella Biderman, Sara Hooker, Tim Rocktäschel, Edward Grefenstette\n\n#### [ROSCOE: A Suite of Metrics for Scoring Step-by-Step Reasoning](https://arxiv.org/abs/2212.07919)\t*15 Dec 2022*\n\nOlga Golovneva, Moya Chen, Spencer Poff, Martin Corredor, Luke Zettlemoyer, Maryam Fazel-Zarandi, Asli Celikyilmaz\n\n#### [Towards Understanding Chain-of-Thought Prompting: An Empirical Study of What Matters](https://arxiv.org/abs/2212.10001)\t*20 Dec 2022*\n\nBoshi Wang, Sewon Min, Xiang Deng, Jiaming Shen, You Wu, Luke Zettlemoyer, Huan Sun\n\n\n\n## Citation\n\nIf you find this repo useful, please kindly cite our survey:\n\n```\n@article{huang2022towards,\n  title={Towards Reasoning in Large Language Models: A Survey},\n  author={Huang, Jie and Chang, Kevin Chen-Chuan},\n  journal={arXiv preprint arXiv:2212.10403},\n  year={2022}\n}\n```\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fjeffhj%2FLM-reasoning","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fjeffhj%2FLM-reasoning","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fjeffhj%2FLM-reasoning/lists"}