# Visual Reasoning Papers

A curated list of visual reasoning papers.

- Last updated: 2022-11-09.
- Maintainer: [Xin Hong](https://hongxin2019.github.io)

## Visual Reasoning Papers on arXiv

[Trend chart of visual reasoning papers on arXiv](arxiv_visual_reasoning.md)

In addition to the papers listed below, we also provide an automatically generated [arXiv paper list](arxiv_visual_reasoning.md), which is updated monthly. Click the trend chart above to open it.

---

"★" means the paper introduces a new task or dataset.

## Survey Papers

- **Deep Learning Methods for Abstract Visual Reasoning: A Survey on Raven's Progressive Matrices**, Małkiński & Mańdziuk, *arXiv 2022*. [Paper](https://arxiv.org/abs/2201.12382)
- **A Review of Emerging Research Directions in Abstract Visual Reasoning**, Małkiński & Mańdziuk, *arXiv 2022*. [Paper](https://arxiv.org/abs/2202.10284)
- **Reasoning about Actions over Visual and Linguistic Modalities: A Survey**, Sampat et al., *arXiv 2022*. [Paper](https://arxiv.org/abs/2207.07568)

## Related Paper Lists & Tutorials

- [Deep-Reasoning-Papers](https://github.com/floodsung/Deep-Reasoning-Papers/): Recent papers on neural symbolic reasoning, logical reasoning, visual reasoning, planning, and other topics connecting deep learning and reasoning.
- [Awesome deep logic](https://github.com/ccclyu/awesome-deeplogic): A collection of papers on neural-symbolic AI (mainly focused on NLP applications).
- [Neural Machine Reasoning](https://neuralreasoning.github.io/): This tutorial reviews recent advances in dynamic neural networks that aim to reach a deliberative reasoning capability, going beyond the associative pattern matching at which deep learning currently excels.

## 2022
- ★ **WinoGAViL: Gamified Association Benchmark to Challenge Vision-and-Language Models**, Bitton et al., *NeurIPS 2022*. [Paper](https://openreview.net/forum?id=aJtVdI251Vv)
- ★ **REX: Reasoning-aware and Grounded Explanation**, Chen & Zhao, *CVPR 2022*. [Paper](https://ieeexplore.ieee.org/document/9879365/)
- ★ **The Abduction of Sherlock Holmes: A Dataset for Visual Abductive Reasoning**, Hessel et al., *arXiv 2022*. [Paper](https://arxiv.org/abs/2202.04800)
- ★ **Bongard-HOI: Benchmarking Few-Shot Visual Reasoning for Human-Object Interactions**, Jiang et al., *CVPR 2022*. [Paper](https://ieeexplore.ieee.org/document/9878697/)
- ★ **Maintaining Reasoning Consistency in Compositional Visual Question Answering**, Jing et al., *CVPR 2022*. [Paper](https://ieeexplore.ieee.org/document/9879826/)
- ★ **Visual Abductive Reasoning**, Liang et al., *CVPR 2022*. [Paper](https://ieeexplore.ieee.org/document/9880226/)
- ★ **QLEVR: A Diagnostic Dataset for Quantificational Language and Elementary Visual Reasoning**, Li & Søgaard, *NAACL Findings 2022*. [Paper](https://aclanthology.org/2022.findings-naacl.73)
- ★ **From Representation to Reasoning: Towards Both Evidence and Commonsense Reasoning for Video Question-Answering**, Li et al., *CVPR 2022*. [Paper](https://ieeexplore.ieee.org/document/9878800/)
- ★ **Visual Spatial Reasoning**, Liu et al., *arXiv 2022*. [Paper](https://arxiv.org/abs/2205.00363)
- **Grammar-Based Grounded Lexicon Learning**, Mao et al., *NeurIPS 2022*. [Paper](https://openreview.net/forum?id=iI6nkEZkOl)
- **RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning**, Ma et al., *ICLR 2022*. [Paper](https://openreview.net/forum?id=afoV8W3-IYp)
- ★ **IntPhys 2019: A Benchmark for Visual Intuitive Physics Understanding**, Riochet et al., *TPAMI 2022*.
- ★ **Winoground: Probing Vision and Language Models for Visio-Linguistic Compositionality**, Thrush et al., *CVPR 2022*. [Paper](https://ieeexplore.ieee.org/document/9878945/)
- ★ **Self-Supervised Spatial Reasoning on Multi-View Line Drawings**, Xiang et al., *CVPR 2022*. [Paper](https://ieeexplore.ieee.org/document/9879170/)
- **Learning Algebraic Representation for Systematic Generalization in Abstract Reasoning**, Zhang et al., *ECCV 2022*. [Paper](https://link.springer.com/content/pdf/10.1007/978-3-031-19842-7_40.pdf)
- ★ **VideoABC: A Real-World Video Dataset for Abductive Visual Reasoning**, Zhao et al., *TIP 2022*. [Paper](https://ieeexplore.ieee.org/abstract/document/9893026)

## 2021
- ★ **Scale-Localized Abstract Reasoning**, Benny et al., *CVPR 2021*. [Paper](https://ieeexplore.ieee.org/document/9577474/)
- **Grounding Physical Concepts of Objects and Events Through Dynamic Visual Reasoning**, Chen et al., *ICLR 2021*. [Paper](https://openreview.net/forum?id=bhCDO_cEGCz)
- **Meta Module Network for Compositional Visual Reasoning**, Chen et al., *WACV 2021*. [Paper](https://ieeexplore.ieee.org/document/9423385/)
- **Dynamic Visual Reasoning by Learning Differentiable Physics Models from Video and Language**, Ding et al., *NeurIPS 2021*. [Paper](https://proceedings.neurips.cc/paper/2021/hash/07845cd9aefa6cde3f8926d25138a3a2-Abstract.html)
- ★ **Transformation Driven Visual Reasoning**, Hong et al., *CVPR 2021*. [Paper](https://ieeexplore.ieee.org/document/9578722/)
- ★ **Stratified Rule-Aware Network for Abstract Visual Reasoning**, Hu et al., *AAAI 2021*. [Paper](https://ojs.aaai.org/index.php/AAAI/article/view/16248)
- **Interpretable Visual Reasoning via Induced Symbolic Space**, Wang et al., *ICCV 2021*. [Paper](https://ieeexplore.ieee.org/document/9710153/)

## 2020
- **Neuro-Symbolic Visual Reasoning: Disentangling "Visual" from "Reasoning"**, Amizadeh et al., *ICML 2020*. [Paper](http://proceedings.mlr.press/v119/amizadeh20a.html)
- ★ **CoPhy: Counterfactual Learning of Physical Dynamics**, Baradel et al., *ICLR 2020*. [Paper](https://openreview.net/forum?id=SkeyppEFvS)
- **Differentiable Adaptive Computation Time for Visual Reasoning**, Eyzaguirre & Soto, *CVPR 2020*. [Paper](https://doi.org/10.1109/CVPR42600.2020.01283)
- ★ **CATER: A diagnostic dataset for Compositional Actions & TEmporal Reasoning**, Girdhar & Ramanan, *ICLR 2020*. [Paper](https://openreview.net/forum?id=HJgzt2VKPB)
- **Forward Prediction for Physical Reasoning**, Girdhar et al., *arXiv 2020*. [Paper](https://arxiv.org/abs/2006.10734)
- **Dynamic Language Binding in Relational Visual Reasoning**, Le et al., *IJCAI 2020*. [Paper](https://doi.org/10.24963/ijcai.2020/114)
- ★ **Bongard-LOGO: A New Benchmark for Human-Level Concept Learning and Reasoning**, Nie et al., *NeurIPS 2020*. [Paper](https://proceedings.neurips.cc/paper/2020/hash/bf15e9bbff22c7719020f9df4badc20a-Abstract.html)
- ★ **VisualCOMET: Reasoning About the Dynamic Context of a Still Image**, Park et al., *ECCV 2020*. [Paper](https://doi.org/10.1007/978-3-030-58558-7_30)
- ★ **V-PROM: A Benchmark for Visual Reasoning Using Visual Progressive Matrices**, Teney et al., *AAAI 2020*. [Paper](https://aaai.org/ojs/index.php/AAAI/article/view/6885)
- **What Can Neural Networks Reason About?**, Xu et al., *ICLR 2020*. [Paper](https://openreview.net/forum?id=rJxbJeHFPS)
- ★ **CLEVRER: Collision Events for Video Representation and Reasoning**, Yi et al., *ICLR 2020*. [Paper](https://openreview.net/forum?id=HkxYzANYDB)

## 2019
- ★ **PHYRE: A New Benchmark for Physical Reasoning**, Bakhtin et al., *NeurIPS 2019*. [Paper](https://proceedings.neurips.cc/paper/2019/hash/4191ef5f6c1576762869ac49281130c9-Abstract.html)
- ★ **GQA: A New Dataset for Real-World Visual Reasoning and Compositional Question Answering**, Hudson & Manning, *CVPR 2019*. [Paper](http://openaccess.thecvf.com/content_CVPR_2019/html/Hudson_GQA_A_New_Dataset_for_Real-World_Visual_Reasoning_and_Compositional_CVPR_2019_paper.html)
- **Learning by Abstraction: The Neural State Machine**, Hudson & Manning, *NeurIPS 2019*. [Paper](https://proceedings.neurips.cc/paper/2019/hash/c20a7ce2a627ba838cfbff082db35197-Abstract.html)
- **Visual Reasoning by Progressive Module Networks**, Kim et al., *ICLR 2019*. [Paper](https://openreview.net/forum?id=B1fpDsAqt7)
- ★ **CLEVR-Ref+: Diagnosing Visual Reasoning With Referring Expressions**, Liu et al., *CVPR 2019*. [Paper](http://openaccess.thecvf.com/content_CVPR_2019/html/Liu_CLEVR-Ref_Diagnosing_Visual_Reasoning_With_Referring_Expressions_CVPR_2019_paper.html)
- **The Neuro-Symbolic Concept Learner: Interpreting Scenes, Words, and Sentences From Natural Supervision**, Mao et al., *ICLR 2019*. [Paper](https://openreview.net/forum?id=rJgMlhRctm)
- ★ **Robust Change Captioning**, Park et al., *ICCV 2019*. [Paper](https://doi.org/10.1109/ICCV.2019.00472)
- **Explainable and Explicit Visual Reasoning Over Scene Graphs**, Shi et al., *CVPR 2019*. [Paper](http://openaccess.thecvf.com/content_CVPR_2019/html/Shi_Explainable_and_Explicit_Visual_Reasoning_Over_Scene_Graphs_CVPR_2019_paper.html)
- ★ **A Corpus for Reasoning about Natural Language Grounded in Photographs**, Suhr et al., *ACL 2019*. [Paper](https://aclanthology.org/P19-1644)
- ★ **Visual Entailment: A Novel Task for Fine-Grained Image Understanding**, Xie et al., *arXiv 2019*. [Paper](https://arxiv.org/abs/1901.06706)
- ★ **From Recognition to Cognition: Visual Commonsense Reasoning**, Zellers et al., *CVPR 2019*. [Paper](http://openaccess.thecvf.com/content_CVPR_2019/html/Zellers_From_Recognition_to_Cognition_Visual_Commonsense_Reasoning_CVPR_2019_paper.html)
- **Learning Perceptual Inference by Contrasting**, Zhang et al., *NeurIPS 2019*. [Paper](https://proceedings.neurips.cc/paper/2019/hash/6766aa2750c19aad2fa1b32f36ed4aee-Abstract.html)
- ★ **RAVEN: A Dataset for Relational and Analogical Visual REasoNing**, Zhang et al., *CVPR 2019*. [Paper](http://openaccess.thecvf.com/content_CVPR_2019/html/Zhang_RAVEN_A_Dataset_for_Relational_and_Analogical_Visual_REasoNing_CVPR_2019_paper.html)

## 2018
- ★ **Measuring abstract reasoning in neural networks**, Santoro et al., *ICML 2018*. [Paper](http://proceedings.mlr.press/v80/santoro18a.html)
- **Compositional Attention Networks for Machine Reasoning**, Hudson & Manning, *ICLR 2018*. [Paper](https://openreview.net/forum?id=S1Euwz-Rb)
- **FiLM: Visual Reasoning with a General Conditioning Layer**, Perez et al., *AAAI 2018*. [Paper](https://www.aaai.org/ocs/index.php/AAAI/AAAI18/paper/view/16528)
- **Chain of Reasoning for Visual Question Answering**, Wu et al., *NeurIPS 2018*. [Paper](https://proceedings.neurips.cc/paper/2018/hash/31fefc0e570cb3860f2a6d4b38c6490d-Abstract.html)
- **Neural-Symbolic VQA: Disentangling Reasoning from Vision and Language Understanding**, Yi et al., *NeurIPS 2018*. [Paper](https://proceedings.neurips.cc/paper/2018/hash/5e388103a391daabe3de1d76a6739ccd-Abstract.html)

## 2017
- **Learning to Reason: End-to-End Module Networks for Visual Question Answering**, Hu et al., *ICCV 2017*. [Paper](https://doi.org/10.1109/ICCV.2017.93)
- ★ **CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning**, Johnson et al., *CVPR 2017*. [Paper](https://doi.org/10.1109/CVPR.2017.215)
- **Inferring and Executing Programs for Visual Reasoning**, Johnson et al., *ICCV 2017*. [Paper](https://doi.org/10.1109/ICCV.2017.325)
- **A simple neural network module for relational reasoning**, Santoro et al., *NeurIPS 2017*. [Paper](https://proceedings.neurips.cc/paper/2017/hash/e6acf4b0f69f6f6e60e9a815938aa1ff-Abstract.html)
- ★ **A Corpus of Natural Language for Visual Reasoning**, Suhr et al., *ACL 2017*. [Paper](https://aclanthology.org/P17-2034)

## 2016
- **Neural Module Networks**, Andreas et al., *CVPR 2016*. [Paper](https://doi.org/10.1109/CVPR.2016.12)
- ★ **Visual Storytelling**, Huang et al., *NAACL 2016*. [Paper](https://aclanthology.org/N16-1147)