An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with visual-reasoning

A curated list of projects in awesome lists tagged with visual-reasoning .

https://github.com/salesforce/blip

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

image-captioning image-text-retrieval vision-and-language-pre-training vision-language vision-language-transformer visual-question-answering visual-reasoning

Last synced: 14 May 2025

https://github.com/salesforce/BLIP

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

image-captioning image-text-retrieval vision-and-language-pre-training vision-language vision-language-transformer visual-question-answering visual-reasoning

Last synced: 16 Mar 2025

https://github.com/floodsung/deep-reasoning-papers

Recent Papers including Neural Symbolic Reasoning, Logical Reasoning, Visual Reasoning, planning and any other topics connecting deep learning and reasoning

logical-reasoning modularity neural-symbolic-reasoning physical-reasoning planning visual-reasoning

Last synced: 28 Jan 2026

https://github.com/floodsung/Deep-Reasoning-Papers

Recent Papers including Neural Symbolic Reasoning, Logical Reasoning, Visual Reasoning, planning and any other topics connecting deep learning and reasoning

logical-reasoning modularity neural-symbolic-reasoning physical-reasoning planning visual-reasoning

Last synced: 14 Mar 2025

https://github.com/thuml/miniveo3-reasoner

Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give it a star 🌟 if you find it useful.

chain-of-frames maze veo3 video-diffusion-model video-reasoning visual-planning visual-reasoning wan world-model

Last synced: 01 Apr 2026

https://github.com/shijx12/XNM-Net

Pytorch implementation of "Explainable and Explicit Visual Reasoning over Scene Graphs "

clevr cvpr2019 explainable-ai neural-module-networks scene-graph visual-reasoning

Last synced: 02 Apr 2025

https://github.com/hughplay/tvr

:boom: Transformation Driven Visual Reasoning - CVPR 2021

blender clevr cvpr2021 trance tvr visual-reasoning

Last synced: 12 Oct 2025

https://github.com/fscdc/rewardmap

[arxiv 2025] RewardMap: Tackling Sparse Rewards in Fine-grained Visual Reasoning via Multi-Stage Reinforcement Learning

grpo reasoning reinforcement-learning visual-reasoning

Last synced: 28 Oct 2025

https://github.com/aelnouby/relational-networks

Pytorch implementation of " A simple neural network module for relational reasoning" paper aka Relational networks for visual reasoning.

clevr pytorch relational-networks visual-reasoning

Last synced: 19 Jun 2025

https://github.com/andrewliao11/longperceptualthoughts

[COLM'25] The official implementation of "LongPerceptualThoughts: Distilling System-2 Reasoning for System-1 Perception"

computer-vision large-language-models reasoning reasoning-language-models vision-language-model visual-reasoning

Last synced: 23 Sep 2025

https://github.com/msmrexe/neurosymbolic-vqa-program-generator

A comprehensive implementation of a Neurosymbolic framework for Visual Question Answering (VQA) on the CLEVR dataset. This project translates natural language questions into symbolic programs using three different learning strategies: Supervised (LSTM & Transformer), Reinforcement Learning (REINFORCE), and In-Context Learning (LLM).

clevr course-project in-context-learning large-language-models lstm neurosymbolic neurosymbolic-ai policy-gradient program-generator pytorch reinforce reinforcement-learning seq2seq supervised-learning system-2 transformer university-project visual-question-answering visual-reasoning vqa

Last synced: 07 May 2026