An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with visual-grounding

A curated list of projects in awesome lists tagged with visual-grounding .

https://github.com/daveredrum/scanrefer

[ECCV 2020] ScanRefer: 3D Object Localization in RGB-D Scans using Natural Language

3d computer-vision dataset deep-learning eccv natural-language-processing point-cloud pytorch visual-grounding

Last synced: 09 Oct 2025

https://github.com/LeapLabTHU/Pseudo-Q

[CVPR 2022] Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding

computer-vision cvpr2022 deep-learning multimodal-deep-learning pytorch vision-and-language visual-grounding

Last synced: 03 Apr 2025

https://github.com/3dlg-hcvc/m3dref-clip

[ICCV 2023] Multi3DRefer: Grounding Text Description to Multiple 3D Objects

3d clip computer-vision cuda deep-learning localization pytorch pytorch-lightning transformer visual-grounding

Last synced: 04 Aug 2025

https://github.com/svip-lab/lbylnet

[CVPR2021] Look before you leap: learning landmark features for one-stage visual grounding.

cvpr cvpr2021 pytorch visual-grounding

Last synced: 04 Aug 2025

https://github.com/daveredrum/d3net

[ECCV2022] D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding

3d caption-generation computer-vision deep-learning eccv eccv2022 natural-language-processing point-cloud semi-supervised-learning visual-grounding

Last synced: 09 Oct 2025

https://github.com/timbmg/belief

Implementation of Master Thesis on "Belief State for Visually Grounded, Task-Oriented Neural Dialogue Model"

deep-learning dialog dialogue dialogue-systems machine-learning neural-network neural-networks nlg nlp nlproc pytorch visual-grounding

Last synced: 17 May 2026

https://github.com/izhx/phrase-grounding-with-pronoun

[EMNLP 22] Extending Phrase Grounding with Pronouns in Visual Dialogues.

computer-vision phrase-grounding visual-dialog visual-grounding

Last synced: 01 Mar 2025

https://github.com/3dlg-hcvc/enet-scannet

Helper tools for extracting and projecting ENet features to ScanNet pointclouds.

2d 3d computer-vision visual-grounding

Last synced: 13 Oct 2025

https://github.com/tarasrashkevych99/visual-grounding

This is a deep learning project focused on the visual grounding task

clip deep-learning visual-grounding

Last synced: 26 Mar 2025