Projects in Awesome Lists tagged with visual-grounding
A curated list of projects in awesome lists tagged with visual-grounding.
https://github.com/daveredrum/scanrefer
[ECCV 2020] ScanRefer: 3D Object Localization in RGB-D Scans using Natural Language
3d computer-vision dataset deep-learning eccv natural-language-processing point-cloud pytorch visual-grounding
Last synced: 09 Oct 2025
https://github.com/LeapLabTHU/Pseudo-Q
[CVPR 2022] Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding
computer-vision cvpr2022 deep-learning multimodal-deep-learning pytorch vision-and-language visual-grounding
Last synced: 03 Apr 2025
https://github.com/3dlg-hcvc/m3dref-clip
[ICCV 2023] Multi3DRefer: Grounding Text Description to Multiple 3D Objects
3d clip computer-vision cuda deep-learning localization pytorch pytorch-lightning transformer visual-grounding
Last synced: 04 Aug 2025
https://github.com/svip-lab/lbylnet
[CVPR 2021] Look Before You Leap: Learning Landmark Features for One-Stage Visual Grounding.
cvpr cvpr2021 pytorch visual-grounding
Last synced: 04 Aug 2025
https://github.com/daveredrum/d3net
[ECCV 2022] D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding
3d caption-generation computer-vision deep-learning eccv eccv2022 natural-language-processing point-cloud semi-supervised-learning visual-grounding
Last synced: 09 Oct 2025
https://github.com/scofield7419/muie
MUIE: Multimodal Universal Information Extraction
information-extraction multimodal-information-extraction universal-information-extraction visual-grounding
Last synced: 05 Mar 2026
https://github.com/timbmg/belief
Implementation of the master's thesis "Belief State for Visually Grounded, Task-Oriented Neural Dialogue Model"
deep-learning dialog dialogue dialogue-systems machine-learning neural-network neural-networks nlg nlp nlproc pytorch visual-grounding
Last synced: 17 May 2026
https://github.com/izhx/phrase-grounding-with-pronoun
[EMNLP 2022] Extending Phrase Grounding with Pronouns in Visual Dialogues.
computer-vision phrase-grounding visual-dialog visual-grounding
Last synced: 01 Mar 2025
https://github.com/3dlg-hcvc/enet-scannet
Helper tools for extracting ENet features and projecting them onto ScanNet point clouds.
2d 3d computer-vision visual-grounding
Last synced: 13 Oct 2025
https://github.com/antonio-f/florence-2-test
Florence-2 quick test
colab-notebook florence-2 huggingface-transformers image-captioning image-to-text jupyter-notebook multimodal-large-language-models python referring-expression-comprehension tutorial vision-foundation-model visual-grounding
Last synced: 27 Apr 2026
https://github.com/tarasrashkevych99/visual-grounding
A deep learning project focused on the visual grounding task.
clip deep-learning visual-grounding
Last synced: 26 Mar 2025