Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Awesome-Referring-Image-Segmentation
:books: A collection of papers about Referring Image Segmentation.
https://github.com/MarkMoHR/Awesome-Referring-Image-Segmentation
Last synced: 5 days ago
JSON representation
-
5. Referring Video Object Segmentation
- Video Object Segmentation with Language Referring Expressions
- Temporal Collection and Distribution for Referring Video Object Segmentation
- HTML: Hybrid Temporal-scale Multimodal Learning Framework for Referring Video Object Segmentation
- OnlineRefer: A Simple Online Baseline for Referring Video Object Segmentation
- Spectrum-guided Multi-granularity Referring Video Object Segmentation - miao/SgMg) |
- Towards Robust Referring Video Object Segmentation with Cyclic Relational Consistency
- End-to-End Referring Video Object Segmentation with Multimodal Transformers
- Language as Queries for Referring Video Object Segmentation
- SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation
- Local-Global Context Aware Transformer for Language-Guided Video Segmentation
- Rethinking Cross-modal Interaction from a Top-down Perspective for Referring Video Object Segmentation
- ClawCraneNet: Leveraging Object-level Relation for Text-based Video Segmentation
- Video Object Segmentation with Language Referring Expressions
- Video Object Segmentation with Language Referring Expressions
- Video Object Segmentation with Language Referring Expressions
- Multi-Attention Network for Compressed Video Referring Object Segmentation
- Video Object Segmentation with Language Referring Expressions
- Video Object Segmentation with Language Referring Expressions
- Video Object Segmentation with Language Referring Expressions
- Decoupling Static and Hierarchical Motion Perception for Referring Video Segmentation
- LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation
- Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation
- Multi-Level Representation Learning with Semantic Alignment for Referring Video Object Segmentation
- You Only Infer Once: Cross-Modal Meta-Transfer for Referring Video Object Segmentation
- RefVOS: A Closer Look at Referring Expressions for Video Object Segmentation
- Video Object Segmentation with Language Referring Expressions
- Video Object Segmentation with Language Referring Expressions
- Video Object Segmentation with Language Referring Expressions
- Towards Robust Referring Video Object Segmentation with Cyclic Relational Consistency
- Video Object Segmentation with Language Referring Expressions
- Video Object Segmentation with Language Referring Expressions
- Video Object Segmentation with Language Referring Expressions
- Video Object Segmentation with Language Referring Expressions
- Video Object Segmentation with Language Referring Expressions
- Spectrum-guided Multi-granularity Referring Video Object Segmentation - miao/SgMg) |
- Video Object Segmentation with Language Referring Expressions
-
4. Referring Video Object Segmentation
- Video Object Segmentation with Language Referring Expressions
- Video Object Segmentation with Language Referring Expressions
- Video Object Segmentation with Language Referring Expressions
- Video Object Segmentation with Language Referring Expressions
- Video Object Segmentation with Language Referring Expressions
- Video Object Segmentation with Language Referring Expressions
- Video Object Segmentation with Language Referring Expressions
- Video Object Segmentation with Language Referring Expressions
- Video Object Segmentation with Language Referring Expressions
- Video Object Segmentation with Language Referring Expressions
- Video Object Segmentation with Language Referring Expressions
- Video Object Segmentation with Language Referring Expressions
- Video Object Segmentation with Language Referring Expressions
- Video Object Segmentation with Language Referring Expressions
- Video Object Segmentation with Language Referring Expressions
- Video Object Segmentation with Language Referring Expressions
- Video Object Segmentation with Language Referring Expressions
- Video Object Segmentation with Language Referring Expressions
- Video Object Segmentation with Language Referring Expressions
- Video Object Segmentation with Language Referring Expressions
- Video Object Segmentation with Language Referring Expressions
-
1. Datasets
- Referit game: Referring to objects in photographs of natural scenes
- Generation and comprehension of unambiguous object descriptions
- Modeling context in referring expressions
- CLEVR-Ref+: Diagnosing Visual Reasoning with Referring Expressions - ref+) |
- PhraseCut: Language-based Image Segmentation in the Wild
- ScanRefer: 3D Object Localization in RGB-D Scans using Natural Language
- ClevrTex: A Texture-Rich Benchmark for Unsupervised Multi-Object Segmentation
- Modeling context in referring expressions
- MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions
-
3. Traditional Referring Image Segmentation
- LISA: Reasoning Segmentation via Large Language Model - research/LISA) |
- Text Augmented Spatial-aware Zero-shot Referring Image Segmentation
- Bilateral Knowledge Interaction Network for Referring Image Segmentation
- Advancing Referring Expression Segmentation Beyond Single Image - res) |
- PolyFormer: Referring Image Segmentation as Sequential Polygon Generation - science/polygon-transformer) [[project]](https://polyformer.github.io/) |
- Contrastive Grouping with Transformer for Referring Image Segmentation
- Towards Robust Referring Image Segmentation - ref-seg) [[project]](https://lxtgh.github.io/project/robust_ref_seg/) |
- Segmentation from natural language expressions
- Unsupervised Domain Adaptation for Referring Semantic Segmentation
- Adaptive Selection based Referring Image Segmentation - coder/ASDA) |
- CARIS: Context-Aware Referring Image Segmentation
- Dual Convolutional LSTM Network for Referring Image Segmentation
- Two-stage Visual Cues Enhancement Network for Referring Image Segmentation - net) |
- Cascade Grouped Attention Network for Referring Expression Segmentation
- Vision-Aware Text Features in Referring Image Segmentation: From Object Understanding to Context Understanding
- LISA: Reasoning Segmentation via Large Language Model - research/LISA) |
- Bridging Vision and Language Encoders: Parameter-Efficient Tuning for Referring Image Segmentation
- Towards Robust Referring Image Segmentation - ref-seg) [[project]](https://lxtgh.github.io/project/robust_ref_seg/) |
- A Simple Baseline with Single-encoder for Referring Image Segmentation - Yu/Shared-RIS) |
- Mask Grounding for Referring Image Segmentation - grounding/) |
- Semantics-Aware Dynamic Localization and Refinement for Referring Image Segmentation
- CM-MaskSD: Cross-Modality Masked Self-Distillation for Referring Image Segmentation
- Pseudo-RIS: Distinctive Pseudo-supervision Generation for Referring Image Segmentation - Yu/Pseudo-RIS) |
- Curriculum Point Prompting for Weakly-Supervised Referring Image Segmentation
- GSVA: Generalized Segmentation via Multimodal Large Language Models
- Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation
- Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentation - Xuan/MRES) [[webpage]](https://rubics-xuan.github.io/MRES/) |
- Towards Robust Referring Image Segmentation - ref-seg) |
- Referring Image Segmentation via Joint Mask Contextual Embedding Learning and Progressive Alignment Network - segmentation) |
- Weakly Supervised Referring Image Segmentation with Intra-Chunk and Inter-Chunk Consistency
- Shatter and Gather: Learning Referring Image Segmentation with Text Supervision
- Referring Image Segmentation Using Text Supervision
- Beyond One-to-One: Rethinking the Referring Image Segmentation - DMMI) |
- Bridging Vision and Language Encoders: Parameter-Efficient Tuning for Referring Image Segmentation
- Segment Everything Everywhere All at Once - Decoder/Segment-Everything-Everywhere-All-At-Once) |
- SLViT: Scale-Wise Language-Guided Vision Transformer for Referring Image Segmentation
- WiCo: Win-win Cooperation of Bottom-up and Top-down Referring Image Segmentation
- Multi-Modal Mutual Attention and Iterative Interaction for Referring Image Segmentation
- X-Decoder: Generalized Decoding for Pixel, Image and Language - Decoder) [[project]](https://x-decoder-vl.github.io/) |
- Learning to Segment Every Referring Object Point by Point - RES) |
- Meta Compositional Referring Expression Segmentation
- Zero-shot Referring Image Segmentation with Global-Local Context Features - Yu/Zero-shot-RIS) |
- Learning From Box Annotations for Referring Image Segmentation - Supervised-RIS) |
- Instance-Specific Feature Propagation for Referring Segmentation
- LAVT: Language-Aware Vision Transformer for Referring Image Segmentation - RIS) |
- CRIS: CLIP-Driven Referring Image Segmentation
- ReSTR: Convolution-free Referring Image Segmentation Using Transformers
- Vision-Language Transformer and Query Generation for Referring Segmentation - Language-Transformer) |
- MDETR - Modulated Detection for End-to-End Multi-Modal Understanding
- Encoder Fusion Network with Co-Attention Embedding for Referring Image Segmentation
- Bottom-Up Shift and Reasoning for Referring Image Segmentation
- Locate then Segment: A Strong Pipeline for Referring Image Segmentation
- Linguistic Structure Guided Context Modeling for Referring Image Segmentation - Refseg) |
- Referring Image Segmentation via Cross-Modal Progressive Comprehension - Refseg) |
- Bi-directional Relationship Inferring Network for Referring Image Segmentation - BRINet) |
- PhraseCut: Language-based Image Segmentation in the Wild
- Multi-task Collaborative Network for Joint Referring Expression Comprehension and Segmentation
- See-Through-Text Grouping for Referring Image Segmentation
- Referring Expression Object Segmentation with Caption-Aware Consistency
- Cross-Modal Self-Attention Network for Referring Image Segmentation - Net) |
- Key-Word-Aware Network for Referring Expression Image Segmentation - word-aware-network-pycaffe) |
- Dynamic Multimodal Instance Segmentation Guided by Natural Language Queries - Uniandes/DMS) |
- Referring Image Segmentation via Recurrent Refinement Networks
- MAttNet: Modular Attention Network for Referring Expression Comprehension
- Recurrent Multimodal Interaction for Referring Image Segmentation - phrasecut-public) |
- Segmentation from natural language expressions
- Finding NeMo: Negative-mined Mosaic Augmentation for Referring Image Segmentation
- Prompt-Driven Referring Image Segmentation with Instance Contrasting
- LQMFormer: Language-aware Query Mask Transformer for Referring Image Segmentation
- GTMS: A Gradient-driven Tree-guided Mask-free Referring Image Segmentation Method
- SAM4MLLM: Enhance Multi-Modal Large Language Model for Referring Expression Segmentation - Application-and-Integration-Lab/SAM4MLLM) |
- GRES: Generalized Referring Expression Segmentation
- ReMamber: Referring Image Segmentation with Mamba Twister - rain-song/ReMamber) |
- SafaRi: Adaptive Sequence Transformer for Weakly Supervised Referring Expression Segmentation
- Segment Everything Everywhere All at Once - Decoder/Segment-Everything-Everywhere-All-At-Once) |
-
2. Traditional Referring Image Segmentation
-
3. Interactive Referring Image Segmentation
-
5. Referring 3D Instance Segmentation
-
6. Referring 3D Instance Segmentation
-
2. Challenges
- RVOS Challenge - scale Video Object Segmentation Challenge](https://lsvos.github.io/) | Aug 2024| [[CodaLab]](https://codalab.lisn.upsaclay.fr/competitions/19583) |
- 1st MeViS Challenge - level Video Understanding in the Wild](https://www.vspwdataset.com/Workshop2024.html) | May 2024| [[CodaLab]](https://codalab.lisn.upsaclay.fr/competitions/15094) |
Programming Languages
Categories
3. Traditional Referring Image Segmentation
75
5. Referring Video Object Segmentation
36
4. Referring Video Object Segmentation
21
1. Datasets
9
2. Traditional Referring Image Segmentation
2
2. Challenges
2
6. Referring 3D Instance Segmentation
1
5. Referring 3D Instance Segmentation
1
3. Interactive Referring Image Segmentation
1
Sub Categories