Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Awesome-LLM-3D
Awesome-LLM-3D: a curated list of Multi-modal Large Language Model resources in the 3D world.
https://github.com/ActiveVisionLab/Awesome-LLM-3D
Last synced: 5 days ago
JSON representation
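The indexing service also exposes each list as JSON. Below is a minimal sketch of fetching that JSON representation with Python; the endpoint path and response fields are illustrative assumptions (consult the ecosyste.ms API documentation for the actual routes), not a documented contract.

```python
# Minimal sketch: fetch the JSON representation of an indexed awesome list.
# NOTE: the endpoint path and field names below are assumptions for
# illustration only; consult the ecosyste.ms API docs for real routes.
import requests

API_BASE = "https://awesome.ecosyste.ms/api/v1"  # assumed base URL


def fetch_list(name: str) -> dict:
    """Fetch one indexed list (e.g. 'Awesome-LLM-3D') as a JSON dict."""
    resp = requests.get(f"{API_BASE}/lists/{name}", timeout=30)
    resp.raise_for_status()
    return resp.json()


if __name__ == "__main__":
    data = fetch_list("Awesome-LLM-3D")
    # Field names here are illustrative guesses.
    print(data.get("name"), data.get("url"))
```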
3D Understanding via LLM
- LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning
- Situational Awareness Matters in 3D Vision Language Reasoning
- SpatialPIN: Enhancing Spatial Reasoning Capabilities of Vision-Language Models through Prompting and Interacting 3D Priors
- ShapeLLM: Universal 3D Object Understanding for Embodied Interaction
- LiDAR-LLM: Exploring the Potential of Large Language Models for 3D LiDAR Understanding
- 3DAxiesPrompts: Unleashing the 3D Spatial Task Capabilities of GPT-4V
- Chat-3D v2: Bridging 3D Scene and Large Language Models with Object Identifiers
- GPT4Point: A Unified Framework for Point-Language Understanding and Generation
- Visual Programming for Zero-shot Open-Vocabulary 3D Visual Grounding
- JM3D & JM3D-LLM: Elevating 3D Representation with Joint Multi-modal Cues
- Zero-Shot 3D Shape Correspondence
- LLM-Grounder: Open-Vocabulary 3D Visual Grounding with Large Language Model as an Agent
- Point-Bind & Point-LLM: Aligning Point Cloud with Multi-modality for 3D Understanding, Generation, and Instruction Following
- PointLLM: Empowering Large Language Models to Understand Point Clouds
- Chat-3D: Data-efficiently Tuning Large Language Model for Universal Dialogue of 3D Scenes
- 3D-VisTA: Pre-trained Transformer for 3D Vision and Text Alignment
- 3D-LLM: Injecting the 3D World into Large Language Models
- ViewRefer: Grasp the Multi-view Knowledge for 3D Visual Grounding
- Leveraging Large (Visual) Language Models for Robot 3D Scene Understanding
- Uni3D: Exploring Unified 3D Representation at Scale
- Chat-Scene: Bridging 3D Scene and Large Language Models with Object Identifiers
- More Text, Less Point: Towards 3D Data-Efficient Point-Language Understanding
- MiniGPT-3D: Efficiently Aligning 3D Point Clouds with Large Language Models using 2D Priors
- LLaNA: Large Language and NeRF Assistant
- LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D-awareness
3D Understanding via other Foundation Models
- LERF: Language Embedded Radiance Fields
- Contrastive Lift: 3D Object Instance Segmentation by Slow-Fast Contrastive Fusion
- CLIP2Scene: Towards Label-efficient 3D Scene Understanding by CLIP
- PLA: Language-Driven Open-Vocabulary 3D Scene Understanding
- UniT3D: A Unified Transformer for 3D Dense Captioning and Visual Grounding
- CLIP goes 3D: Leveraging Prompt Tuning for Language Grounded 3D Recognition
- From Language to 3D Worlds: Adapting Language Model for Point Cloud Perception
- OpenNerf: Open Set 3D Neural Scene Segmentation with Pixel-Wise Features and Rendered Novel Views
- CLIP-FO3D: Learning Free Open-world 3D Scene Representations from 2D Dense CLIP
- VL-Fields: Towards Language-Grounded Neural Implicit Spatial Representations
- CLIP-Guided Vision-Language Pre-training for Question Answering in 3D Scenes
- Multi-CLIP: Contrastive Vision-Language Pre-training for Question Answering tasks in 3D Scenes
- Weakly Supervised 3D Open-vocabulary Segmentation
- RegionPLC: Regional Point-Language Contrastive Learning for Open-World 3D Scene Understanding
- OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation
- Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding
- N2F2: Hierarchical Scene Understanding with Nested Neural Feature Fields
- Any2Point: Empowering Any-modality Large Models for Efficient 3D Understanding
- ConceptFusion: Open-set Multimodal 3D Mapping
- SAI3D: Segment Any Instance in 3D Scenes
- Open3DIS: Open-vocabulary 3D Instance Segmentation with 2D Mask Guidance
- OVIR-3D: Open-Vocabulary 3D Instance Retrieval Without Training on 3D Data
- OpenMask3D: Open-Vocabulary 3D Instance Segmentation
- Open-Fusion: Real-time Open-Vocabulary 3D Mapping and Queryable Scene Representation
- CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Detection
- OpenScene: 3D Scene Understanding with Open Vocabularies
- Semantic Abstraction: Open-World 3D Scene Understanding from 2D Vision-Language Models
- Language-Grounded Indoor 3D Semantic Segmentation in the Wild
- Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models
3D Reasoning
3D Generation
- 3D-GPT: Procedural 3D Modeling with Large Language Models
- MeshGPT: Generating Triangle Meshes with Decoder-Only Transformers
- ShapeGPT: 3D Shape Generation with A Unified Multi-modal Language Model
- DreamLLM: Synergistic Multimodal Comprehension and Creation
- LLMR: Real-time Prompting of Interactive Worlds using Large Language Models
- DreamFace: Progressive Generation of Animatable 3D Faces under Text Guidance
3D Embodied Agent
- RT-1: Robotics Transformer for Real-World Control at Scale
- RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control
- SayPlan: Grounding Large Language Models using 3D Scene Graphs for Scalable Robot Task Planning
- Unified Human-Scene Interaction via Prompted Chain-of-Contacts
- LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models
- See and Think: Embodied Agent in Virtual Environment
- On Bringing Robots Home
- VoxPoser: Composable 3D Value Maps for Robotic Manipulation with Language Models
- Diffusion-based Generation, Optimization, and Planning in 3D Scenes
- SpatialVLM: Endowing Vision-Language Models with Spatial Reasoning Capabilities
- An Embodied Generalist Agent in 3D World
- Open-vocabulary Queryable Scene Representations for Real World Planning
- CLIP-Fields: Weakly Supervised Semantic Fields for Robotic Memory
3D Benchmarks
- ScanQA: 3D Question Answering for Spatial Scene Understanding
- Scan2Cap: Context-aware Dense Captioning in RGB-D Scans
- SQA3D: Situated Question Answering in 3D Scenes
- Evaluating VLMs for Score-Based, Multi-Probe Annotation of 3D Objects
- M3DBench: Let's Instruct Large Models with Multi-modal 3D Prompts
- ScanRefer: 3D Object Localization in RGB-D Scans using Natural Language
- 3D-GRAND: A Million-Scale Dataset for 3D-LLMs with Better Grounding and Less Hallucination
- SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding
- EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI
- ReferIt3D: Neural Listeners for Fine-Grained 3D Object Identification in Real-World Scenes
- Multi-modal Situated Reasoning in 3D Scenes
- SpatialRGPT: Grounded Spatial Reasoning in Vision Language Models
- Looking at words and points with attention: a benchmark for text-to-shape coherence
Acknowledgement
🔥 News
- 2023-12-16
- 2024-01-06 - Reordered the list to better follow the latest advances.
- 2024-05-16
Star History
- [Star History Chart](https://star-history.com/#ActiveVisionLab/Awesome-LLM-3D&Date)