https://github.com/2toinf/X-VLA

[ICLR 2026] The offical Implementation of "Soft-Prompted Transformer as Scalable Cross-Embodiment Vision-Language-Action Model"
https://github.com/2toinf/X-VLA

cloth-folding florence-2 manipulation pretrained-models robotics robotics-dataset vision-language-action-model vision-language-model

Last synced: about 1 month ago
JSON representation

[ICLR 2026] The offical Implementation of "Soft-Prompted Transformer as Scalable Cross-Embodiment Vision-Language-Action Model"

Host: GitHub
URL: https://github.com/2toinf/X-VLA
Owner: 2toinf
License: apache-2.0
Created: 2025-09-25T07:49:18.000Z (10 months ago)
Default Branch: main
Last Pushed: 2026-03-09T13:30:04.000Z (5 months ago)
Last Synced: 2026-03-09T18:10:46.778Z (5 months ago)
Topics: cloth-folding, florence-2, manipulation, pretrained-models, robotics, robotics-dataset, vision-language-action-model, vision-language-model
Language: C++
Homepage: https://thu-air-dream.github.io/X-VLA/
Size: 56.5 MB
Stars: 544
Watchers: 2
Forks: 43
Open Issues: 21
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

awesome-robotics-ee-opensource - GitHub

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/2toinf/X-VLA

Awesome Lists containing this project