https://github.com/skyworkai/vitron
NeurIPS 2024 Paper: A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing
- Host: GitHub
- URL: https://github.com/skyworkai/vitron
- Owner: SkyworkAI
- Created: 2024-03-18T04:26:51.000Z (about 2 years ago)
- Default Branch: main
- Last Pushed: 2024-10-20T15:58:07.000Z (over 1 year ago)
- Last Synced: 2025-05-07T17:41:59.075Z (about 1 year ago)
- Topics: mllm, multimodal-large-language-models, segmentation
- Language: Python
- Homepage: https://vitron-llm.github.io/
- Size: 667 MB
- Stars: 531
- Watchers: 15
- Forks: 33
- Open Issues: 15
Metadata Files:
- Readme: README.md