An open API service indexing awesome lists of open source software.

https://github.com/mbzuai-oryx/groundinglmm

Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks [CVPR 2024].
https://github.com/mbzuai-oryx/groundinglmm

foundation-models llm-agent lmm vision-and-language vision-language-model

Last synced: 6 months ago
JSON representation

Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks [CVPR 2024].

Awesome Lists containing this project