Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists tagged with lmm
A curated list of projects in awesome lists tagged with lmm .
https://github.com/roboflow/multimodal-maestro
Effective prompting for Large Multimodal Models like GPT-4 Vision, LLaVA or CogVLM. 🔥
cross-modal gpt-4 gpt-4-vision instance-segmentation llava lmm multimodality object-detection prompt-engineering segment-anything vision-language-model visual-prompting
Last synced: 02 Aug 2024
https://github.com/BAAI-Agents/Cradle
The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvment, and skill curation, in a standardized general environment with minimal requirements.
ai ai-agent ai-agents-framework computer-control cradle foundation-agent gcc general-computer-control generative-ai grounding large-language-models llm lmm multimodality personoid vision-language-model vlm
Last synced: 03 Aug 2024
https://github.com/nvlabs/eagle
EAGLE: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders
demo eagle gpt4 huggingface large-language-models llama llama3 llava llm lmm lvlm mllm nvdia
Last synced: 01 Oct 2024
https://github.com/NVlabs/EAGLE
EAGLE: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders
demo eagle gpt4 huggingface large-language-models llama llama3 llava llm lmm lvlm mllm nvdia
Last synced: 26 Sep 2024
https://tiger-ai-lab.github.io/Mantis/
Official code for Paper "Mantis: Multi-Image Instruction Tuning"
fuyu language llava-llama3 lmm mantis mllm multi-image-understanding multimodal video vision vlm
Last synced: 01 Aug 2024