Projects in Awesome Lists tagged with multimodal-llms
A curated list of projects in awesome lists tagged with multimodal-llms .
https://github.com/aimagelab/llava-more
LLaVA-MORE: A Comparative Study of LLMs and Visual Backbones for Enhanced Visual Instruction Tuning
deepseek-r1 gemma-2 llama3 llama3-1 llama3-vision llava llava-llama3 llms multimodal-llms siglip siglip2 vision-and-language
Last synced: 06 Apr 2025
https://github.com/aimagelab/LLaVA-MORE
LLaVA-MORE: Enhancing Visual Instruction Tuning with LLaMA 3.1
llama3 llama3-1 llama3-vision llava llava-llama3 llms multimodal-llms vision-and-language
Last synced: 30 Dec 2024
https://github.com/adm-2005/picnarrate-image-captioner
A tool for generating accurate and detailed captions for images.
blip image-captioning multimodal-llms
Last synced: 22 Mar 2025