Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

awesome-large-vision-language-model

Awesome Large Vision-Language Model: A Curated List of Large Vision-Language Model
https://github.com/superbrucejia/awesome-large-vision-language-model

Last synced: 3 days ago
JSON representation

  • Survey

  • Multimodal Large Language Models

    • Alignment Before Projection

      • [Paper - YuanGroup/Video-LLaVA)]\
    • Intermediate Networks

      • [Paper - GPT/NExT-GPT)] [[Webpage](https://next-gpt.github.io/)]\
      • [Paper - PLUG/mPLUG-Owl)] [[Webpage](https://www.modelscope.cn/studios/iic/mPLUG-Owl)]\
      • [Paper - ai-lab/MiniGPT-5)] [[Webpage](https://eric-ai-lab.github.io/minigpt-5.github.io/)]\
      • [Paper - NLP-SG/Video-LLaMA)] [[Video](https://www.youtube.com/watch?v=RDNYs3Rswhc)]\
      • [Paper - CAIR/MiniGPT-4)] [[Dataset](https://huggingface.co/datasets/Vision-CAIR/cc_sbu_align)] [[Webpage](https://minigpt-4.github.io/)]\
      • [Paper - Adapter)]\
      • [Paper - research/bubogpt)] [[Webpage](https://bubo-gpt.github.io/)]\
      • [Paper
      • [Paper - LLM)] [[Webpage](https://x-llm.github.io/)]\
    • Feature-level Fusion

    • Linear Layers Projection

      • [Paper - VL/LLaVA-NeXT)] [[Webpage](https://next-gpt.github.io/)]\
      • [Paper - VL/LLaVA-NeXT)] [[Webpage](https://llava-vl.github.io/blog/2024-06-16-llava-next-interleave/)]\
      • [Paper - YuanGroup/MoE-LLaVA)]\
      • [Paper - oryx/Video-ChatGPT)] [[Webpage](https://mbzuai-oryx.github.io/Video-ChatGPT/)]\
      • [Paper - liu/LLaVA)] [[Webpage](https://llava-vl.github.io/)]\
      • [Paper - liu/LLaVA)] [[Webpage](https://llava-vl.github.io/)]\
      • [Paper - Code/tree/main/CoDi-2)] [[Webpage](https://codi-2.github.io/)]\
      • [Paper - XInstructBLIP)] [[Webpage](https://artemisp.github.io/X-InstructBLIP-page/)]\
      • [Paper
      • [Paper - CAIR/MiniGPT-4)] [[Webpage](https://minigpt-v2.github.io/)]\
      • [Paper - VL)] [[Webpage](https://qianwen.aliyun.com/)]\
      • [Paper
      • [Paper
      • [Paper
      • [Paper
      • [Paper - gpt.github.io/)]\
      • [Paper
    • Prompt Tuning

  • Contrastive Language-Image Pre-Training

    • Intermediate Networks

      • [Paper
      • [Paper
      • [Paper - pytorch)] [[Video](https://www.youtube.com/watch?v=smUHQndcmOY)] \
    • Simple Contrastive Learning Paradigms

    • Preference Alignment

      • [Paper - rlhf/LLaVA-RLHF)] [[Webpage](https://llava-rlhf.github.io/)]\
  • Universal Embedding Space

  • Training Recipes