Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

awesome-large-vision-language-model

Awesome Large Vision-Language Model: A Curated List of Large Vision-Language Model
https://github.com/superbrucejia/awesome-large-vision-language-model

Last synced: 3 days ago
JSON representation

Survey
- [Paper
- [Paper - llms.github.io/archives/)]\
- [Paper - Multimodal-Large-Language-Models)]\
- [Paper
- [Paper - tutorial.github.io/2023/)]\
Multimodal Large Language Models
- Alignment Before Projection
  - [Paper - YuanGroup/Video-LLaVA)]\
- Intermediate Networks
  - [Paper - GPT/NExT-GPT)] [[Webpage](https://next-gpt.github.io/)]\
  - [Paper - PLUG/mPLUG-Owl)] [[Webpage](https://www.modelscope.cn/studios/iic/mPLUG-Owl)]\
  - [Paper - ai-lab/MiniGPT-5)] [[Webpage](https://eric-ai-lab.github.io/minigpt-5.github.io/)]\
  - [Paper - NLP-SG/Video-LLaMA)] [[Video](https://www.youtube.com/watch?v=RDNYs3Rswhc)]\
  - [Paper - CAIR/MiniGPT-4)] [[Dataset](https://huggingface.co/datasets/Vision-CAIR/cc_sbu_align)] [[Webpage](https://minigpt-4.github.io/)]\
  - [Paper - Adapter)]\
  - [Paper - research/bubogpt)] [[Webpage](https://bubo-gpt.github.io/)]\
  - [Paper
  - [Paper - LLM)] [[Webpage](https://x-llm.github.io/)]\
- Feature-level Fusion
  - [Paper
  - [Paper
- Linear Layers Projection
  - [Paper - VL/LLaVA-NeXT)] [[Webpage](https://next-gpt.github.io/)]\
  - [Paper - VL/LLaVA-NeXT)] [[Webpage](https://llava-vl.github.io/blog/2024-06-16-llava-next-interleave/)]\
  - [Paper - YuanGroup/MoE-LLaVA)]\
  - [Paper - oryx/Video-ChatGPT)] [[Webpage](https://mbzuai-oryx.github.io/Video-ChatGPT/)]\
  - [Paper - liu/LLaVA)] [[Webpage](https://llava-vl.github.io/)]\
  - [Paper - liu/LLaVA)] [[Webpage](https://llava-vl.github.io/)]\
  - [Paper - Code/tree/main/CoDi-2)] [[Webpage](https://codi-2.github.io/)]\
  - [Paper - XInstructBLIP)] [[Webpage](https://artemisp.github.io/X-InstructBLIP-page/)]\
  - [Paper
  - [Paper - CAIR/MiniGPT-4)] [[Webpage](https://minigpt-v2.github.io/)]\
  - [Paper - VL)] [[Webpage](https://qianwen.aliyun.com/)]\
  - [Paper
  - [Paper
  - [Paper
  - [Paper
  - [Paper - gpt.github.io/)]\
  - [Paper
- Prompt Tuning
  - [Paper - jian/BLIText)]\
  - [Paper - Adapter)]\
Contrastive Language-Image Pre-Training
- Intermediate Networks
  - [Paper
  - [Paper
  - [Paper - pytorch)] [[Video](https://www.youtube.com/watch?v=smUHQndcmOY)] \
- Simple Contrastive Learning Paradigms
  - [Paper
- Preference Alignment
  - [Paper - rlhf/LLaVA-RLHF)] [[Webpage](https://llava-rlhf.github.io/)]\
Universal Embedding Space
- Preference Alignment
  - [Paper - Bind_Point-LLM)]\
  - [Paper
Training Recipes
- Preference Alignment
  - [Paper - demo.hanlab.ai/)]\
  - PodGPT

Ecosyste.ms: Awesome

awesome-large-vision-language-model

Survey

Multimodal Large Language Models

Alignment Before Projection

Intermediate Networks

Feature-level Fusion

Linear Layers Projection

Prompt Tuning

Contrastive Language-Image Pre-Training

Intermediate Networks

Simple Contrastive Learning Paradigms

Preference Alignment

Universal Embedding Space

Preference Alignment

Training Recipes

Preference Alignment