Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Awesome-Video-LLMs
Explore VLM-Eval, a framework for evaluating Video Large Language Models, enhancing your video analysis with cutting-edge AI technology.
https://github.com/zyayoung/Awesome-Video-LLMs
Last synced: 3 days ago
JSON representation
-
Awesome Video Large Language Models π₯π¦
- Paper - NLP-SG/Video-LLaMA) [Demo](https://huggingface.co/spaces/DAMO-NLP-SG/Video-LLaMA)
- Paper - YuanGroup/Video-LLaVA) [Demo](https://huggingface.co/spaces/LanguageBind/Video-LLaVA)
- Paper - research/LLaMA-VID) [Demo](http://103.170.5.190:7864/)
- Paper - research/LLaMA-VID) [Demo](http://103.170.5.190:7864/)
- Paper - Anything)
- Paper - oryx/Video-ChatGPT)
- Paper - YuanGroup/Video-LLaVA) [Demo](https://huggingface.co/spaces/LanguageBind/Video-LLaVA)
- Paper - Anything/tree/main/video_chat2) [Demo](https://huggingface.co/spaces/OpenGVLab/VideoChat2)
-
Datasets πΎπ
-
Results
-
Join the Awesome Video Large Language Models Community ππ€
-
Action Recognition
-
-
Get Started with VLM-Eval: Your Journey Begins Here ππ¨βπ»
Programming Languages
Categories
Sub Categories
Keywords
instruction-tuning
3
large-language-models
3
large-vision-language-model
2
multi-modality
2
foundation-models
2
chatgpt
2
llama
2
blip2
1
cross-modal-pretraining
1
video-understanding
1
video-question-answering
1
video
1
stablelm
1
large-model
1
langchain
1
gradio
1
minigpt4
1
multi-modal-chatgpt
1
chat
1
captioning-videos
1
big-model
1
vision-language-pretraining
1
video-language-pretraining
1
visual-language-learning
1
vision-language-model
1
multimodal
1
llava
1
llama2
1
llama-2
1
gpt-4
1
chatbot
1
visual-instruction-tuning
1
visual-in-context-learning
1
visual-chain-of-thought
1
multimodal-large-language-models
1
multimodal-instruction-tuning
1
multimodal-in-context-learning
1
multimodal-chain-of-thought
1
large-vision-language-models
1
instruction-following
1
in-context-learning
1
chain-of-thought
1
multi-modal
1