Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists by mbzuai-oryx
A curated list of projects in awesome lists by mbzuai-oryx .
https://github.com/mbzuai-oryx/video-chatgpt
[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.
chatbot clip gpt-4 llama llava mulit-modal vicuna video-chatboat video-conversation vision-language vision-language-pretraining
Last synced: 24 Sep 2024
https://github.com/mbzuai-oryx/llava-pp
🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)
conversation llama-3-llava llama-3-vision llama3 llama3-llava llama3-vision llava llava-llama3 llava-phi3 llm lmms phi-3-llava phi-3-vision phi3 phi3-llava phi3-vision vision-language
Last synced: 24 Sep 2024
https://github.com/mbzuai-oryx/LLaVA-pp
🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)
conversation llama-3-llava llama-3-vision llama3 llama3-llava llama3-vision llava llava-llama3 llava-phi3 llm lmms phi-3-llava phi-3-vision phi3 phi3-llava phi3-vision vision-language
Last synced: 01 Aug 2024
https://github.com/mbzuai-oryx/MobiLlama
MobiLlama : Small Language Model tailored for edge devices
efficient-llm llm mobile-llm slm tiny-llm
Last synced: 01 Aug 2024
https://github.com/mbzuai-oryx/xraygpt
[BIONLP@ACL 2024] XrayGPT: Chest Radiographs Summarization using Medical Vision-Language Models.
Last synced: 02 Aug 2024
https://github.com/mbzuai-oryx/XrayGPT
XrayGPT: Chest Radiographs Summarization using Medical Vision-Language Models.
Last synced: 31 Jul 2024
https://github.com/mbzuai-oryx/videogpt-plus
Official Repository of paper VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding
chatbot clip dual-encoder gpt4 gpt4o image-encoder llama3 llava multimodal phi-3-mini vicuna video-chatbot video-conversation video-encoder vision-language vision-language-pretraining
Last synced: 24 Sep 2024