Projects in Awesome Lists tagged with minigpt4
A curated list of projects in awesome lists tagged with minigpt4 .
https://github.com/damo-nlp-sg/video-llama
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
blip2 cross-modal-pretraining large-language-models llama minigpt4 multi-modal-chatgpt video-language-pretraining vision-language-pretraining
Last synced: 15 May 2025
https://github.com/DAMO-NLP-SG/Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
blip2 cross-modal-pretraining large-language-models llama minigpt4 multi-modal-chatgpt video-language-pretraining vision-language-pretraining
Last synced: 24 Mar 2025
https://github.com/Maknee/minigpt4.cpp
Port of MiniGPT4 in C++ (4bit, 5bit, 6bit, 8bit, 16bit CPU inference with GGML)
c cpp deep-learning ggml machine-learning minigpt4 multimodal quantization
Last synced: 15 Apr 2025
https://github.com/paddlepaddle/paddlemix
Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high performance and flexibility.
aigc blip2 clip controlnet dit eva-clip image-to-text llava minigpt4 multimodal ppdiffusers qwen-vl sd-xl sora stable-diffusion stablevideodiffusion text-to-image text-to-video
Last synced: 04 Apr 2025
https://github.com/buaadreamer/chinese-llava-med
中文医学多模态大模型 Large Chinese Language-and-Vision Assistant for BioMedicine
ai chinese gpt4v huggingface-datasets llama-factory llava medical minigpt4 mllm multimodal qwen1-5 transformers
Last synced: 05 Jan 2025
https://github.com/abdur75648/medicalgpt
Medical Report Generation And VQA (Adapting XrayGPT to Any Modality)
chatgpt chatgpt4o llama llm llms medical-dataset medical-imaging medical-report-generation medicalgpt minigpt4 multimodal-llm vicuna vqa vqa-dataset xraygpt
Last synced: 31 Mar 2025
https://github.com/neemiasbsilva/minigpt4-image-caption-generation
Streamline the creation of supervised datasets to facilitate data augmentation for deep learning architectures focused on image captioning. The core framework leverages MiniGPT-4, complemented by the pre-trained Vicuna model, which boasts 13 billion parameters.
caption image-caption-generator minigpt4
Last synced: 23 Feb 2025