Projects in Awesome Lists by X-PLUG
A curated list of projects in awesome lists by X-PLUG .
https://github.com/x-plug/mobileagent
Mobile-Agent: The Powerful Mobile Device Operation Assistant Family
agent android app automation copilot gpt4v gui harmony ios mllm mobile mobile-agents multimodal multimodal-agent multimodal-large-language-models
Last synced: 26 Jun 2025
https://github.com/X-PLUG/MobileAgent
Mobile-Agent: The Powerful Mobile Device Operation Assistant Family
agent android app automation copilot gpt4v gui harmony ios mllm mobile mobile-agents multimodal multimodal-agent multimodal-large-language-models
Last synced: 29 Apr 2025
https://github.com/x-plug/mplug-owl
mPLUG-Owl: The Powerful Multi-modal Large Language Model Family
alpaca chatbot chatgpt damo dialogue gpt gpt4 gpt4-api huggingface instruction-tuning large-language-models llama mplug mplug-owl multimodal pretraining pytorch transformer video visual-recognition
Last synced: 10 Apr 2025
https://github.com/X-PLUG/mPLUG-Owl
mPLUG-Owl: The Powerful Multi-modal Large Language Model Family
alpaca chatbot chatgpt damo dialogue gpt gpt4 gpt4-api huggingface instruction-tuning large-language-models llama mplug mplug-owl multimodal pretraining pytorch transformer video visual-recognition
Last synced: 19 Apr 2025
https://github.com/x-plug/mplug-docowl
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
chart-understanding document-understanding mllm multimodal multimodal-large-language-models table-understanding
Last synced: 14 May 2025
https://github.com/X-PLUG/mPLUG-DocOwl
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
chart-understanding document-understanding mllm multimodal multimodal-large-language-models table-understanding
Last synced: 11 May 2025
https://github.com/x-plug/cvalues
面向中文大模型价值观的评估与对齐研究
benchmark chinese-llms evaluation human-values llms multi-choice responsibility safety
Last synced: 26 Jun 2025
https://github.com/X-PLUG/CValues
面向中文大模型价值观的评估与对齐研究
benchmark chinese-llms evaluation human-values llms multi-choice responsibility safety
Last synced: 09 May 2025
https://github.com/x-plug/chatplug
A Chinese Open-Domain Dialogue System
chat chatbot chatgpt chinese dialogue encoder-decoder instruction-finetuning knowledge-augment large-language-models open-domain-dialogue-system personality pretraining
Last synced: 26 Jun 2025
https://github.com/x-plug/youku-mplug
Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Pre-training Dataset and Benchmarks
benchmark chinese dataset mllm multimodal multimodal-large-language-models multimodal-pretraining video video-question-answering video-retrieval youku
Last synced: 26 Jun 2025
https://github.com/X-PLUG/ChatPLUG
A Chinese Open-Domain Dialogue System
chat chatbot chatgpt chinese dialogue encoder-decoder instruction-finetuning knowledge-augment large-language-models open-domain-dialogue-system personality pretraining
Last synced: 09 May 2025
https://github.com/X-PLUG/Youku-mPLUG
Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Pre-training Dataset and Benchmarks
benchmark chinese dataset mllm multimodal multimodal-large-language-models multimodal-pretraining video video-question-answering video-retrieval youku
Last synced: 20 Apr 2025
https://github.com/x-plug/mplug-2
mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video (ICML 2023)
foundation-models image-retrieval mllm mplug multimodal multimodal-pretraining video video-question-answering video-retrieval vqa
Last synced: 09 Sep 2025
https://github.com/x-plug/writingbench
WritingBench: A Comprehensive Benchmark for Generative Writing
ai benchmark evaluation-framework huggingface llm long-context long-text nlp text-generation writing
Last synced: 01 Sep 2025
https://github.com/x-plug/mplug-halowl
mPLUG-HalOwl: Multimodal Hallucination Evaluation and Mitigating
benchmark contrastive-learning hallucinations mllm multimodal-hallucination multimodal-large-language-models
Last synced: 31 Aug 2025
https://github.com/x-plug/mplug
mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections. (EMNLP 2022)
image-captioning image-text image-text-retrieval multimodal pretraining pytorch transformer visual-language vqa
Last synced: 26 Jun 2025
https://github.com/x-plug/socialbench
RoleInteract: Evaluating the Social Interaction of Role-Playing Agents
Last synced: 26 Jun 2025