https://github.com/deepseek-ai/DeepSeek-VL

DeepSeek-VL: Towards Real-World Vision-Language Understanding
https://github.com/deepseek-ai/DeepSeek-VL

foundation-models vision-language-model vision-language-pretraining

Last synced: over 1 year ago
JSON representation

DeepSeek-VL: Towards Real-World Vision-Language Understanding

awesome-deepseek - 🖼️ DeepSeek-VL - Vision-language model for real-world understanding (1.3B/7B), 4096 context length (Models / Multimodal Models)
awesome-github-projects - DeepSeek-VL - DeepSeek-VL: Towards Real-World Vision-Language Understanding ⭐4,135 `Python` (📦 Legacy & Inactive Projects)
StarryDivineSky - deepseek-ai/DeepSeek-VL - VL具备通用的多模态理解能力，能够在复杂场景下处理逻辑图、网页、公式识别、科学文献、自然图像和具身智能。 (其他_机器视觉 / 网络服务_其他)
Awesome-Segment-Anything - [code
AiTreasureBox - deepseek-ai/DeepSeek-VL - 11-03_3997_0](https://img.shields.io/github/stars/deepseek-ai/DeepSeek-VL.svg)|DeepSeek-VL: Towards Real-World Vision-Language Understanding| (Repos)
Awesome-LLM-VLM-Foundation-Models - DeepSeek‑VL
awesome-vision-ai-stack - DeepSeek-VL - Open multimodal reasoning models from DeepSeek. (Foundation VLMs / General-purpose open models)
awesome-open-source-llms - DeepSeek-VL (DeepSeek AI) - Open-source vision-language model family with compact sizes (1.3B and 4.5B parameters) using a Mixture of Experts architecture for efficiency. Combines a SigLIP-L vision encoder with DeepSeekMoE LLM, delivering strong reasoning capabilities particularly suited for scientific diagram analysis and technical vision tasks. ([Read more](/details/deepseek-vl-deepseek-ai.md)) `Open Source` `Mixture Of Experts` `Scientific Reasoning` (Multimodal Models)

ecosyste.ms