Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
https://github.com/OpenBMB/MiniCPM-V
minicpm minicpm-v multi-modal
Last synced: about 1 month ago
JSON representation
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
- Host: GitHub
- URL: https://github.com/OpenBMB/MiniCPM-V
- Owner: OpenBMB
- License: apache-2.0
- Created: 2024-01-29T05:30:33.000Z (11 months ago)
- Default Branch: main
- Last Pushed: 2024-10-22T08:08:52.000Z (about 2 months ago)
- Last Synced: 2024-10-29T15:04:05.531Z (about 1 month ago)
- Topics: minicpm, minicpm-v, multi-modal
- Language: Python
- Homepage:
- Size: 302 MB
- Stars: 12,410
- Watchers: 102
- Forks: 871
- Open Issues: 81
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- awesome-llm - MiniCPM-V 2.6 - 一款可以在手机上使用的 GPT-4V 级别的 MLLM,支持单图、多图和视频处理。 (热门 LLM 项目)
- awesome-llm - MiniCPM-V 2.6 - 一款可以在手机上使用的 GPT-4V 级别的 MLLM,支持单图、多图和视频处理。 (热门 LLM 项目)
- ai-game-devtools - MiniCPM-Llama3-V 2.5 - 4V Level MLLM on Your Phone. | | | Visual | (<span id="visual">Visual</span> / <span id="tool">Tool (AI LLM)</span>)
- Awesome-Chinese-LLM - MiniCPM-Llama3-V 2.5
- StarryDivineSky - OpenBMB/MiniCPM-V - V 2.8B:可在终端设备上部署的先进多模态大模型。最新发布的 MiniCPM-V 2.0 可以接受 180 万像素的任意长宽比图像输入,实现了和 Gemini Pro 相近的场景文字识别能力以及和 GPT-4V 相匹的低幻觉率。OmniLMM-12B:相比同规模其他模型在多个基准测试中具有领先性能,实现了相比 GPT-4V 更低的幻觉率。 (其他_机器视觉 / 网络服务_其他)
- Awesome-LLM - MiniCPM-V 2.6 - A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone (Trending LLM Projects)
- AiTreasureBox - OpenBMB/MiniCPM-V - 12-07_12795_1](https://img.shields.io/github/stars/OpenBMB/MiniCPM-V.svg)|MiniCPM-V 2.0: An Efficient End-side MLLM with Strong OCR and Understanding Capabilities| (Repos)