https://github.com/datacanvasio/mmalaya
- Host: GitHub
- URL: https://github.com/datacanvasio/mmalaya
- Owner: DataCanvasIO
- Created: 2024-01-23T11:11:35.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2024-12-27T08:08:51.000Z (over 1 year ago)
- Last Synced: 2025-03-27T20:51:14.404Z (about 1 year ago)
- Language: Python
- Size: 191 KB
- Stars: 8
- Watchers: 9
- Forks: 1
- Open Issues: 1
Metadata Files:
- Readme: README.md
README
# MMAlaya
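MMAlaya is a multimodal model built on the large language model [Alaya](https://github.com/DataCanvasIO/Alaya). The model weights are available at [DataCanvas/MMAlaya](https://huggingface.co/DataCanvas/MMAlaya/tree/main).
MMAlaya consists of the following three modules:
1. The large language model [Alaya-7B-Chat](https://huggingface.co/DataCanvas/Alaya-7B-Chat).
2. The image-text feature encoder, EVA-G, taken from [blip2-opt-2.7b](https://huggingface.co/Salesforce/blip2-opt-2.7b).
3. The connector from image-text features to the large language model, which uses the Q-Former and linear projector from [blip2-opt-2.7b](https://huggingface.co/Salesforce/blip2-opt-2.7b).
Training is mainly based on the [LLaVA](https://github.com/haotian-liu/LLaVA) architecture.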
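The way the connector hands image features to the language model can be illustrated with a toy sketch. This is not taken from the MMAlaya code: the helper names `linear_project` and `splice_image_tokens`, the tiny dimensions, and the single-placeholder convention are assumptions chosen for illustration, following the general LLaVA-style idea of replacing an image placeholder in the text embedding sequence with projected image tokens.

```python
from typing import List

Vector = List[float]


def linear_project(tokens: List[Vector], weight: List[Vector], bias: Vector) -> List[Vector]:
    """Apply y = W x + b to every token; weight has shape (out_dim, in_dim)."""
    return [
        [sum(w * x for w, x in zip(row, token)) + b for row, b in zip(weight, bias)]
        for token in tokens
    ]


def splice_image_tokens(
    text_embeds: List[Vector], image_embeds: List[Vector], image_pos: int
) -> List[Vector]:
    """Replace the placeholder embedding at image_pos with the image tokens."""
    return text_embeds[:image_pos] + image_embeds + text_embeds[image_pos + 1:]


# Toy sizes (hypothetical): 2 query tokens of width 3, LLM hidden width 2.
# A real Q-Former emits more and wider tokens; the shapes here are illustrative only.
qformer_out = [[1.0, 0.0, 2.0], [0.0, 1.0, 1.0]]
weight = [[1.0, 0.0, 0.0], [0.0, 1.0, 1.0]]
bias = [0.5, -0.5]

# Step 1: project the Q-Former output into the language model's embedding width.
image_embeds = linear_project(qformer_out, weight, bias)

# Step 2: splice the projected tokens in place of the image placeholder (index 1).
text_embeds = [[0.1, 0.1], [0.0, 0.0], [0.2, 0.2]]
llm_inputs = splice_image_tokens(text_embeds, image_embeds, image_pos=1)

print(len(llm_inputs))  # sequence grows from 3 to 4 embeddings
```

In the real model the image encoder and Q-Former produce the tokens, and the projector is a learned layer; the sketch only shows how the shapes line up before the sequence reaches the language model.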
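On the OpenCompass leaderboard, MMAlaya has an average score of 41.1 and ranks 25th.

On the MMBench leaderboard (open-source models, Chinese test set), MMAlaya has an average score of 58.6 and ranks 25th.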

For inference, see [inference.py](https://github.com/DataCanvasIO/MMAlaya/blob/main/inference.py).
Please use transformers==4.33.0; other versions fail with `ImportError: cannot import name '_expand_mask' from 'transformers.models.bloom.modeling_bloom'`.
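To catch a version mismatch before the import error appears, a small pre-flight check can help. The following is a minimal sketch, not part of the repository: the function name `check_transformers_version` is hypothetical, and it uses only the standard-library `importlib.metadata` module to read the installed version.

```python
from importlib import metadata

REQUIRED = "4.33.0"  # version pinned by the README


def check_transformers_version(required: str = REQUIRED) -> str:
    """Return a human-readable status for the installed transformers package."""
    try:
        installed = metadata.version("transformers")
    except metadata.PackageNotFoundError:
        return f"transformers is not installed; run: pip install transformers=={required}"
    if installed != required:
        return f"found transformers {installed}; expected {required}"
    return f"transformers {installed} OK"


print(check_transformers_version())
```

Running this before `inference.py` reports whether the environment matches the pinned version.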
# Citation
MMAlaya is released under the Apache 2.0 license, with open model weights and commercial use permitted. If your project uses MMAlaya, please cite it and give us a star :star: :
```
@misc{datacanvas2024mmalaya,
author = {DataCanvas Ltd.},
title = {mmalaya},
year = {2024},
howpublished = {\url{https://github.com/DataCanvasIO/MMAlaya}},
}
```