Unified Multi-modal IAA Baseline and Benchmark
https://github.com/kwaivgi/uniaa
- Host: GitHub
- URL: https://github.com/kwaivgi/uniaa
- Owner: KwaiVGI
- Created: 2024-03-15T08:50:53.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2024-09-27T10:29:06.000Z (about 1 year ago)
- Last Synced: 2025-03-17T06:06:07.713Z (7 months ago)
- Topics: benchmark, dataset, image-aesthetic-assessment, llava, mllm
- Language: Python
- Homepage:
- Size: 9.12 MB
- Stars: 74
- Watchers: 5
- Forks: 5
- Open Issues: 4
Metadata Files:
- Readme: README.md
README
# UNIAA: A Unified Multi-modal Image Aesthetic Assessment Baseline and Benchmark
The Unified Multi-modal Image Aesthetic Assessment Framework, containing a baseline (a) and a benchmark (b). The aesthetic perception performance of UNIAA-LLaVA and other MLLMs is shown in (c).
The IAA Datasets Conversion Paradigm for UNIAA-LLaVA.
The UNIAA-Bench overview. (a) UNIAA-QA contains 5354 Image-Question-Answer samples and (b) UNIAA-Describe contains 501 Image-Description samples. (c) For open-source MLLMs, Logits can be extracted to calculate the score.
## Release
- [9/25] 🔥 Our [UNIAA](https://huggingface.co/datasets/zkzhou/UNIAA) data is released! The corresponding fine-tuning and evaluation code can be found in this GitHub repository.
- [4/15] 🔥 We built the UNIAA project page!
## Performance
### Aesthetic Perception Performance
### Aesthetic Description Performance
### Aesthetic Assessment Performance
#### Zero-shot
#### Supervised learning on AVA and TAD66K
## Training on the UNIAA Data
#### Step 1: Download Images and JSON files
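The data is hosted on the Hugging Face Hub (see the Release note above). A minimal download sketch using `huggingface_hub`, assuming the whole `zkzhou/UNIAA` dataset repository is fetched into a local `UNIAA/` folder; the exact file layout inside the repository may differ:

```python
from huggingface_hub import snapshot_download

# Fetch the whole UNIAA dataset repository (images + JSON annotation files)
# into a local "UNIAA/" folder. The repo id comes from the Release note above;
# the local directory name is an arbitrary choice.
local_path = snapshot_download(
    repo_id="zkzhou/UNIAA",
    repo_type="dataset",
    local_dir="UNIAA",
)
print("UNIAA data downloaded to:", local_path)
```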
#### Step 2: Training on a Specific MLLM
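UNIAA-LLaVA is fine-tuned from LLaVA, so the training JSON is expected to follow LLaVA's conversation format. The sketch below only sanity-checks one sample against that format; the file name `uniaa_train.json` and the field names are assumptions, so adapt them to the files downloaded in Step 1:

```python
import json

# Hypothetical file name; point this at the training JSON obtained in Step 1.
with open("UNIAA/uniaa_train.json") as f:
    samples = json.load(f)

# A LLaVA-style instruction sample: an image reference plus human/gpt turns.
sample = samples[0]
print("image:", sample.get("image"))
for turn in sample.get("conversations", []):
    print(turn["from"], ":", turn["value"][:80])
```

The actual fine-tuning is launched with the training scripts of the chosen MLLM (for UNIAA-LLaVA, the LLaVA codebase), pointing its data path at this JSON and its image folder at the images from Step 1.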
## Test on UNIAA-Bench
### For Aesthetic Perception
#### Step 1: Download Images and JSON files
#### Step 2: Run the inference code
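The repository ships its own inference code; purely as an illustration, the sketch below runs a stock `llava-hf/llava-1.5-7b-hf` checkpoint from `transformers` on one UNIAA-QA style question. The JSON file name and field names are assumptions:

```python
import json
import torch
from PIL import Image
from transformers import AutoProcessor, LlavaForConditionalGeneration

model_id = "llava-hf/llava-1.5-7b-hf"  # any open-source MLLM under evaluation
processor = AutoProcessor.from_pretrained(model_id)
model = LlavaForConditionalGeneration.from_pretrained(
    model_id, torch_dtype=torch.float16, device_map="auto"
)

# Hypothetical file/field names; adapt to the UNIAA-QA JSON from Step 1.
with open("UNIAA/uniaa_qa.json") as f:
    item = json.load(f)[0]

prompt = f"USER: <image>\n{item['question']}\nASSISTANT:"
image = Image.open(item["image"])  # may need to be joined with the image folder
inputs = processor(text=prompt, images=image, return_tensors="pt").to(model.device, torch.float16)

output = model.generate(**inputs, max_new_tokens=16)
answer = processor.decode(output[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True)
print(answer)
```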
#### Step 3: Calculate the score
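According to the benchmark overview, the repository extracts logits from open-source MLLMs to score the answers; as a simplified stand-in, the sketch below just compares generated answer letters with the ground-truth choices and reports accuracy. The prediction file and its field names are assumptions:

```python
import json

# Hypothetical output of Step 2: one record per question holding the model's
# generated answer and the ground-truth choice letter.
with open("uniaa_qa_predictions.json") as f:
    records = json.load(f)

correct = sum(
    r["prediction"].strip().upper().startswith(r["answer"].strip().upper())
    for r in records
)
print(f"Perception accuracy: {correct / len(records):.4f} over {len(records)} questions")
```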
### For Aesthetic Description
#### Step 1: Download Images and JSON files
#### Step 2: Run the inference code
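For the description track the model generates free-form text instead of a choice letter. A minimal sketch under the same assumptions as the perception example (stock `llava-hf/llava-1.5-7b-hf` checkpoint, hypothetical file and field names):

```python
import json
import torch
from PIL import Image
from transformers import AutoProcessor, LlavaForConditionalGeneration

model_id = "llava-hf/llava-1.5-7b-hf"  # or your own fine-tuned checkpoint
processor = AutoProcessor.from_pretrained(model_id)
model = LlavaForConditionalGeneration.from_pretrained(
    model_id, torch_dtype=torch.float16, device_map="auto"
)

# Hypothetical file/field names; adapt to the UNIAA-Describe JSON from Step 1.
with open("UNIAA/uniaa_describe.json") as f:
    items = json.load(f)

results = []
for item in items:
    prompt = "USER: <image>\nDescribe the aesthetics of this image in detail.\nASSISTANT:"
    image = Image.open(item["image"])
    inputs = processor(text=prompt, images=image, return_tensors="pt").to(model.device, torch.float16)
    output = model.generate(**inputs, max_new_tokens=256)
    text = processor.decode(output[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True)
    results.append({"image": item["image"], "description": text})

with open("uniaa_describe_predictions.json", "w") as f:
    json.dump(results, f, indent=2)
```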
## Citation
If you find UNIAA useful for your research and applications, please cite using this BibTeX:
```bibtex
@misc{zhou2024uniaa,
title={UNIAA: A Unified Multi-modal Image Aesthetic Assessment Baseline and Benchmark},
author={Zhaokun Zhou and Qiulin Wang and Bin Lin and Yiwei Su and Rui Chen and Xin Tao and Amin Zheng and Li Yuan and Pengfei Wan and Di Zhang},
year={2024},
eprint={2404.09619},
archivePrefix={arXiv},
primaryClass={cs.CV}
}
```
## Contact
If you have any questions, please feel free to email wangqiulin@kuaishou.com and zhouzhaokun@stu.pku.edu.cn.