https://github.com/pleaseconnectwifi/DANCE

PyTorch code for Improving Commonsense in Vision-Language Models via Knowledge Graph Riddles (DANCE)

# Improving Commonsense in Vision-Language Models via Knowledge Graph Riddles

![](imgs/fig1.gif)

### [Paper (ArXiv)]() | [Project Page](https://shuquanye.com/DANCE_website) | [Pre-trained Models]()

[Shuquan Ye](https://shuquanye.com/)<sup>2</sup>, [Yujia Xie](https://sites.google.com/view/yujia?pli=1)<sup>1</sup>, [Dongdong Chen](https://www.dongdongchen.bid/)<sup>1</sup>, [Yichong Xu](https://xycking.wixsite.com/yichongxu)<sup>1</sup>, [Lu Yuan](https://www.microsoft.com/en-us/research/people/luyuan/)<sup>1</sup>, [Chenguang Zhu](https://www.microsoft.com/en-us/research/people/chezhu/)<sup>1</sup>, [Jing Liao](https://liaojing.github.io/html/)<sup>2</sup>

<sup>1</sup>Microsoft, <sup>2</sup>City University of Hong Kong

This is the PyTorch code for the [DANCE](https://shuquanye.com/DANCE_website) [\[paper\]](). The code is written for PyTorch 1.11. Pre-training with our code requires 4 nodes, each with 8 A100 GPUs.
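To make the hardware requirement concrete, here is a minimal sketch of the process layout that a 4-node, 8-GPU-per-node job implies. It is not part of the DANCE code base: the helper names `world_size` and `global_rank` are hypothetical, and it assumes one process per GPU with a node-major rank layout, which is a common convention but may differ from what the released training scripts use.

```python
# Hypothetical helpers for illustration only; not part of the DANCE repository.
NUM_NODES = 4      # from the README: 4 nodes
GPUS_PER_NODE = 8  # from the README: 8 A100 GPUs per node


def world_size(num_nodes: int = NUM_NODES, gpus_per_node: int = GPUS_PER_NODE) -> int:
    """Total number of processes, assuming one process drives one GPU."""
    return num_nodes * gpus_per_node


def global_rank(node_rank: int, local_rank: int, gpus_per_node: int = GPUS_PER_NODE) -> int:
    """Global rank under a node-major layout (an assumption, see above)."""
    if node_rank < 0:
        raise ValueError("node_rank must be non-negative")
    if not 0 <= local_rank < gpus_per_node:
        raise ValueError("local_rank must be in [0, gpus_per_node)")
    return node_rank * gpus_per_node + local_rank


if __name__ == "__main__":
    print(world_size())       # 4 * 8 = 32 processes in total
    print(global_rank(3, 7))  # last GPU on the last node: 3 * 8 + 7 = 31
```

Under these assumptions the job spans 32 processes, so any per-GPU batch size is multiplied by 32 to obtain the effective global batch size.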

Catalog:

- [ ] Code for DANCE-augmented Pre-training

- [ ] Code for DANCE-augmented Fine-tuning

- [ ] Code for Image-Text Retrieval, OK-VQA

- [ ] Download of Pre-trained and Fine-tuned Checkpoints

## BibTeX

```

```