Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/vision-cair/chatcaptioner

Official Repository of ChatCaptioner
https://github.com/vision-cair/chatcaptioner

Last synced: 1 day ago
JSON representation

Official Repository of ChatCaptioner

Awesome Lists containing this project

README

        

# Interactive ChatCaptioner for image and video

Official repository of **ChatCaptioner** and **Video ChatCaptioner**.

ChatCaptioner paper [ChatGPT Asks, BLIP-2 Answers: Automatic Questioning Towards Enriched Visual Descriptions](https://arxiv.org/abs/2303.06594)

Video ChatCaptioner paper [Video ChatCaptioner: Towards the Enriched Spatiotemporal Descriptions](https://arxiv.org/abs/2304.04227)

## Demo
![demo1](ChatCaptioner/demo_pic/demo1.gif)
![demo2](ChatCaptioner/demo_pic/demo2.gif)
![demo3](Video_ChatCaptioner/demo_pic/dance.gif)
![demo4](Video_ChatCaptioner/demo_pic/skating.gif)

* ChatCaptiners:
* ChatCaptioner for images: [ChatCaptioner](ChatCaptioner/README.md)
* ChatCaptioner for videos: [Video ChatCaptioner](Video_ChatCaptioner/README.md)

## Acknowledgement

+ [ChatGPT](https://openai.com/blog/chatgpt/)
+ [BLIP2](https://huggingface.co/docs/transformers/main/model_doc/blip-2)

Please cite ChatCaptioner and Video ChatCaptioner from the following bibtex

```
@article{zhu2023chatgpt,
title={ChatGPT Asks, BLIP-2 Answers: Automatic Questioning Towards Enriched Visual Descriptions},
author={Deyao Zhu and Jun Chen and Kilichbek Haydarov and Xiaoqian Shen and Wenxuan Zhang and Mohamed Elhoseiny},
journal={arXiv preprint arXiv:2303.06594},
year={2023}
}
```

```
@article{chen2023video,
title={Video ChatCaptioner: Towards the Enriched Spatiotemporal Descriptions},
author={Jun Chen and Deyao Zhu and Kilichbek Haydarov and Xiang Li and Mohamed Elhoseiny},
journal={arXiv preprint arXiv:2304.04227},
year={2023}
}
```

## *License*

ChatCaptioner and Video ChatCaptioner are released under the [MIT license](LICENSE).