An open API service indexing awesome lists of open source software.

https://github.com/shreydan/visiongpt2

Combining ViT and GPT-2 for image captioning. Trained on MS-COCO. The model was implemented mostly from scratch.
https://github.com/shreydan/visiongpt2

gpt image-captioning multimodal pytorch transformers vit

Last synced: 5 months ago
JSON representation

Combining ViT and GPT-2 for image captioning. Trained on MS-COCO. The model was implemented mostly from scratch.

Awesome Lists containing this project