An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with image-text

A curated list of projects in awesome lists tagged with image-text .

https://github.com/Sense-GVT/DeCLIP

Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm

big-model clip image-text multi-model self-supervised vision-language-pretraining zero-shot

Last synced: 03 Apr 2025

https://github.com/x-plug/mplug

mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections. (EMNLP 2022)

image-captioning image-text image-text-retrieval multimodal pretraining pytorch transformer visual-language vqa

Last synced: 26 Jun 2025

https://github.com/TheoCoombes/crawlingathome

A client library for LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset.

clip dall-e dataset dataset-generation image-text machine-learning

Last synced: 03 Apr 2025

https://github.com/antonlukin/poster-editor

Wrapper for PHP's GD Library for easy image manipulation. Support for scaling multi-line text, shapes, filters and smart resize.

composer image-processing image-text intervention php php-class php-gd php-image php-library poster-editor

Last synced: 07 Mar 2026

https://github.com/huangrunhua/livetextwithimage

WWDC22: Enabling Live Text interactions with images in SwiftUI

image-processing image-text live-text swift swiftui swiftui-demo swiftui-example wwdc wwdc22

Last synced: 12 Oct 2025

https://github.com/zabir-nabil/imagebert-keras

Keras implementation of ImageBERT from Microsoft

image-text imagebert keras

Last synced: 26 Jul 2025

https://github.com/dinhanhx/visualroberta

The first public Vietnamese visual linguistic foundation model(s)

image-captioning image-text python python-3 python3 vietnamese-nlp visual-linguistic visual-question-answering

Last synced: 13 May 2025

https://github.com/jianzhnie/multimodaltransformers

lmmtoolkit is a toolkit for Multi-Modal Learning

image-text multi-modal-learning text-image text-to-video

Last synced: 15 Sep 2025