Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists tagged with cross-modal
A curated list of projects in awesome lists tagged with cross-modal .
https://github.com/jina-ai/discoart
đĒŠ Create Disco Diffusion artworks in one line
clip-guided-diffusion creative-ai creative-art cross-modal dalle diffusion disco-diffusion discodiffusion generative-art imgen latent-diffusion midjourney multimodal prompts stable-diffusion
Last synced: 15 Dec 2024
https://github.com/docarray/docarray
Represent, send, store and search multimodal data
cross-modal data-structures dataclass deep-learning docarray elasticsearch fastapi machine-learning multi-modal multimodal nearest-neighbor-search nested-data neural-search protobuf pydantic pytorch qdrant semantic-search weaviate
Last synced: 16 Dec 2024
https://github.com/towhee-io/examples
Analyze the unstructured data with Towhee, such as reverse image search, reverse video search, audio classification, question and answer systems, molecular search, etc.
audio-classification cross-modal embeddings image-classification machine-learning nlp video-tagging
Last synced: 21 Dec 2024
https://github.com/DRSY/MoTIS
[NAACL 2022]Mobile Text-to-Image search powered by multimodal semantic representation models(e.g., OpenAI's CLIP)
ai clip cross-modal image-search ios-swift k-means k-means-clustering knn knowledge-distillation lsh naacl random-projection retrieval semantic-search vector-search
Last synced: 15 Nov 2024
https://github.com/Zengyi-Qin/Weakly-Supervised-3D-Object-Detection
Weakly Supervised 3D Object Detection from Point Clouds (VS3D), ACM MM 2020
3d-object-detection acm-mm-2020 cross-modal kitti lidar monocular object-proposals point-cloud stereo tensorflow transfer-learning unsupervised-learning unsupervised-object-detection vs3d weakly-supervised-detection ws3d
Last synced: 27 Oct 2024
https://github.com/rohitrango/objects-that-sound
Unofficial Implementation of Google Deepmind's paper `Objects that Sound`
audio-video audioset cross-modal deep-learning deep-neural-networks deeplearning deepmind embeddings machine-learning
Last synced: 08 Nov 2024
https://github.com/qcraftai/distill-bev
DistillBEV: Boosting Multi-Camera 3D Object Detection with Cross-Modal Knowledge Distillation (ICCV 2023)
3d-object-detection autonomous-driving bev cross-modal distillation knowledge-distillation lidar multi-camera multi-modal nuscenes point-cloud self-driving
Last synced: 28 Oct 2024
https://github.com/Eaphan/UPIDet
Unleash the Potential of Image Branch for Cross-modal 3D Object Detection [NeurIPS2023]
3d-object-detection cross-modal multi-modal
Last synced: 28 Oct 2024
https://github.com/gt-ripl/xmodal-ctx
Official PyTorch implementation of our CVPR 2022 paper: Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual Context for Image Captioning
clip cross-modal image-captioning vision-and-language
Last synced: 14 Nov 2024
https://github.com/ovshake/cobra
Code for COBRA: Contrastive Bi-Modal Representation Algorithm (https://arxiv.org/abs/2005.03687)
contrastive-learning cross-modal machine-learning pytorch representation-learning
Last synced: 17 Nov 2024
https://github.com/petarv-/x-cnn
Cross-modal convolutional neural networks
convolutional-neural-networks cross-modal keras python
Last synced: 09 Nov 2024
https://github.com/buaadreamer/ccrk
[KDD 2024] Improving the Consistency in Cross-Lingual Cross-Modal Retrieval with 1-to-K Contrastive Learning
cross-lingual cross-lingual-retrieval cross-modal cross-modal-retrieval iglue image-text-retrieval image-text-search kdd2024 mscoco multi30k retrieval swin-transformer vision-language-pretraining wit xflickrco xlm-roberta
Last synced: 06 Dec 2024
https://github.com/prithivirajdamodaran/whatthefood
An intentionally simple Image to Food cross-modal search. Created by Prithiviraj Damodaran.
cross-modal cross-modal-learning cross-modal-retrieval multimodal
Last synced: 28 Nov 2024