Projects in Awesome Lists tagged with video-text-retrieval
A curated list of projects in awesome lists tagged with video-text-retrieval .
https://github.com/ArrowLuo/CLIP4Clip
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
activitynet clip didemo lsmdc msrvtt msvd multimodal multimodal-learning multimodality ranking retrieval retrieval-model search video-clip-retrieval video-text-retrieval
Last synced: 03 Apr 2025
https://github.com/microsoft/univl
An official implementation for " UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation"
alignment caption caption-task coin joint localization msrvtt multimodal-sentiment-analysis multimodality pretrain pretraining retrieval-task segmentation video video-language video-text video-text-retrieval youcookii
Last synced: 05 Apr 2025
https://github.com/salesforce/alpro
Align and Prompt: Video-and-Language Pre-training with Entity Prompts
prompt-learning representation-learning video-language video-question-answering video-text-retrieval vision-and-language
Last synced: 19 Dec 2024
https://github.com/m-bain/condensedmovies
Story-Based Retrieval with Contextual Embeddings. Largest freely available movie video dataset. [ACCV'20]
dataset precomputed-features retrieval source-videos video-text-retrieval
Last synced: 19 Dec 2024
https://github.com/alipay/ant-multi-modal-framework
Research Code for Multimodal-Cognition Team in Ant Group
image-text-retrieval multimodal-learning multimodal-llm video-editing video-text-retrieval
Last synced: 25 Apr 2025
https://github.com/amazon-science/crossmodal-contrastive-learning
CrossCLR: Cross-modal Contrastive Learning For Multi-modal Video Representations, ICCV 2021
computer-vision contrastive-learning multi-modality natural-language-processing transformers video video-captioning video-text-retrieval
Last synced: 03 May 2025