Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/amazon-science/crossmodal-contrastive-learning

CrossCLR: Cross-modal Contrastive Learning For Multi-modal Video Representations, ICCV 2021
https://github.com/amazon-science/crossmodal-contrastive-learning

computer-vision contrastive-learning multi-modality natural-language-processing transformers video video-captioning video-text-retrieval

Last synced: 2 days ago
JSON representation

CrossCLR: Cross-modal Contrastive Learning For Multi-modal Video Representations, ICCV 2021

Awesome Lists containing this project

README

        

# CrossCLR - ICCV 2021




This is the official implementation of paper:

### CrossCLR: Cross-modal Contrastive Learning For Multi-modal Video Representations [[Paper]](https://arxiv.org/abs/2103.00020)

Authors:
[Mohammadreza Zolfaghari](https://mzolfaghari.github.io/),
[Yi Zhu](https://bryanyzhu.github.io/),
[Peter Gehler](http://gehler.io/),
[Thomas Brox](https://lmb.informatik.uni-freiburg.de/people/brox/index.html),

## Update

##### [Dec 2021] CrossCLR-onlyIntraModality released
## Loss Function
The loss function [`CrossCLR`](https://github.com/amazon-research/crossmodal-contrastive-learning) in `loss.py` takes `video features` and `text features` as input, and return the loss.

Usage:
```python
from trainer.loss import CrossCLR_onlyIntraModality

# define loss with a temperature `temp` and weights for negative samples `w`
criterion = CrossCLR_onlyIntraModality(temperature=temp, negative_weight=w)

# features: [bsz, f_dim]
video_features = ...
text_features = ...

# CrossCLR
loss = criterion(video_features, text_features)

...
```

## Qualitative samples



## Reference
```
@article{crossclr_aws_21,
author = {Mohammadreza Zolfaghari and
Yi Zhu and
Peter V. Gehler and
Thomas Brox},
title = {CrossCLR: Cross-modal Contrastive Learning For Multi-modal Video Representations},
url = {https://arxiv.org/abs/2109.14910},
eprinttype = {arXiv},
booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
month = {October},
year = {2021},
}
```

## Security

See [CONTRIBUTING](CONTRIBUTING.md#security-issue-notifications) for more information.

## License

This project is licensed under the Apache-2.0 License.