https://github.com/willyfh/msvd-indonesian
MSVD-Indonesian: A Benchmark for Multimodal Video-Text Tasks in Indonesian (Bahasa Indonesia).
bahasa-indonesia deep-learning indonesian-dataset msvd msvd-indonesian multimodal-dataset neural-network video-captioning video-description video-retrieval video-text
- Host: GitHub
- URL: https://github.com/willyfh/msvd-indonesian
- Owner: willyfh
- License: mit
- Created: 2023-04-29T08:43:46.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2023-08-04T13:13:08.000Z (over 1 year ago)
- Last Synced: 2023-08-05T22:23:25.048Z (over 1 year ago)
- Topics: bahasa-indonesia, deep-learning, indonesian-dataset, msvd, msvd-indonesian, multimodal-dataset, neural-network, video-captioning, video-description, video-retrieval, video-text
- Homepage: https://arxiv.org/abs/2306.11341
- Size: 2.55 MB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE.txt
# MSVD-Indonesian
[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/msvd-indonesian-a-benchmark-for-multimodal/video-retrieval-on-msvd-indonesian)](https://paperswithcode.com/sota/video-retrieval-on-msvd-indonesian?p=msvd-indonesian-a-benchmark-for-multimodal) [![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/msvd-indonesian-a-benchmark-for-multimodal/text-to-video-retrieval-on-msvd-indonesian)](https://paperswithcode.com/sota/text-to-video-retrieval-on-msvd-indonesian?p=msvd-indonesian-a-benchmark-for-multimodal) [![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/msvd-indonesian-a-benchmark-for-multimodal/video-to-text-retrieval-on-msvd-indonesian)](https://paperswithcode.com/sota/video-to-text-retrieval-on-msvd-indonesian?p=msvd-indonesian-a-benchmark-for-multimodal) [![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/msvd-indonesian-a-benchmark-for-multimodal/video-captioning-on-msvd-indonesian)](https://paperswithcode.com/sota/video-captioning-on-msvd-indonesian?p=msvd-indonesian-a-benchmark-for-multimodal)

MSVD-Indonesian ([paper](https://arxiv.org/abs/2306.11341)) is an Indonesian version of the MSVD dataset, obtained by translating the original English sentences with a machine translation service. The dataset can be used for multimodal video-text tasks, including text-to-video retrieval, video-to-text retrieval, and video captioning. Like the original English dataset, MSVD-Indonesian contains about 80k video-text pairs.
## Data
Indonesian (Bahasa Indonesia) sentences: [link](https://github.com/willyfh/msvd-indonesian/blob/main/data/MSVD-indonesian.txt)
Raw videos: [link](https://www.cs.utexas.edu/users/ml/clamp/videoDescription/YouTubeClips.tar)
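As a starting point for working with the sentence file, the sketch below groups sentences by video id. It is a minimal example under a stated assumption: each non-empty line is taken to be a video id, a single space, and then the sentence. This README does not document the file layout, so inspect `MSVD-indonesian.txt` before relying on it. The function name `group_sentences` and the sample ids and sentences are hypothetical and used only for illustration.

```python
from collections import defaultdict
from typing import Dict, Iterable, List


def group_sentences(lines: Iterable[str]) -> Dict[str, List[str]]:
    """Group sentences by video id.

    ASSUMPTION: each non-empty line looks like "<video_id> <sentence>".
    Adjust the split if the real file uses another layout.
    """
    by_video: Dict[str, List[str]] = defaultdict(list)
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        # Split on the first space only: the id has no spaces, the sentence does.
        video_id, _, sentence = line.partition(" ")
        if sentence:
            by_video[video_id].append(sentence.strip())
    return dict(by_video)


# Made-up sample lines (not taken from the dataset) to show the expected shape.
sample = [
    "vid001_0_10 seorang pria bermain gitar",
    "vid001_0_10 seorang pria memainkan alat musik",
    "",
    "vid002_5_15 seekor kucing sedang tidur",
]
pairs = group_sentences(sample)

# For the real file, pass an open file handle instead of the sample list:
#   with open("MSVD-indonesian.txt", encoding="utf-8") as f:
#       pairs = group_sentences(f)
```

Keeping all sentences for a video under one key suits both retrieval, where any sentence for a video counts as a match, and captioning, where the sentences serve as multiple references.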
## Qualitative Results
### Text-to-Video Retrieval
![Text-to-Video Retrieval](https://raw.githubusercontent.com/willyfh/msvd-indonesian/main/figures/qualitative-results-t2v-ret.png)
### Video-to-Text Retrieval
![Video-to-Text Retrieval](https://raw.githubusercontent.com/willyfh/msvd-indonesian/main/figures/qualitative-results-v2t-ret.png)
### Video Captioning
![Video Captioning](https://raw.githubusercontent.com/willyfh/msvd-indonesian/main/figures/qualitative-results-v2t-cap.png)
## Citation
If you find our work useful in your research, please cite:

```bibtex
@article{Hendria2023MSVDID,
title={{MSVD}-{I}ndonesian: A Benchmark for Multimodal Video-Text Tasks in Indonesian},
author={Willy Fitra Hendria},
journal={arXiv preprint arXiv:2306.11341},
year={2023}
}
```
## Acknowledgments
Our experimental results were obtained using resources from [X-CLIP](https://github.com/xuguohai/X-CLIP) and [VNS-GRU](https://github.com/WingsBrokenAngel/delving-deeper-into-the-decoder-for-video-captioning). We thank the original authors for open-sourcing their work.