https://github.com/willyfh/msvd-indonesian
MSVD-Indonesian: A Benchmark for Multimodal Video-Text Tasks in Indonesian (Bahasa Indonesia).
- Host: GitHub
- URL: https://github.com/willyfh/msvd-indonesian
- Owner: willyfh
- License: mit
- Created: 2023-04-29T08:43:46.000Z (about 2 years ago)
- Default Branch: main
- Last Pushed: 2023-08-04T13:13:08.000Z (almost 2 years ago)
- Last Synced: 2025-02-13T21:16:28.109Z (3 months ago)
- Topics: bahasa-indonesia, deep-learning, indonesian-dataset, msvd, msvd-indonesian, multimodal-dataset, neural-network, video-captioning, video-description, video-retrieval, video-text
- Homepage: https://arxiv.org/abs/2306.11341
- Size: 2.55 MB
- Stars: 2
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE.txt
README
# MSVD-Indonesian
MSVD-Indonesian ([paper](https://arxiv.org/abs/2306.11341)) is derived from the MSVD dataset by translating its sentences into Indonesian with the help of a machine translation service. The dataset can be used for multimodal video-text tasks, including text-to-video retrieval, video-to-text retrieval, and video captioning. Like the original English dataset, MSVD-Indonesian contains about 80k video-text pairs.
## Data
- Indonesian (Bahasa Indonesia) sentences: [MSVD-indonesian.txt](https://github.com/willyfh/msvd-indonesian/blob/main/data/MSVD-indonesian.txt)
- Raw videos: [YouTubeClips.tar](https://www.cs.utexas.edu/users/ml/clamp/videoDescription/YouTubeClips.tar)
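
To work with the sentences file, it usually helps to group the sentences by video first. The snippet below is a minimal sketch of that step, not code from this repository. It assumes each non-empty line has the form `<video_id> <sentence>`, with the ID separated from the sentence by the first whitespace; please check the file and adjust the parsing if its layout differs. The function names `parse_captions` and `load_captions` are hypothetical helpers introduced for illustration.

```python
from collections import defaultdict
from typing import Dict, Iterable, List


def parse_captions(lines: Iterable[str]) -> Dict[str, List[str]]:
    """Group sentences by video ID.

    ASSUMPTION: each non-empty line looks like "<video_id> <sentence>",
    where the first whitespace separates the ID from the sentence.
    """
    captions: Dict[str, List[str]] = defaultdict(list)
    for line in lines:
        line = line.strip()
        if not line:
            continue  # ignore blank lines
        parts = line.split(maxsplit=1)
        if len(parts) != 2:
            continue  # ignore lines that carry an ID but no sentence
        video_id, sentence = parts
        captions[video_id].append(sentence)
    return dict(captions)


def load_captions(path: str) -> Dict[str, List[str]]:
    """Read the sentences file as UTF-8 and group its lines by video ID."""
    with open(path, encoding="utf-8") as f:
        return parse_captions(f)
```

With a local clone of the repository, `load_captions("data/MSVD-indonesian.txt")` would then return a dictionary that maps each video ID to its list of Indonesian sentences, provided the line format matches the assumption above.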
## Qualitative Results
### Text-to-Video Retrieval
### Video-to-Text Retrieval
### Video Captioning
## Citation
If you find our work useful in your research, please cite:

```bibtex
@article{Hendria2023MSVDID,
title={{MSVD}-{I}ndonesian: A Benchmark for Multimodal Video-Text Tasks in Indonesian},
author={Willy Fitra Hendria},
journal={arXiv preprint arXiv:2306.11341},
year={2023}
}
```

## Acknowledgments
Our experimental results were obtained using resources from [X-CLIP](https://github.com/xuguohai/X-CLIP) and [VNS-GRU](https://github.com/WingsBrokenAngel/delving-deeper-into-the-decoder-for-video-captioning). We thank the original authors for open-sourcing their work.