{"id":18206696,"url":"https://github.com/willyfh/msvd-indonesian","last_synced_at":"2026-03-19T03:26:24.735Z","repository":{"id":186115196,"uuid":"634162870","full_name":"willyfh/msvd-indonesian","owner":"willyfh","description":"MSVD-Indonesian: A Benchmark for Multimodal Video-Text Tasks in Indonesian (Bahasa Indonesia).","archived":false,"fork":false,"pushed_at":"2025-06-30T00:58:49.000Z","size":2679,"stargazers_count":2,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-06-30T01:36:31.060Z","etag":null,"topics":["bahasa-indonesia","deep-learning","indonesian-dataset","msvd","msvd-indonesian","multimodal-dataset","neural-network","video-captioning","video-description","video-retrieval","video-text"],"latest_commit_sha":null,"homepage":"https://arxiv.org/abs/2306.11341","language":null,"has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/willyfh.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE.txt","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null}},"created_at":"2023-04-29T08:43:46.000Z","updated_at":"2025-06-30T00:58:53.000Z","dependencies_parsed_at":null,"dependency_job_id":"91c39364-8368-438e-a66b-992c8a5d2d8b","html_url":"https://github.com/willyfh/msvd-indonesian","commit_stats":null,"previous_names":["willyfh/msvd-indonesian"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/willyfh/msvd-indonesian","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/willyfh%2Fmsvd-indonesian","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/willyfh%2Fmsvd-indonesian/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/willyfh%2F
msvd-indonesian/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/willyfh%2Fmsvd-indonesian/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/willyfh","download_url":"https://codeload.github.com/willyfh/msvd-indonesian/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/willyfh%2Fmsvd-indonesian/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":28632781,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-01-21T04:47:28.174Z","status":"ssl_error","status_checked_at":"2026-01-21T04:47:22.943Z","response_time":86,"last_error":"SSL_connect returned=1 errno=0 peeraddr=140.82.121.5:443 state=error: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["bahasa-indonesia","deep-learning","indonesian-dataset","msvd","msvd-indonesian","multimodal-dataset","neural-network","video-captioning","video-description","video-retrieval","video-text"],"created_at":"2024-11-03T12:05:58.369Z","updated_at":"2026-01-21T12:02:17.804Z","avatar_url":"https://github.com/willyfh.png","language":null,"funding_links":[],"categories":[],"sub_categories":[],"readme":"# 
MSVD-Indonesian\n[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/msvd-indonesian-a-benchmark-for-multimodal/video-retrieval-on-msvd-indonesian)](https://paperswithcode.com/sota/video-retrieval-on-msvd-indonesian?p=msvd-indonesian-a-benchmark-for-multimodal) [![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/msvd-indonesian-a-benchmark-for-multimodal/text-to-video-retrieval-on-msvd-indonesian)](https://paperswithcode.com/sota/text-to-video-retrieval-on-msvd-indonesian?p=msvd-indonesian-a-benchmark-for-multimodal) [![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/msvd-indonesian-a-benchmark-for-multimodal/video-to-text-retrieval-on-msvd-indonesian)](https://paperswithcode.com/sota/video-to-text-retrieval-on-msvd-indonesian?p=msvd-indonesian-a-benchmark-for-multimodal) [![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/msvd-indonesian-a-benchmark-for-multimodal/video-captioning-on-msvd-indonesian)](https://paperswithcode.com/sota/video-captioning-on-msvd-indonesian?p=msvd-indonesian-a-benchmark-for-multimodal)\n\n\nMSVD-Indonesian (Paper: [link](https://arxiv.org/abs/2306.11341)) is derived from the MSVD dataset by translating its English sentences into Indonesian with the help of a machine translation service. This dataset can be used for multimodal video-text tasks, including text-to-video retrieval, video-to-text retrieval, and video captioning. 
Like the original English dataset, MSVD-Indonesian contains about 80k video-text pairs.\n\n## Data\n\nIndonesian (Bahasa Indonesia) sentences: [link](https://github.com/willyfh/msvd-indonesian/blob/main/data/MSVD-indonesian.txt)\n\nRaw videos: [link](https://www.cs.utexas.edu/users/ml/clamp/videoDescription/YouTubeClips.tar)\n\n## Qualitative Results\n\n### Text-to-Video Retrieval\n![Text-to-Video Retrieval](https://raw.githubusercontent.com/willyfh/msvd-indonesian/main/figures/qualitative-results-t2v-ret.png)\n\n### Video-to-Text Retrieval\n![Video-to-Text Retrieval](https://raw.githubusercontent.com/willyfh/msvd-indonesian/main/figures/qualitative-results-v2t-ret.png)\n\n### Video Captioning\n![Video Captioning](https://raw.githubusercontent.com/willyfh/msvd-indonesian/main/figures/qualitative-results-v2t-cap.png)\n\n## Citation\nIf you find our work useful in your research, please cite:\n\n```bibtex\n@article{Hendria2023MSVDID,\n  title={{MSVD}-{I}ndonesian: A Benchmark for Multimodal Video-Text Tasks in Indonesian},\n  author={Willy Fitra Hendria},\n  journal={arXiv preprint arXiv:2306.11341},\n  year={2023}\n}\n```\n\n## Acknowledgments\nOur experimental results were obtained using resources from [X-CLIP](https://github.com/xuguohai/X-CLIP) and [VNS-GRU](https://github.com/WingsBrokenAngel/delving-deeper-into-the-decoder-for-video-captioning). We thank the original authors for open-sourcing their work.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fwillyfh%2Fmsvd-indonesian","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fwillyfh%2Fmsvd-indonesian","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fwillyfh%2Fmsvd-indonesian/lists"}