https://github.com/showlab/VLog

[CVPR 2025] Video Narration as Vocabulary & Video as Long Document
https://github.com/showlab/VLog

chatgpt langchain large-language-model video-language vocabulary whisper

Last synced: 17 days ago
JSON representation

[CVPR 2025] Video Narration as Vocabulary & Video as Long Document

Host: GitHub
URL: https://github.com/showlab/VLog
Owner: showlab
Created: 2023-04-20T13:43:25.000Z (about 2 years ago)
Default Branch: main
Last Pushed: 2025-03-13T04:51:30.000Z (about 1 month ago)
Last Synced: 2025-04-04T09:05:27.886Z (20 days ago)
Topics: chatgpt, langchain, large-language-model, video-language, vocabulary, whisper
Language: Python
Homepage:
Size: 10.8 MB
Stars: 566
Watchers: 8
Forks: 28
Open Issues: 8
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

awesome-chatgpt - VLog - Convert videos to documents. (Web apps / Hosted and self-hosted)
Awesome-Multimodal-Chatbot - [code
awesome-chatgpt - VLog - Convert videos to documents. (Web apps / Hosted and self-hosted)
awesome-llms-for-video-understanding - **VLog: Video as a Long Document** - | (😎 Vid-LLMs: Models / 🗒️ Taxonomy 2)

README

        
# [VLog](https://arxiv.org/abs/2503.09402)

VLog aims to seek a new perspective on video-language understanding.

👇 Click the branch to see more instructions.

|      | [VLog (CVPR'25)](./VLog)   |[VLog-Agent](./VLog-agent/)|

|-----------|-----------|-----------|

|**TL;DR**| Video Narration as Vocabulary|Video as Long Document|

||  |  |

|**Method**|A novel, efficient video narrator (GPT2-based) with Narration Vocabulary via Generative Retrieval.|Given a video, we turn it into a textual document containing visual + audio info. By sending this doc to LLM, we can chat over the video!|

## 🎓 BibTeX

If you find our work helpful, please kindly consider citing our paper.

```

@misc{lin2025vlog,

      title={VLog: Video-Language Models by Generative Retrieval of Narration Vocabulary}, 

      author={Kevin Qinghong Lin and Mike Zheng Shou},

      year={2025},

      eprint={2503.09402},

      archivePrefix={arXiv},

      primaryClass={cs.CV},

      url={https://arxiv.org/abs/2503.09402}, 

}

```

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/showlab/VLog

Awesome Lists containing this project

README