An open API service indexing awesome lists of open source software.

https://github.com/showlab/gui-narrator

Repository of GUI Action Narrator
https://github.com/showlab/gui-narrator

Last synced: 12 days ago
JSON representation

Repository of GUI Action Narrator

Awesome Lists containing this project

README

        

## GUI Action Narrator: Where and When Did That Action Take Place?

Qinchen Wu, Difei Gao, Kevin Qinghong Lin, Zhuoyu Wu, Xiangwu Guo, Peiran Li, Weichen Zhang, Hengxu Wang, Mike Zheng Shou

## 🤖: Introduction

We introduce GUI action dataset **Act2Cap** as well as an effective framework: **GUI Narrator** for GUI video captioning that utilizes the cursor detection to enhance the interpretation of high-resolution screenshots and keyframe extraction in GUI actions.

## 📋 ToDo List

- [x] Model for Cursor detector and Narrator
- [ ] Code of conduct

-- Our model and test benchmark are availble on [![Hugging Face](https://img.shields.io/badge/Demo-HuggingFace-blue)](https://huggingface.co/FRank62Wu/ShowUI-Narrator).