https://github.com/microsoft/OmniParser
A simple screen parsing tool towards pure vision based GUI agent
Last synced: 7 months ago
- Host: GitHub
- URL: https://github.com/microsoft/OmniParser
- Owner: microsoft
- License: cc-by-4.0
- Created: 2024-09-20T05:18:18.000Z (about 1 year ago)
- Default Branch: master
- Last Pushed: 2025-03-17T20:18:19.000Z (8 months ago)
- Last Synced: 2025-03-23T07:01:09.208Z (8 months ago)
- Language: Jupyter Notebook
- Homepage:
- Size: 42.2 MB
- Stars: 20,939
- Watchers: 170
- Forks: 1,710
- Open Issues: 174
Metadata Files:
- Readme: README.md
- License: LICENSE
- Security: SECURITY.md
Awesome Lists containing this project
- awesomeLibrary - OmniParser - A simple screen parsing tool towards pure vision based GUI agent (Language Resource Library / python)
- awesome-azure-openai-copilot - OmniParser - ✨Vision-based GUI parsing for screen-grounded agent control. (Agent Frameworks)
- acu - Code
- awesome-ui-agents - code
- awesome - microsoft/OmniParser - A simple screen parsing tool towards pure vision based GUI agent (Jupyter Notebook)
- AiTreasureBox - microsoft/OmniParser - A simple screen parsing tool towards pure vision based GUI agent (Repos)