{"id":24884168,"url":"https://github.com/bytedance/ui-tars-desktop","last_synced_at":"2025-09-09T21:24:56.565Z","repository":{"id":273470087,"uuid":"918932603","full_name":"bytedance/UI-TARS-desktop","owner":"bytedance","description":"A GUI Agent application based on UI-TARS(Vision-Language Model) that allows you to control your computer using natural language.","archived":false,"fork":false,"pushed_at":"2025-05-08T10:21:00.000Z","size":45974,"stargazers_count":13258,"open_issues_count":145,"forks_count":1072,"subscribers_count":123,"default_branch":"main","last_synced_at":"2025-05-09T02:45:01.719Z","etag":null,"topics":["agent","browser-use","computer-use","electron","gui-agents","mcp","mcp-server","vision","vite","vlm"],"latest_commit_sha":null,"homepage":"https://agent-tars.com","language":"TypeScript","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"apache-2.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/bytedance.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":"CONTRIBUTING.md","funding":null,"license":"LICENSE","code_of_conduct":"CODE_OF_CONDUCT.md","threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":"SECURITY.md","support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null}},"created_at":"2025-01-19T09:04:43.000Z","updated_at":"2025-05-09T01:31:49.000Z","dependencies_parsed_at":null,"dependency_job_id":"1e5e640c-b3cd-4981-9297-ce3dd8b3cb05","html_url":"https://github.com/bytedance/UI-TARS-desktop","commit_stats":null,"previous_names":["bytedance/ui-tars-desktop"],"tags_count":212,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/bytedance%2FUI-TARS-desktop","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/bytedance%2FUI-TARS-desktop/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/bytedance%2FUI-TARS-desktop/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/bytedance%2FUI-TARS-desktop/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/bytedance","download_url":"https://codeload.github.com/bytedance/UI-TARS-desktop/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":253508612,"owners_count":21919476,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["agent","browser-use","computer-use","electron","gui-agents","mcp","mcp-server","vision","vite","vlm"],"created_at":"2025-02-01T14:19:38.057Z","updated_at":"2025-09-09T21:24:56.553Z","avatar_url":"https://github.com/bytedance.png","language":"TypeScript","funding_links":[],"categories":[],"sub_categories":[],"readme":"\u003cpicture\u003e\n  \u003cimg alt=\"Agent TARS Banner\" src=\"./images/tars.png\"\u003e\n\u003c/picture\u003e\n\n\u003cbr/\u003e\n\n## Introduction\n\nEnglish | [简体中文](./README.zh-CN.md)\n\n[![](https://trendshift.io/api/badge/repositories/13584)](https://trendshift.io/repositories/13584)\n\n\u003cb\u003eTARS\u003csup\u003e\\*\u003c/sup\u003e\u003c/b\u003e is a Multimodal AI Agent stack, currently shipping two projects: [Agent TARS](#agent-tars) and [UI-TARS-desktop](#ui-tars-desktop):\n\n\u003ctable\u003e\n  \u003cthead\u003e\n    \u003ctr\u003e\n      \u003cth width=\"50%\" align=\"center\"\u003e\u003ca href=\"#agent-tars\"\u003eAgent TARS\u003c/a\u003e\u003c/th\u003e\n      \u003cth width=\"50%\" align=\"center\"\u003e\u003ca href=\"#ui-tars-desktop\"\u003eUI-TARS-desktop\u003c/a\u003e\u003c/th\u003e\n    \u003c/tr\u003e\n  \u003c/thead\u003e\n  \u003ctbody\u003e\n    \u003ctr\u003e\n      \u003ctd align=\"center\"\u003e\n        \u003cvideo src=\"https://github.com/user-attachments/assets/c9489936-afdc-4d12-adda-d4b90d2a869d\" width=\"50%\"\u003e\u003c/video\u003e\n      \u003c/td\u003e\n      \u003ctd align=\"center\"\u003e\n        \u003cvideo src=\"https://github.com/user-attachments/assets/e0914ce9-ad33-494b-bdec-0c25c1b01a27\" width=\"50%\"\u003e\u003c/video\u003e\n      \u003c/td\u003e\n    \u003c/tr\u003e\n    \u003ctr\u003e\n      \u003ctd align=\"left\"\u003e\n        \u003cb\u003eAgent TARS\u003c/b\u003e is a general multimodal AI Agent stack, it brings the power of GUI Agent and Vision into your terminal, computer, browser and product.\n        \u003cbr\u003e\n        \u003cbr\u003e\n        It primarily ships with a \u003ca href=\"https://agent-tars.com/guide/basic/cli.html\" target=\"_blank\"\u003eCLI\u003c/a\u003e and \u003ca href=\"https://agent-tars.com/guide/basic/web-ui.html\" target=\"_blank\"\u003eWeb UI\u003c/a\u003e for usage.\n        It aims to provide a workflow that is closer to human-like task completion through cutting-edge multimodal LLMs and seamless integration with various real-world \u003ca href=\"https://agent-tars.com/guide/basic/mcp.html\" target=\"_blank\"\u003eMCP\u003c/a\u003e tools.\n      \u003c/td\u003e\n      \u003ctd align=\"left\"\u003e\n        \u003cb\u003eUI-TARS Desktop\u003c/b\u003e is a desktop application that provides a native GUI Agent based on the \u003ca href=\"https://github.com/bytedance/UI-TARS\" target=\"_blank\"\u003eUI-TARS\u003c/a\u003e model.\n        \u003cbr\u003e\n        \u003cbr\u003e\n        It primarily ships a\n        \u003ca href=\"https://github.com/bytedance/UI-TARS-desktop/blob/main/docs/quick-start.md#get-model-and-run-local-operator\" target=\"_blank\"\u003elocal\u003c/a\u003e and \n        \u003ca href=\"https://github.com/bytedance/UI-TARS-desktop/blob/main/docs/quick-start.md#run-remote-operator\" target=\"_blank\"\u003eremote\u003c/a\u003e computer as well as browser operators.\n      \u003c/td\u003e\n    \u003c/tr\u003e\n  \u003c/tbody\u003e\n\u003c/table\u003e\n\n## Table of Contents\n\n\u003c!-- START doctoc generated TOC please keep comment here to allow auto update --\u003e\n\u003c!-- DON'T EDIT THIS SECTION, INSTEAD RE-RUN doctoc TO UPDATE --\u003e\n\n- [News](#news)\n- [Agent TARS](#agent-tars)\n  - [Showcase](#showcase)\n  - [Core Features](#core-features)\n  - [Quick Start](#quick-start)\n  - [Documentation](#documentation)\n- [UI-TARS Desktop](#ui-tars-desktop)\n  - [Showcase](#showcase-1)\n  - [Features](#features)\n  - [Quick Start](#quick-start-1)\n- [Contributing](#contributing)\n- [License](#license)\n- [Citation](#citation)\n\n\u003c!-- END doctoc generated TOC please keep comment here to allow auto update --\u003e\n\n## News\n\n- **\\[2025-06-25\\]** We released a Agent TARS Beta and Agent TARS CLI - [Introducing Agent TARS Beta](https://agent-tars.com/blog/2025-06-25-introducing-agent-tars-beta.html), a multimodal AI agent that aims to explore a work form that is closer to human-like task completion through rich multimodal capabilities (such as GUI Agent, Vision) and seamless integration with various real-world tools.\n- **\\[2025-06-12\\]** - 🎁 We are thrilled to announce the release of UI-TARS Desktop v0.2.0! This update introduces two powerful new features: **Remote Computer Operator** and **Remote Browser Operator**—both completely free. No configuration required: simply click to remotely control any computer or browser, and experience a new level of convenience and intelligence.\n- **\\[2025-04-17\\]** - 🎉 We're thrilled to announce the release of new UI-TARS Desktop application v0.1.0, featuring a redesigned Agent UI. The application enhances the computer using experience, introduces new browser operation features, and supports [the advanced UI-TARS-1.5 model](https://seed-tars.com/1.5) for improved performance and precise control.\n- **\\[2025-02-20\\]** - 📦 Introduced [UI TARS SDK](./docs/sdk.md), is a powerful cross-platform toolkit for building GUI automation agents.\n- **\\[2025-01-23\\]** - 🚀 We updated the **[Cloud Deployment](./docs/deployment.md#cloud-deployment)** section in the 中文版: [GUI模型部署教程](https://bytedance.sg.larkoffice.com/docx/TCcudYwyIox5vyxiSDLlgIsTgWf#U94rdCxzBoJMLex38NPlHL21gNb) with new information related to the ModelScope platform. You can now use the ModelScope platform for deployment.\n\n\u003cbr\u003e\n\n## Agent TARS\n\n\u003cp\u003e\n    \u003ca href=\"https://npmjs.com/package/@agent-tars/cli?activeTab=readme\"\u003e\u003cimg src=\"https://img.shields.io/npm/v/@agent-tars/cli?style=for-the-badge\u0026colorA=1a1a2e\u0026colorB=3B82F6\u0026logo=npm\u0026logoColor=white\" alt=\"npm version\" /\u003e\u003c/a\u003e\n    \u003ca href=\"https://npmcharts.com/compare/@agent-tars/cli?minimal=true\"\u003e\u003cimg src=\"https://img.shields.io/npm/dm/@agent-tars/cli.svg?style=for-the-badge\u0026colorA=1a1a2e\u0026colorB=0EA5E9\u0026logo=npm\u0026logoColor=white\" alt=\"downloads\" /\u003e\u003c/a\u003e\n    \u003ca href=\"https://nodejs.org/en/about/previous-releases\"\u003e\u003cimg src=\"https://img.shields.io/node/v/@agent-tars/cli.svg?style=for-the-badge\u0026colorA=1a1a2e\u0026colorB=06B6D4\u0026logo=node.js\u0026logoColor=white\" alt=\"node version\"\u003e\u003c/a\u003e\n    \u003ca href=\"https://discord.gg/HnKcSBgTVx\"\u003e\u003cimg src=\"https://img.shields.io/badge/Discord-Join%20Community-5865F2?style=for-the-badge\u0026logo=discord\u0026logoColor=white\" alt=\"Discord Community\" /\u003e\u003c/a\u003e\n    \u003ca href=\"https://twitter.com/agent_tars\"\u003e\u003cimg src=\"https://img.shields.io/badge/Twitter-Follow%20%40agent__tars-1DA1F2?style=for-the-badge\u0026logo=twitter\u0026logoColor=white\" alt=\"Official Twitter\" /\u003e\u003c/a\u003e\n    \u003ca href=\"https://applink.larkoffice.com/client/chat/chatter/add_by_link?link_token=279h3365-b0fa-407f-89f3-0f96f36cd4d8\"\u003e\u003cimg src=\"https://img.shields.io/badge/飞书群-加入交流群-00D4AA?style=for-the-badge\u0026logo=lark\u0026logoColor=white\" alt=\"飞书交流群\" /\u003e\u003c/a\u003e\n    \u003ca href=\"https://deepwiki.com/bytedance/UI-TARS-desktop\"\u003e\u003cimg src=\"https://img.shields.io/badge/DeepWiki-Ask%20AI-8B5CF6?style=for-the-badge\u0026logo=gitbook\u0026logoColor=white\" alt=\"Ask DeepWiki\" /\u003e\u003c/a\u003e\n\u003c/p\u003e\n\n\u003cb\u003eAgent TARS\u003c/b\u003e is a general multimodal AI Agent stack, it brings the power of GUI Agent and Vision into your terminal, computer, browser and product. \u003cbr\u003e \u003cbr\u003e\nIt primarily ships with a \u003ca href=\"https://agent-tars.com/guide/basic/cli.html\" target=\"_blank\"\u003eCLI\u003c/a\u003e and \u003ca href=\"https://agent-tars.com/guide/basic/web-ui.html\" target=\"_blank\"\u003eWeb UI\u003c/a\u003e for usage.\nIt aims to provide a workflow that is closer to human-like task completion through cutting-edge multimodal LLMs and seamless integration with various real-world \u003ca href=\"https://agent-tars.com/guide/basic/mcp.html\" target=\"_blank\"\u003eMCP\u003c/a\u003e tools.\n\n\n### Showcase\n\n```\nPlease help me book the earliest flight from San Jose to New York on September 1st and the last return flight on September 6th on Priceline\n```\n\nhttps://github.com/user-attachments/assets/772b0eef-aef7-4ab9-8cb0-9611820539d8\n\n\u003cbr\u003e\n\n\u003ctable\u003e\n  \u003cthead\u003e\n    \u003ctr\u003e\n      \u003cth width=\"50%\" align=\"center\"\u003eBooking Hotel\u003c/th\u003e\n      \u003cth width=\"50%\" align=\"center\"\u003eGenerate Chart with extra MCP Servers\u003c/th\u003e\n    \u003c/tr\u003e\n  \u003c/thead\u003e\n  \u003ctbody\u003e\n    \u003ctr\u003e\n      \u003ctd align=\"center\"\u003e\n        \u003cvideo src=\"https://github.com/user-attachments/assets/c9489936-afdc-4d12-adda-d4b90d2a869d\" width=\"50%\"\u003e\u003c/video\u003e\n      \u003c/td\u003e\n      \u003ctd align=\"center\"\u003e\n        \u003cvideo src=\"https://github.com/user-attachments/assets/a9fd72d0-01bb-4233-aa27-ca95194bbce9\" width=\"50%\"\u003e\u003c/video\u003e\n      \u003c/td\u003e\n    \u003c/tr\u003e\n    \u003ctr\u003e\n      \u003ctd align=\"left\"\u003e\n        \u003cb\u003eInstruction:\u003c/b\u003e \u003ci\u003eI am in Los Angeles from September 1st to September 6th, with a budget of $5,000. Please help me book a Ritz-Carlton hotel closest to the airport on booking.com and compile a transportation guide for me\u003c/i\u003e\n      \u003c/td\u003e\n      \u003ctd align=\"left\"\u003e\n        \u003cb\u003eInstruction:\u003c/b\u003e \u003ci\u003eDraw me a chart of Hangzhou's weather for one month\u003c/i\u003e\n      \u003c/td\u003e\n    \u003c/tr\u003e\n  \u003c/tbody\u003e\n\u003c/table\u003e\n\nFor more use cases, please check out [#842](https://github.com/bytedance/UI-TARS-desktop/issues/842).\n\n### Core Features\n\n- 🖱️ **One-Click Out-of-the-box CLI** - Supports both **headful** [Web UI](https://agent-tars.com/guide/basic/web-ui.html) and **headless** [server](https://agent-tars.com/guide/advanced/server.html)) [execution](https://agent-tars.com/guide/basic/cli.html).\n- 🌐 **Hybrid Browser Agent** - Control browsers using [GUI Agent](https://agent-tars.com/guide/basic/browser.html#visual-grounding), [DOM](https://agent-tars.com/guide/basic/browser.html#dom), or a hybrid strategy.\n- 🔄 **Event Stream** - Protocol-driven Event Stream drives [Context Engineering](https://agent-tars.com/beta#context-engineering) and [Agent UI](https://agent-tars.com/blog/2025-06-25-introducing-agent-tars-beta.html#easy-to-build-applications).\n- 🧰 **MCP Integration** - The kernel is built on MCP and also supports mounting [MCP Servers](https://agent-tars.com/guide/basic/mcp.html) to connect to real-world tools.\n\n### Quick Start\n\n\u003cimg alt=\"Agent TARS CLI\" src=\"https://agent-tars.com/agent-tars-cli.png\"\u003e\n\n```bash\n# Luanch with `npx`.\nnpx @agent-tars/cli@latest\n\n# Install globally, required Node.js \u003e= 22\nnpm install @agent-tars/cli@latest -g\n\n# Run with your preferred model provider\nagent-tars --provider volcengine --model doubao-1-5-thinking-vision-pro-250428 --apiKey your-api-key\nagent-tars --provider anthropic --model claude-3-7-sonnet-latest --apiKey your-api-key\n```\n\nVisit the comprehensive [Quick Start](https://agent-tars.com/guide/get-started/quick-start.html) guide for detailed setup instructions.\n\n### Documentation\n\n\u003e 🌟 **Explore Agent TARS Universe** 🌟\n\n\u003ctable\u003e\n  \u003cthead\u003e\n    \u003ctr\u003e\n      \u003cth width=\"20%\" align=\"center\"\u003eCategory\u003c/th\u003e\n      \u003cth width=\"30%\" align=\"center\"\u003eResource Link\u003c/th\u003e\n      \u003cth width=\"50%\" align=\"left\"\u003eDescription\u003c/th\u003e\n    \u003c/tr\u003e\n  \u003c/thead\u003e\n  \u003ctbody\u003e\n    \u003ctr\u003e\n      \u003ctd align=\"center\"\u003e🏠 \u003cstrong\u003eCentral Hub\u003c/strong\u003e\u003c/td\u003e\n      \u003ctd align=\"center\"\u003e\n        \u003ca href=\"https://agent-tars.com\"\u003e\n          \u003cimg src=\"https://img.shields.io/badge/Visit-Website-4F46E5?style=for-the-badge\u0026logo=globe\u0026logoColor=white\" alt=\"Website\" /\u003e\n        \u003c/a\u003e\n      \u003c/td\u003e\n      \u003ctd align=\"left\"\u003eYour gateway to Agent TARS ecosystem\u003c/td\u003e\n    \u003c/tr\u003e\n      \u003ctr\u003e\n      \u003ctd align=\"center\"\u003e📚 \u003cstrong\u003eQuick Start\u003c/strong\u003e\u003c/td\u003e\n      \u003ctd align=\"center\"\u003e\n        \u003ca href=\"https://agent-tars.com/guide/get-started/quick-start.html\"\u003e\n          \u003cimg src=\"https://img.shields.io/badge/Get-Started-06B6D4?style=for-the-badge\u0026logo=rocket\u0026logoColor=white\" alt=\"Quick Start\" /\u003e\n        \u003c/a\u003e\n      \u003c/td\u003e\n      \u003ctd align=\"left\"\u003eZero to hero in 5 minutes\u003c/td\u003e\n    \u003c/tr\u003e\n    \u003ctr\u003e\n      \u003ctd align=\"center\"\u003e🚀 \u003cstrong\u003eWhat's New\u003c/strong\u003e\u003c/td\u003e\n      \u003ctd align=\"center\"\u003e\n        \u003ca href=\"https://agent-tars.com/beta\"\u003e\n          \u003cimg src=\"https://img.shields.io/badge/Read-Blog-F59E0B?style=for-the-badge\u0026logo=rss\u0026logoColor=white\" alt=\"Blog\" /\u003e\n        \u003c/a\u003e\n      \u003c/td\u003e\n      \u003ctd align=\"left\"\u003eDiscover cutting-edge features \u0026 vision\u003c/td\u003e\n    \u003c/tr\u003e\n    \u003ctr\u003e\n      \u003ctd align=\"center\"\u003e🛠️ \u003cstrong\u003eDeveloper Zone\u003c/strong\u003e\u003c/td\u003e\n      \u003ctd align=\"center\"\u003e\n        \u003ca href=\"https://agent-tars.com/guide/get-started/introduction.html\"\u003e\n          \u003cimg src=\"https://img.shields.io/badge/View-Docs-10B981?style=for-the-badge\u0026logo=gitbook\u0026logoColor=white\" alt=\"Docs\" /\u003e\n        \u003c/a\u003e\n      \u003c/td\u003e\n      \u003ctd align=\"left\"\u003eMaster every command \u0026 features\u003c/td\u003e\n    \u003c/tr\u003e\n    \u003ctr\u003e\n      \u003ctd align=\"center\"\u003e🎯 \u003cstrong\u003eShowcase\u003c/strong\u003e\u003c/td\u003e\n      \u003ctd align=\"center\"\u003e\n        \u003ca href=\"https://github.com/bytedance/UI-TARS-desktop/issues/842\"\u003e\n          \u003cimg src=\"https://img.shields.io/badge/View-Examples-8B5CF6?style=for-the-badge\u0026logo=github\u0026logoColor=white\" alt=\"Examples\" /\u003e\n        \u003c/a\u003e\n      \u003c/td\u003e\n      \u003ctd align=\"left\"\u003eView use cases built by the official and community\u003c/td\u003e\n    \u003c/tr\u003e\n    \u003ctr\u003e\n      \u003ctd align=\"center\"\u003e🔧 \u003cstrong\u003eReference\u003c/strong\u003e\u003c/td\u003e\n      \u003ctd align=\"center\"\u003e\n        \u003ca href=\"https://agent-tars.com/api/\"\u003e\n          \u003cimg src=\"https://img.shields.io/badge/API-Reference-EF4444?style=for-the-badge\u0026logo=book\u0026logoColor=white\" alt=\"API\" /\u003e\n        \u003c/a\u003e\n      \u003c/td\u003e\n      \u003ctd align=\"left\"\u003eComplete technical reference\u003c/td\u003e\n    \u003c/tr\u003e\n  \u003c/tbody\u003e\n\u003c/table\u003e\n\n\u003cbr/\u003e\n\u003cbr/\u003e\n\u003cbr/\u003e\n\n## UI-TARS Desktop\n\n\u003cp align=\"center\"\u003e\n  \u003cimg alt=\"UI-TARS\" width=\"260\" src=\"./apps/ui-tars/resources/icon.png\"\u003e\n\u003c/p\u003e\n\nUI-TARS Desktop is a native GUI agent for your local computer, driven by [UI-TARS](https://github.com/bytedance/UI-TARS) and Seed-1.5-VL/1.6 series models.\n\n\u003cdiv align=\"center\"\u003e\n\u003cp\u003e\n        \u0026nbsp\u0026nbsp 📑 \u003ca href=\"https://arxiv.org/abs/2501.12326\"\u003ePaper\u003c/a\u003e \u0026nbsp\u0026nbsp\n        | 🤗 \u003ca href=\"https://huggingface.co/ByteDance-Seed/UI-TARS-1.5-7B\"\u003eHugging Face Models\u003c/a\u003e\u0026nbsp\u0026nbsp\n        | \u0026nbsp\u0026nbsp🫨 \u003ca href=\"https://discord.gg/pTXwYVjfcs\"\u003eDiscord\u003c/a\u003e\u0026nbsp\u0026nbsp\n        | \u0026nbsp\u0026nbsp🤖 \u003ca href=\"https://www.modelscope.cn/collections/UI-TARS-bccb56fa1ef640\"\u003eModelScope\u003c/a\u003e\u0026nbsp\u0026nbsp\n\u003cbr\u003e\n🖥️ Desktop Application \u0026nbsp\u0026nbsp\n| \u0026nbsp\u0026nbsp 👓 \u003ca href=\"https://github.com/web-infra-dev/midscene\"\u003eMidscene (use in browser)\u003c/a\u003e \u0026nbsp\u0026nbsp\n\u003c/p\u003e\n\n\u003c/div\u003e\n\n### Showcase\n\n\u003c!-- // FIXME: Choose only two demo, one local computer and one remote computer showcase. --\u003e\n\n|                                                          Instruction                                                           |                                                Local Operator                                                |                                               Remote Operator                                                |\n| :----------------------------------------------------------------------------------------------------------------------------: | :----------------------------------------------------------------------------------------------------------: | :----------------------------------------------------------------------------------------------------------: |\n| Please help me open the autosave feature of VS Code and delay AutoSave operations for 500 milliseconds in the VS Code setting. | \u003cvideo src=\"https://github.com/user-attachments/assets/e0914ce9-ad33-494b-bdec-0c25c1b01a27\" height=\"300\" /\u003e | \u003cvideo src=\"https://github.com/user-attachments/assets/01e49b69-7070-46c8-b3e3-2aaaaec71800\" height=\"300\" /\u003e |\n|                    Could you help me check the latest open issue of the UI-TARS-Desktop project on GitHub?                     | \u003cvideo src=\"https://github.com/user-attachments/assets/3d159f54-d24a-4268-96c0-e149607e9199\" height=\"300\" /\u003e | \u003cvideo src=\"https://github.com/user-attachments/assets/072fb72d-7394-4bfa-95f5-4736e29f7e58\" height=\"300\" /\u003e |\n\n### Features\n\n- 🤖 Natural language control powered by Vision-Language Model\n- 🖥️ Screenshot and visual recognition support\n- 🎯 Precise mouse and keyboard control\n- 💻 Cross-platform support (Windows/MacOS/Browser)\n- 🔄 Real-time feedback and status display\n- 🔐 Private and secure - fully local processing\n\n### Quick Start\n\nSee [Quick Start](./docs/quick-start.md)\n\n## Contributing\n\nSee [CONTRIBUTING.md](./CONTRIBUTING.md).\n\n## License\n\nThis project is licensed under the Apache License 2.0.\n\n## Citation\n\nIf you find our paper and code useful in your research, please consider giving a star :star: and citation :pencil:\n\n```BibTeX\n@article{qin2025ui,\n  title={UI-TARS: Pioneering Automated GUI Interaction with Native Agents},\n  author={Qin, Yujia and Ye, Yining and Fang, Junjie and Wang, Haoming and Liang, Shihao and Tian, Shizuo and Zhang, Junda and Li, Jiahao and Li, Yunxin and Huang, Shijue and others},\n  journal={arXiv preprint arXiv:2501.12326},\n  year={2025}\n}\n```","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fbytedance%2Fui-tars-desktop","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fbytedance%2Fui-tars-desktop","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fbytedance%2Fui-tars-desktop/lists"}