Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/c0mm4nd/command-windows
CommandWindows is a desktop opeating system copilot based on multi-modal large language model, supporting all-platforms which have application window
https://github.com/c0mm4nd/command-windows
ai chatgpt copilot gemini gemini-pro-vision gpt gpt-4-vision
Last synced: 8 days ago
JSON representation
CommandWindows is a desktop opeating system copilot based on multi-modal large language model, supporting all-platforms which have application window
- Host: GitHub
- URL: https://github.com/c0mm4nd/command-windows
- Owner: c0mm4nd
- Created: 2023-12-06T18:14:51.000Z (about 1 year ago)
- Default Branch: dev
- Last Pushed: 2024-04-04T01:14:20.000Z (8 months ago)
- Last Synced: 2024-11-29T17:49:44.666Z (13 days ago)
- Topics: ai, chatgpt, copilot, gemini, gemini-pro-vision, gpt, gpt-4-vision
- Language: TypeScript
- Homepage:
- Size: 532 KB
- Stars: 6
- Watchers: 1
- Forks: 0
- Open Issues: 3
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
- project-awesome - c0mm4nd/command-windows - CommandWindows is a desktop opeating system copilot based on multi-modal large language model, supporting all-platforms which have application window (TypeScript)
README
⌘ Command ⊞ WindowsCommandWindows (⌘ ⊞) is a desktop opeating system copilot based on multi-modal large language model, supporting all-platforms which have application windows.
## Supported Model
- [x] GPT4 Vision from OpenAI (`gpt-4-vision-preview`)
- [x] Gemini Pro Vision from Google (`gemini-pro-vision`)
- [ ] Gemini Nano/Ultra Vision from Google
- [ ] Local Vision Model
- [ ] Vary-toy## Features
Preview:
> Help me create a Google doc and write the definition of Blockchain on that
[![CommandWin Test Video](https://img.youtube.com/vi/0nnb4PhPv4k/0.jpg)](https://www.youtube.com/watch?v=0nnb4PhPv4k)
- [x] A digital assistant rather than just software
- [x] Use multi-modal perception to help you operate your computer
- [x] Step by step notification showing
- [ ] Testing on more platforms
- [ ] More detailed device information to LLM
- [ ] Enhanced chat experience with better reply content
- [ ] More interactive operation
- [ ] Convinent shortcuts## How to use
While an official release is not yet available due to the experimental status, you can still try out this tool by cloning the repository and then running on your system.
```
git clone https://github.com/c0mm4nd/command-windows
cd command-windows
npm i
npm run start
```The pre-built releases will be available soon!
## How to build
Simply run
```
npm run make
```The built file is inside the `make` folder
## Status
Inspired by [SOC](https://github.com/OthersideAI/self-operating-computer) but written in Electron with Javascript.
Currently, this project is
- actively in development and experimental, not suitable for any production
- welcoming any kind of issues and pull requests!