Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/richardyc/chrome-gpt
An AutoGPT agent that controls Chrome on your desktop
https://github.com/richardyc/chrome-gpt
ai autogpt chatgpt gpt-3-5-turbo gpt-4 langchain
Last synced: 21 days ago
JSON representation
An AutoGPT agent that controls Chrome on your desktop
- Host: GitHub
- URL: https://github.com/richardyc/chrome-gpt
- Owner: richardyc
- License: gpl-3.0
- Created: 2023-04-25T06:15:05.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2023-10-25T05:13:08.000Z (about 1 year ago)
- Last Synced: 2024-10-01T19:27:28.659Z (about 1 month ago)
- Topics: ai, autogpt, chatgpt, gpt-3-5-turbo, gpt-4, langchain
- Language: Python
- Homepage:
- Size: 82 KB
- Stars: 1,672
- Watchers: 23
- Forks: 204
- Open Issues: 23
-
Metadata Files:
- Readme: README.md
- License: LICENSE
- Code of conduct: CODE_OF_CONDUCT.md
Awesome Lists containing this project
- awesome-ChatGPT-repositories - Chrome-GPT - An AutoGPT agent that controls Chrome on your desktop (Browser-extensions)
README
# 🤖 Chrome-GPT: An experimental AutoGPT agent that interacts with Chrome
[![lint](https://github.com/richardyc/chrome-gpt/actions/workflows/lint.yml/badge.svg)](https://github.com/richardyc/chrome-gpt/actions/workflows/lint.yml) [![test](https://github.com/richardyc/chrome-gpt/actions/workflows/tests.yml/badge.svg)](https://github.com/richardyc/chrome-gpt/actions/workflows/tests.yml) [![Twitter](https://img.shields.io/twitter/url/https/twitter.com/RealRichomie.svg?style=social&label=Follow%20%40RealRichomie)](https://twitter.com/RealRichomie)⚠️This is an experimental AutoGPT agent that might take incorrect actions and could lead to serious consequences. Please use it at your own discretion⚠️
Chrome-GPT is an AutoGPT experiment that utilizes [Langchain](https://github.com/hwchase17/langchain) and [Selenium](https://github.com/SeleniumHQ/selenium) to enable an AutoGPT agent take control of an entire Chrome session. With the ability to interactively scroll, click, and input text on web pages, the AutoGPT agent can navigate and manipulate web content.
🖥️ Demo
Input Prompt: `Find me a bar that can host a 20 person event near Chelsea, Manhattan evening of Apr 30th. Fill out contact us form if they have one with info: Name Richard, email [email protected].`
https://user-images.githubusercontent.com/14324698/234191011-ec73af54-4a8e-4298-be1d-4252050f08c1.mov
Demo made by [Richard He](https://twitter.com/RealRichomie)
🔮 Features
- 🌎 Google search
- 🧠 Long-term and short-term memory management
- 🔨 Chrome actions: describe a webpage, scroll to element, click on buttons/links, input forms, switch tabs
- 🤖 Supports multiple agent types: Zero-shot, BabyAGI and Auto-GPT
- 🔥 (IN PROGRESS) Chrome plugin support🧱 Known Limitations
- There are limited web crawling features, with buttons and input fields sometimes failing to appear in prompt.
- The response time is slow, with each action taking between 1-10 seconds to run.
- At times, langchain agents are unable to parse GPT outputs (refer to langchain discussion: https://github.com/hwchase17/langchain/discussions/4065). If you run into this, try specifying a different agent; ie: `python -m chromegpt -a auto-gpt -v -t "{your request}"`Requirements
- Chrome
- Python >3.8
- Install [Poetry](https://python-poetry.org/docs/#installation)🛠️ Setup
1. Set up your OpenAI [API Keys](https://platform.openai.com/account/api-keys) and add `OPENAI_API_KEY` env variable
2. Install Python requirements via poetry `poetry install`
3. Open a poetry shell `poetry shell`
4. Run chromegpt via `python -m chromegpt`You can start in you own codespace here:
[![Open in GitHub Codespaces](https://github.com/codespaces/badge.svg)](https://github.com/erlichsefi/Chrome-GPT/codespaces)
🧠 Usage
- GPT-3.5 Usage (Default): `python -m chromegpt -v -t "{your request}"`
- GPT-4 Usage (Recommended, needs GPT-4 access): `python -m chromegpt -v -a auto-gpt -m gpt-4 -t "{your request}"`
- For help: `python -m chromegpt --help`
```
Usage: python -m chromegpt [OPTIONS]Run ChromeGPT: An AutoGPT agent that interacts with Chrome
Options:
-t, --task TEXT The task to execute [required]
-a, --agent [auto-gpt|baby-agi|zero-shot]
The agent type to use
-m, --model TEXT The model to use
--headless Run in headless mode
-v, --verbose Run in verbose mode
--human-in-loop Run in human-in-loop mode, only available
when using auto-gpt agent
--help Show this message and exit.
```Or Just update .env and
`source .env && docker-compose up`
⭐ Star History
[![Star History Chart](https://api.star-history.com/svg?repos=richardyc/Chrome-GPT&type=Date)](https://star-history.com/#richardyc/Chrome-GPT&Date)