Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/xxyQwQ/GenAgent
Implementation for the paper "ComfyBench: Benchmarking LLM-based Agents in ComfyUI for Autonomously Designing Collaborative AI Systems".
https://github.com/xxyQwQ/GenAgent
Last synced: 2 days ago
JSON representation
Implementation for the paper "ComfyBench: Benchmarking LLM-based Agents in ComfyUI for Autonomously Designing Collaborative AI Systems".
- Host: GitHub
- URL: https://github.com/xxyQwQ/GenAgent
- Owner: xxyQwQ
- Created: 2024-09-01T08:35:29.000Z (5 months ago)
- Default Branch: main
- Last Pushed: 2025-01-14T05:59:48.000Z (14 days ago)
- Last Synced: 2025-01-14T06:46:25.952Z (14 days ago)
- Language: Python
- Homepage: https://xxyqwq.github.io/ComfyBench/
- Size: 56.1 MB
- Stars: 134
- Watchers: 3
- Forks: 11
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
- ai-game-devtools - GenAgent - Case Studies on ComfyUI. |[arXiv](https://arxiv.org/abs/2409.01392) | | Agent | (<span id="game">Game (Agent)</span> / <span id="tool">Tool (AI LLM)</span>)
README
ComfyBench: Benchmarking LLM-based Agents in ComfyUI for Autonomously Designing Collaborative AI Systems
Implementation for the paper "ComfyBench: Benchmarking LLM-based Agents in ComfyUI for Autonomously Designing Collaborative AI Systems".
Xiangyuan Xue
Zeyu Lu
Di Huang
Zidong Wang
Wanli Ouyang
Lei Bai*
![Teaser](assets/teaser.png)
(a) **ComfyBench** is a comprehensive benchmark to evaluate agents's ability to design collaborative AI systems in ComfyUI. Given the task instruction, agents are required to learn from documents and create workflows to describe collaborative AI systems. The performance is measured by *pass rate* and *resolve rate*, reflecting whether the workflow can be correctly executed and whether the task requirements are realized. (b) **ComfyAgent** builds collaborative Al systems in ComfyUI by generating workflows. The workflows are converted into equivalent code so that LLMs can better understand them. ComfyAgent can learn from existing workflows and autonomously design new ones. The generated workflows can be interpreted as collaborative AI systems to complete given tasks.
## 📰 News
- **[2024/11/20]** The latest version of our code is updated.
- **[2024/11/14]** The work is further extended and renamed as *ComfyBench*.
- **[2024/09/04]** Our code implementation is released on GitHub.
- **[2024/09/02]** The initial version of our paper is submitted to arXiv.## ⚙️ Configuration
First, clone the repository and navigate to the project directory:
```bash
git clone https://github.com/xxyQwQ/ComfyBench
cd ComfyBench
```Then, create a new conda environment and install the dependencies:
```bash
conda create -n comfybench python=3.12
conda activate comfybench
pip install -r requirements.txt
```Finally, modify `config.yaml` to set your ComfyUI server and API key. Feel free to change proxies and models if necessary.
⚠️ Before executing the workflows, prepare your ComfyUI with necessary models and extensions.
Despite some models can be automatically installed, other models need to be manually downloaded and placed in the specific directory. You may find them by yourself on [Hugging Face](https://huggingface.co/) or directly download from our [Cloud Drive](https://pan.baidu.com/s/1z83-XnoHcnMz6y0At2i_iw?pwd=1120). Besides, we provide a list of extensions in `assets/extension.md` so that you can install them manually. You can verify the completeness with the workflows in `dataset/benchmark/workflow`.
## 🚀 ComfyAgent Execution
Run the following commands to execute the ComfyAgent pipeline:
```bash
# activate the conda environment
conda activate comfybench# execute the main script
# see `main.py` for more parameter settings
python main.py \
--instruction "task-instruction" \
--agent_name "comfy" \
--save_path "path/to/save/result"
```The log file together with the workflow will be saved in the specified path. If your ComfyUI server is working properly, the workflow will be executed automatically to produce the result.
## 📊 ComfyBench Evaluation
ComfyBench is provided under `dataset/benchmark`. The `document` folder contains documentation for 3205 nodes, where `meta.json` records the metadata of each node. The `workflow` folder contains 20 curriculum workflows for agents to learn from. The `instruction` provides all the tasks in ComfyBench, where `complete.json` contains 200 task instructions and `sample.json` contains a subset of 10 task instructions for validation.
Before evaluating on ComfyBench, you should copy the resource files in `dataset/benchmark/resource` into the input folder of ComfyUI, so that ComfyUI can load them during the workflow execution. Then you can evaluate the specific agent by running the following commands. Here we take ComfyAgent as an example.
```bash
# activate the conda environment and set up environment variables
conda activate comfybench
export PYTHONPATH=./# execute the inference script
# see `script/inference.py` for more parameter settings
python script/inference.py \
--agent_name "comfy" \
--save_path "cache/benchmark/comfy"# execute the evaluation script
# see `script/evaluation.py` for more parameter settings
python script/evaluation.py \
--submit_folder "cache/benchmark/comfy/workflow" \
--cache_path "cache/benchmark/comfy/outcome"
```In this example, the log files and generated workflows will be saved in `cache/benchmark/comfy/logging` and `cache/benchmark/comfy/workflow`, respectively. The produced results will be saved in `cache/benchmark/comfy/outcome`, together with a `result.json` recording whether each task is passed and resolved, and a `summary.txt` summarizing the overall metrics.
## 🎬 Gallery
Here are some examples of the results produced by ComfyAgent on ComfyBench. Visit our [Project Page](https://xxyqwq.github.io/ComfyBench) for more details.
| Task Instruction | Input | Result |
| ------------------------------------------------------------ | :-----------------------------------: | :-------------------------------------: |
| *Generate an image of a hot air balloon floating over a scenic valley at sunrise. The result should be a high-quality image.* | N/A | ![Result](assets/gallery/05_result.png) |
| *Generate an image of a modern city skyline at night with illuminated skyscrapers. The result should be a high-quality image.* | N/A | ![Result](assets/gallery/06_result.png) |
| *You are given an image of a scribble flower. Repaint the scribble into a realistic red flower. The result should be an image of a red flower.* | ![Input](assets/gallery/09_input.png) | ![Result](assets/gallery/09_result.png) |
| *You are given an image of a red apple. Change it into a green apple on a table while maintaining other details. The result should be an image of a green apple.* | ![Input](assets/gallery/18_input.jpg) | ![Result](assets/gallery/18_result.png) |
| *You are given an image of a sample logo containing a bird pattern. Convert it into a cubist art poster with dark colors. The result should be an image of a poster without watermark.* | ![Input](assets/gallery/17_input.png) | ![Result](assets/gallery/17_result.png) |
| *You are given an image of a large castle standing on top of a hill. Convert the castle into the style of ice cream while maintaining its original structure. The result should be an image with the castle transformed into a colorful and fantastic ice cream castle.* | ![Input](assets/gallery/12_input.png) | ![Result](assets/gallery/12_result.png) |
| *You are given a low-resolution photo of a crowd of people. Upscale the image by 4x. The result should be a high-resolution version of the image.* | ![Input](assets/gallery/13_input.png) | ![Result](assets/gallery/13_result.png) |
| *You are given an image of a table filled with dishes. Remove the fork on the table. The result should be a high-quality image without visible artifacts.* | ![Input](assets/gallery/14_input.png) | ![Result](assets/gallery/14_result.png) |
| *You are given an image of a red car parked on the street. Replace the tree behind the car with a white house. The result should be a high-quality image without visible artifacts.* | ![Input](assets/gallery/15_input.png) | ![Result](assets/gallery/15_result.png) |
| *You are given an image of a red bridge with a person standing on it. Remove the person from the image while maintaining the original appearance of the bridge. The result should be a high-quality image without visible artifacts.* | ![Input](assets/gallery/03_input.png) | ![Result](assets/gallery/03_result.png) |
| *You are given a photo of mountains and rivers with a visible watermark in the bottom right corner. Remove the watermark from the image while maintaining the quality and content of the original photo. The result should be a high-quality image without the watermark.* | ![Input](assets/gallery/11_input.png) | ![Result](assets/gallery/11_result.png) |
| *You are given an image of a girl playing the guitar. Generate an image of an old man playing the guitar in a forest with the same pose as the girl. The result should be a realistic image of an old man playing the guitar.* | ![Input](assets/gallery/07_input.png) | ![Result](assets/gallery/07_result.png) |
| *You are given an image of a man wearing a black jacket. Change the black jacket into a white hoodie while ensuring that the modification looks natural and realistic. The result should be a high-quality image of the man wearing a white hoodie.* | ![Input](assets/gallery/10_input.png) | ![Result](assets/gallery/10_result.png) |
| *You are given an image of a male celebrity. Transform the man in the image into a beautiful woman with ponytail hair while preserving her facial identity. The result should be a high-quality image of the woman.* | ![Input](assets/gallery/01_input.png) | ![Result](assets/gallery/01_result.png) |
| *You are given an image of a toy dog. Replace the background with a scene of a sunny park with green grass while keeping the lighting and shadows consistent. The result should be an image of the toy dog in the park scene.* | ![Input](assets/gallery/08_input.png) | ![Result](assets/gallery/08_result.png) |
| *You are given an image of a standing cat. Replace the background with a scene of a cozy living room while keeping the lighting and shadows consistent. The result should be an image of the cat in the living room scene.* | ![Input](assets/gallery/02_input.png) | ![Result](assets/gallery/02_result.png) |
| *You are given an image containing two bottles of cosmetic products illuminated by a soft yellow light. Modify the illumination into a bright pink light to create a more vibrant and attractive appearance. The result should be an image of the cosmetic products with the new illumination.* | ![Input](assets/gallery/16_input.jpg) | ![Result](assets/gallery/16_result.png) |## 📜 Citation
If you find our work helpful, please consider citing our paper:
```bibtex
@article{xue2024comfybench,
title={ComfyBench: Benchmarking LLM-based Agents in ComfyUI for Autonomously Designing Collaborative AI Systems},
author={Xue, Xiangyuan and Lu, Zeyu and Huang, Di and Wang, Zidong and Ouyang, Wanli and Bai, Lei},
journal={arXiv preprint arXiv:2409.01392},
year={2024}
}
```