https://github.com/curt-park/comfyui-onprem-k8s

ComfyUI Service on On-Premise Kubernetes Cluster
https://github.com/curt-park/comfyui-onprem-k8s

comfyui gpu-shareable jupyterhub kubernetes minikube

Last synced: 6 months ago
JSON representation

ComfyUI Service on On-Premise Kubernetes Cluster

Host: GitHub
URL: https://github.com/curt-park/comfyui-onprem-k8s
Owner: Curt-Park
License: apache-2.0
Created: 2024-07-31T23:38:28.000Z (about 1 year ago)
Default Branch: main
Last Pushed: 2024-08-29T07:15:53.000Z (about 1 year ago)
Last Synced: 2024-08-29T08:45:24.070Z (about 1 year ago)
Topics: comfyui, gpu-shareable, jupyterhub, kubernetes, minikube
Language: Smarty
Homepage:
Size: 485 KB
Stars: 8
Watchers: 2
Forks: 1
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

          # ComfyUI on an On-Premise Kubernetes Cluster

This project aims to provide two different types of ComfyUI services on Kubernetes:

1. API Service for Production

2. Interactive Service for Designers

## Features

- [x] [Common] Minikube Cluster for development

- [x] [Common] GPU Sharing (Time Slicing)

- [x] [Common] Ingress Route

- [ ] [Common] Monitoring (Dashboard for Metric + Logging)

- [x] [ComfyUI API Service] Cookie-based Session Stickiness with Timeout

- [x] [ComfyUI API Service] Python Test Script for Image Generation

- [ ] [ComfyUI API Service] Horizontal Pod Autoscaling

- [ ] [ComfyUI API Service] Efficient Routing (Max Use of ComfyUI Cache)

- [x] [ComfyUI Interactive Service] Authentication and Authorization

- [x] [ComfyUI Interactive Service] Custom Docker Image for JupyterHub SingleUser (+ComfyUI)

- [x] [ComfyUI Interactive Service] ComfyUI Extension for Jupyter Server Proxy 

- [x] [ComfyUI Interactive Service] Profiles for GPU Env / CPU only Env

- [x] [ComfyUI Interactive Service] Persistent Volume for ComfyUI User Data

- [x] [ComfyUI Interactive Service] Evicting inactive users

## Prerequisites

- Install [Docker](https://docs.docker.com/engine/install/) & [NVIDIA-Container-Toolkit](https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/latest/install-guide.html)

- Install [Kubectl](https://kubernetes.io/docs/tasks/tools/)

- Install [Minikube](https://minikube.sigs.k8s.io/docs/start)

- Install [HELM](https://helm.sh/docs/intro/install/)

## Cluster Setup

```bash

MODEL_PATH=PATH_TO_MODELS make cluster

# Enable GPU Time-Slicing for multiple replicas.

helm install nvidia-device-plugin charts/nvidia-device-plugin -n kube-system

# Install Ingress.

kubectl create namespace ingress

helm install traefik charts/traefik -n ingress

# Volumes.

kubectl apply -f volumes/minikube.yaml  # for minikube env.

```

## ComfyUI Interactive Service (w/ JupyterHub)

Build ComfyUI (Optional)

```bash

eval $(minikube docker-env)

make docker-build

make docker-build-jupyter

make docker-push-jupyter

make docker-run-jupyter  # for testing

```

```bash

helm install jupyterhub charts/jupyterhub

```

- login with `id: admin / pw: admin123!@#`



## ComfyUI API Service

This is an on-premise Kubernetes cluster version inspired by [comfyui on EKS](https://github.com/aws-samples/comfyui-on-eks).

Build ComfyUI (Optional)

```bash

make docker-build

make docker-push

make docker-run  # for testing

```

```bash

helm install comfyui charts/comfyui

```

- Create a connection for testing in minikube: `make tunnel`

- open http://localhost/comfyui/

- `cd test`

- `python main.py -s http://host-address/comfyui`

Result:

```bash

Generation started.

Generation not ready, sleep 1s ...

Generation not ready, sleep 1s ...

Generation finished.

Inference finished.

ClientID: 80e564e7-48ef-44cd-aae0-2a18fa091deb.

PromptID: 8882b193-d7d4-42e4-8003-7cdfe51a465d.

Num of images: 1.

Time spent: 2.17s.

------

```

## References

### GPU Sharing on K8S

- https://github.com/NVIDIA/k8s-device-plugin?tab=readme-ov-file#with-cuda-time-slicing

- https://nyyang.tistory.com/198

### ComfyUI + K8S

- https://github.com/aws-samples/comfyui-on-eks

### Ingress

- https://doc.traefik.io/traefik/routing/services/?ref=traefik.io#sticky-sessions

- https://doc.traefik.io/traefik/routing/providers/kubernetes-ingress/

### JupyterHub

- https://z2jh.jupyter.org/en/stable/

- https://z2jh.jupyter.org/en/latest/administrator/services.html

### Jupyter Server Proxy

- https://jupyter-server-proxy.readthedocs.io/en/latest/server-process.html

- https://github.com/jupyterhub/jupyter-rsession-proxy/

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/curt-park/comfyui-onprem-k8s

Awesome Lists containing this project

README