An open API service indexing awesome lists of open source software.

https://github.com/sozercan/k8s-distributed-inference

🦄 Distributed Inference on Kubernetes with DRA and MIG
https://github.com/sozercan/k8s-distributed-inference

dra dynamic-resource-allocation inference kubernetes llm multi-instance-gpu nvidia

Last synced: 6 months ago
JSON representation

🦄 Distributed Inference on Kubernetes with DRA and MIG

Awesome Lists containing this project

README

          

# k8s-distributed-inference

1. [Set up host](provision-host.md)
2. [Deploy Local AI](deploy-local-ai.md)