https://github.com/jpmcb/ollama-kubernetes-operator

🦙 A Kubernetes operator for Ollama
https://github.com/jpmcb/ollama-kubernetes-operator

kubernetes kubernetes-deployment llama llm ollama rust

Last synced: 9 months ago
JSON representation

🦙 A Kubernetes operator for Ollama

Host: GitHub
URL: https://github.com/jpmcb/ollama-kubernetes-operator
Owner: jpmcb
Created: 2024-04-20T15:54:49.000Z (about 2 years ago)
Default Branch: main
Last Pushed: 2024-04-20T23:36:31.000Z (about 2 years ago)
Last Synced: 2025-09-16T00:39:20.457Z (9 months ago)
Topics: kubernetes, kubernetes-deployment, llama, llm, ollama, rust
Language: Rust
Homepage:
Size: 14.6 KB
Stars: 5
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# 🦙 Ollama-Kubernetes-Operator

A Kubernetes operator to simplify deploying Ollama as GPU workloads to the right nodes.

_A current work in progress._

### Why?

There's no definitive way to deploy Ollama onto a subset of nodes that support
fast, GPU workloads. You can piece-meal it together, but you end up with alot of
tainted deployments with a mess of labels that's difficult to maintain.
The aim of the Ollama-Kubernetes-Operator is to enable Kubernetes practitioners
to easily deploy open source LLM technologies
as a service to their clusters with as minimal maintenance as possible.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/jpmcb/ollama-kubernetes-operator

Awesome Lists containing this project

README