https://github.com/agentic-layer/workshop-infra

Infrastructure for our Conference Workshops ("Architecting and Building a K8s-based AI Platform")

Host: GitHub
URL: https://github.com/agentic-layer/workshop-infra
Owner: agentic-layer
License: apache-2.0
Created: 2025-10-27T12:40:50.000Z (8 months ago)
Default Branch: main
Last Pushed: 2026-05-17T14:45:46.000Z (27 days ago)
Last Synced: 2026-05-17T16:49:37.466Z (26 days ago)
Language: Shell
Size: 226 KB
Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE


# workshop-infra
Infrastructure for our Conference Workshops ("Architecting and Building a K8s-based AI Platform")

Based on the Makefile from https://github.com/lreimer/k8s-native-iac.

## Layout

Flux Kustomizations under `foundation/host-cluster/` are organized along
the conceptual planes used in the [agentic-layer
docs](https://docs.agentic-layer.ai/) and the workshop step folders:

| Path | Contents |
|---|---|
| `infrastructure/` | Kubernetes prerequisites: cert-manager, Gateway API CRDs |
| `observability-controllers/` | OpenTelemetry operator, Prometheus CRDs |
| `observability/` | LGTM stack (Loki, Grafana, Tempo, Mimir), OTel collector, observability-dashboard |
| `platform-operators/` | The four agentic-layer operators (agent-runtime, agent-gateway-krakend, ai-gateway-litellm, tool-gateway-agentgateway) |
| `platform-gateways/` | Gateway *instances* (Agent Gateway, AI Gateway, Tool Gateway) — Custom Resources reconciled by the operators above |
| `user-serving-plane/` | LibreChat, the chat UI that participants point at the Agent Gateway |
| `quality-plane/` | testkube + testbench-operator: evaluating agents with Experiments |

Dependencies are wired so a cold cluster bootstraps cleanly:
`infrastructure → {observability-controllers, platform-operators} → {observability, platform-gateways} → {user-serving-plane, quality-plane}`.
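The ordering above can be sanity-checked with a topological sort. This is a minimal sketch, not the repo's actual wiring: it assumes every entry in a group depends on every entry in the group before it, and `bootstrap_order` is a hypothetical helper name.

```bash
#!/usr/bin/env bash
# Print one valid bootstrap order for the Kustomization layers.
# ASSUMPTION: each group depends on every entry of the previous group;
# the real dependsOn wiring in the repo may be narrower.
# Each input line is "<dependency> <dependent>".
bootstrap_order() {
  tsort <<'EOF'
infrastructure observability-controllers
infrastructure platform-operators
observability-controllers observability
observability-controllers platform-gateways
platform-operators observability
platform-operators platform-gateways
observability user-serving-plane
observability quality-plane
platform-gateways user-serving-plane
platform-gateways quality-plane
EOF
}

bootstrap_order
```

`infrastructure` always comes first; the order within a group is not significant. On a live cluster, `flux get kustomizations` shows whether each layer has actually become ready.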

## Prerequisites

Requires the following tools:
- kubectl
- gcloud CLI
- flux CLI
- **vcluster** CLI
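
A quick way to confirm the tools are installed before running any `make` target is sketched below. `missing_tools` is a hypothetical helper, and the binary names in the usage comment are assumed to match the CLIs listed above.

```bash
#!/usr/bin/env bash
# Print every required tool that is not on PATH.
# Returns non-zero if at least one tool is missing.
missing_tools() {
  local tool missing=0
  for tool in "$@"; do
    if ! command -v "$tool" >/dev/null 2>&1; then
      printf 'missing: %s\n' "$tool"
      missing=1
    fi
  done
  return "$missing"
}

# Usage (binary names are assumptions based on the list above):
#   missing_tools kubectl gcloud flux vcluster
```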

## Setup

### 1. Create the host cluster and set up GitOps
```bash
make prepare-cluster
make create-cluster
make bootstrap-flux
```

### 2. (Optional) Reconfigure vClusters

Edit the `clustersToCreate` variable in `generate-overlays.sh` to change the number of vClusters, then regenerate the overlays and check in the changes:
```bash
make generate-vcluster-configs
git add infrastructure/vcluster/overlays/*
...
```

vCluster configuration can be changed later in `infrastructure/vcluster/base/vcluster.yaml`.

Note that changing the configuration might require the vClusters to be recreated, potentially breaking any credentials.
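
Because the cluster is set up for GitOps in step 1, regenerated overlays presumably only take effect once they are committed and pushed. A small sketch for spotting overlay files that are still uncommitted follows; `pending_overlays` is a hypothetical helper, and the default path is taken from the `git add` command above.

```bash
#!/usr/bin/env bash
# List generated overlay files that are modified or untracked.
# Empty output means everything under the overlays directory is committed.
pending_overlays() {
  git status --porcelain -- "${1:-infrastructure/vcluster/overlays}"
}

# Usage, from the repository root:
#   make generate-vcluster-configs && pending_overlays
```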

### 3. Set up env vars, secrets, and kubeconfigs

- Copy `.env.example` to `.env`
- Configure environment variables based on entries in the [Google Secret Manager](https://console.cloud.google.com/security/secret-manager?project=agentic-layer-workshop)
- `source .env`
- Create secrets in the cluster
```bash
make secrets
```
- Generate and encrypt the shared participant kubeconfig (uses
`WORKSHOP_PASSWORD` from `.env`)
```bash
make kubeconfigs
```
- Copy `kubeconfigs-encrypted/workshop-kubeconfig.yaml.enc` to
`github.com/agentic-layer/workshop/kubeconfigs/` and commit. Keep
`workshop-admin-kubeconfig.yaml.enc` locally as a backup; don't
ship it to participants.
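
Since `make kubeconfigs` relies on `WORKSHOP_PASSWORD`, it can help to fail early when a variable is missing after `source .env`. The following is a minimal sketch; `require_env` is a hypothetical helper, and `WORKSHOP_PASSWORD` is the only variable name taken from this README (the other entries in `.env.example` are not listed here).

```bash
#!/usr/bin/env bash
# Report each named variable that is unset or empty.
# Returns non-zero if any of them is missing.
require_env() {
  local name rc=0
  for name in "$@"; do
    # ${!name} is bash indirect expansion: the value of the variable called $name.
    if [ -z "${!name:-}" ]; then
      printf 'unset or empty: %s\n' "$name" >&2
      rc=1
    fi
  done
  return "$rc"
}

# Usage:
#   source .env && require_env WORKSHOP_PASSWORD && make kubeconfigs
```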

### 4. Model Serving with Ollama

```bash
# llama3.1 model deployment via CRD
kubectl apply -f foundation/mother-vcluster-europe-west1/ollama-operator/ollama-model-llama31.yaml
kollama expose llama3.1 --service-name=ollama-model-llama31 --namespace ollama-operator-system
kollama expose llama3.1 --service-name=ollama-model-llama31-lb --service-type LoadBalancer --namespace ollama-operator-system

# to start a chat with ollama
# replace localhost with the actual LoadBalancer IP
OLLAMA_HOST=localhost:11434 ollama run llama3.1

# call Ollama's native chat API (/api/chat); the commented-out URL is
# the OpenAI-compatible endpoint
# note: the service was exposed in ollama-operator-system above, so the
# ".default" namespace suffix may need adjusting
# curl http://ollama-model-llama31.default:11434/v1/chat/completions
curl http://ollama-model-llama31.default:11434/api/chat \
  -H "Content-Type: application/json" \
  -d '{
    "model": "llama3.1",
    "messages": [
      {
        "role": "user",
        "content": "Say this is a test!"
      }
    ]
  }'
```
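
Ollama's `/api/chat` endpoint streams its answer as a sequence of JSON objects unless the request sets `"stream": false`. The sketch below builds a non-streaming request body for scripted calls. `chat_payload` is a hypothetical helper; it escapes only backslashes and double quotes, so prompts containing newlines or control characters are out of scope.

```bash
#!/usr/bin/env bash
# Build a non-streaming /api/chat request body for a single user message.
# Only backslashes and double quotes in the prompt are escaped.
chat_payload() {
  local model="$1" prompt="$2"
  prompt=${prompt//\\/\\\\}   # \  ->  \\
  prompt=${prompt//\"/\\\"}   # "  ->  \"
  printf '{"model": "%s", "stream": false, "messages": [{"role": "user", "content": "%s"}]}\n' \
    "$model" "$prompt"
}

# Usage (host name as in the curl example above):
#   curl -s http://ollama-model-llama31.default:11434/api/chat \
#     -H "Content-Type: application/json" \
#     -d "$(chat_payload llama3.1 'Say this is a test!')"
```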

---

## Connect

Participants share a single read-only kubeconfig; `edit` rights are
granted only within each participant's claimed `ns-XX` namespace via a
RoleBinding. The kubeconfig is distributed as an encrypted file in
`github.com/agentic-layer/workshop/kubeconfigs/`.

```bash
./decrypt-kubeconfig.sh kubeconfigs/workshop-kubeconfig.yaml.enc kubeconfig.yaml
export KUBECONFIG=kubeconfig.yaml
kubectl get namespaces
```

For admin access, decrypt the `workshop-admin-kubeconfig.yaml.enc`
backup (kept locally, not shipped to participants) the same way.
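
To verify that the decrypted kubeconfig behaves as described, `kubectl auth can-i` can probe the permissions. This is a sketch only: `check_access` is a hypothetical helper, `ns-01` in the usage comment is a placeholder for the claimed namespace, and `kube-system` is used merely as an example of a namespace participants should not be able to edit.

```bash
#!/usr/bin/env bash
# Probe the current kubeconfig: edit rights are expected inside the
# claimed namespace and nowhere else.
check_access() {
  local ns="$1"
  printf 'create pods in %s: ' "$ns"
  # can-i exits non-zero when the answer is "no", hence "|| true".
  kubectl auth can-i create pods --namespace "$ns" || true
  printf 'create pods in kube-system: '
  kubectl auth can-i create pods --namespace kube-system || true
}

# Usage (ns-01 is a placeholder):
#   export KUBECONFIG=kubeconfig.yaml
#   check_access ns-01
```

If the RoleBinding is in place, the first probe should answer `yes` and the second `no`.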
