https://github.com/llmariner/llmariner
Extensible generative AI platform on Kubernetes with OpenAI-compatible APIs.
ai autoscaling fine-tuning gpu inference k8s kubernetes llm ml multi-cluster openai operator vllm
- Host: GitHub
- URL: https://github.com/llmariner/llmariner
- Owner: llmariner
- License: apache-2.0
- Created: 2024-03-27T15:28:53.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2025-04-09T23:47:27.000Z (6 months ago)
- Last Synced: 2025-04-10T00:30:11.667Z (6 months ago)
- Topics: ai, autoscaling, fine-tuning, gpu, inference, k8s, kubernetes, llm, ml, multi-cluster, openai, operator, vllm
- Language: Go
- Homepage: https://llmariner.ai
- Size: 7.84 MB
- Stars: 65
- Watchers: 2
- Forks: 7
- Open Issues: 8
Metadata Files:
- Readme: README.md
- License: LICENSE
README
Documentation | Community Slack

---
LLMariner (= LLM + Mariner) is an extensible open source platform to simplify the management of generative AI workloads. Built on Kubernetes, it enables you to efficiently handle both training and inference data within your own clusters. With [OpenAI-compatible APIs](https://platform.openai.com/docs/api-reference), LLMariner leverages an ecosystem of tools, facilitating seamless integration for a wide range of AI-driven applications.
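Because the APIs are OpenAI-compatible, any OpenAI client can talk to an LLMariner deployment simply by pointing it at the deployment's base URL. Below is a minimal sketch using only the Python standard library; the endpoint address, API key, and model name are placeholders for illustration, not values taken from this repository:

```python
import json
import urllib.request

BASE_URL = "http://localhost:8080/v1"  # placeholder; use your deployment's address
API_KEY = "sk-placeholder"             # placeholder; use a key issued by your deployment


def build_chat_request(model: str, prompt: str) -> urllib.request.Request:
    """Build an OpenAI-compatible request for POST /v1/chat/completions."""
    body = json.dumps({
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }).encode()
    return urllib.request.Request(
        f"{BASE_URL}/chat/completions",
        data=body,
        headers={
            "Authorization": f"Bearer {API_KEY}",
            "Content-Type": "application/json",
        },
    )


# "google-gemma-2b-it" is a hypothetical model name for illustration.
req = build_chat_request("google-gemma-2b-it", "What is Kubernetes?")
print(req.full_url)
```

Sending the request with `urllib.request.urlopen(req)` (or using the official `openai` SDK with `base_url=BASE_URL`) would return a standard OpenAI-style chat completion response.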
## Architecture
LLMariner consists of a control-plane and one or more worker-planes. Both components can operate within a single cluster, but if you want to utilize GPU resources across multiple clusters, they can also be installed into separate clusters:
- Control-Plane components:
- Expose the OpenAI-compatible APIs, receive requests from clients, and manage the overall state of LLMariner.
- Worker-Plane components:
- Run in every worker cluster and process tasks, using compute resources such as GPUs, in response to requests from the control-plane.
Refer to the [High-Level Architecture](https://llmariner.ai/docs/overview/how-works/) and [Technical Details](https://llmariner.ai/docs/dev/architecture/) documents for more information.
## Installation
Check out our [installation guide](https://llmariner.ai/docs/setup/install/), which covers several deployment methods, from setting up a test environment with a kind cluster to building a production-ready environment.
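For a quick local test, the kind-based route in the guide starts from an ordinary multi-node cluster. The fragment below is a generic kind configuration sketch, not taken from the installation guide; note that GPU scheduling in kind requires additional setup:

```yaml
# kind-config.yaml: a minimal two-node cluster for trying LLMariner locally.
kind: Cluster
apiVersion: kind.x-k8s.io/v1alpha4
nodes:
  - role: control-plane
  - role: worker
```

Create the cluster with `kind create cluster --config kind-config.yaml`, then follow the guide's Helm-based installation steps.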
## Integration
LLMariner provides OpenAI-compatible APIs, making it easy to integrate with powerful tools such as assistant web UIs, code generation tools, and more. Here are some integration samples:
- **Open WebUI**: A self-hosted web UI that works with OpenAI-compatible APIs. See [Open WebUI](https://llmariner.ai/docs/integration/openwebui/) integration guide for details.
- **Continue**: An open-source AI code assistant inside of VS Code and JetBrains. See [Continue](https://llmariner.ai/docs/integration/continue/) integration guide for details.
- **Weights & Biases (W&B)**: An AI developer platform that lets you track the progress of your fine-tuning jobs, such as training epochs and loss. See the [W&B](https://llmariner.ai/docs/integration/wandb/) integration guide for details.
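As one concrete example of the pattern these integrations share, an OpenAI-compatible tool typically only needs a provider entry pointing at the LLMariner endpoint. The fragment below is a hypothetical sketch of a Continue `config.json` model entry; the model name, endpoint, and key are placeholders, and the linked guide has the authoritative values:

```json
{
  "models": [
    {
      "title": "LLMariner",
      "provider": "openai",
      "model": "google-gemma-2b-it",
      "apiBase": "http://localhost:8080/v1",
      "apiKey": "<your-llmariner-api-key>"
    }
  ]
}
```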
## Directory Structure
- `cli`: CLI
- `deployments`: Helm chart
- `integration-examples`: Examples of integrating other services with LLMariner
- `provision`: Provisioning scripts
- `tutorials`: Tutorials
## Talks
- [Transform Your Kubernetes Cluster Into a GenAI Platform: Get Ready-to-Use LLM APIs Today! - Cloud Native & Kubernetes AI Day 2024 North America](https://sched.co/1izue)
## Contributing
See [Contributing Guide](CONTRIBUTING.md).
## License
See [LICENSE](LICENSE).