An open API service indexing awesome lists of open source software.

Projects in Awesome Lists by gpustack

A curated list of projects in awesome lists by gpustack .

https://github.com/gpustack/gpustack

A GPU cluster manager that configures and orchestrates inference engines like vLLM and SGLang for high-performance AI model deployment.

ascend cuda deepseek distributed-inference genai high-performance-inference inference llama llm llm-inference llm-serving maas mindie openai qwen rocm sglang vllm

Last synced: 20 Apr 2026

https://github.com/gpustack/llama-box

LM inference server implementation based on *.cpp.

cpp diffusion gguf openai-compatible-api transformer

Last synced: 05 Apr 2025

https://github.com/gpustack/gguf-parser-go

Review/Check GGUF files and estimate the memory usage and maximum tokens per second.

gguf go llama-box llama-cpp stable-diffusion-cpp

Last synced: 19 Apr 2025

https://github.com/gpustack/vox-box

A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.

asr audio-processing openai-compatible-api python speech-to-text stt text-to-speech tts

Last synced: 07 Apr 2025

https://github.com/gpustack/runtime

Provides a unified interface to detect GPU resources and manages GPU workloads.

Last synced: 08 Apr 2026

https://github.com/gpustack/gguf-packer-go

Deliver LLMs of GGUF format via Dockerfile.

gguf go llama

Last synced: 16 May 2026

https://github.com/gpustack/runner

Last synced: 04 Feb 2026

https://github.com/gpustack/.github

Last synced: 23 Mar 2025

https://github.com/gpustack/community-inference-backends

Community Inference Backends for GPUStack V2

Last synced: 18 Jan 2026