Projects in Awesome Lists by gpustack
A curated list of projects in awesome lists by gpustack .
https://github.com/gpustack/gpustack
A GPU cluster manager that configures and orchestrates inference engines like vLLM and SGLang for high-performance AI model deployment.
ascend cuda deepseek distributed-inference genai high-performance-inference inference llama llm llm-inference llm-serving maas mindie openai qwen rocm sglang vllm
Last synced: 20 Apr 2026
https://github.com/gpustack/llama-box
LM inference server implementation based on *.cpp.
cpp diffusion gguf openai-compatible-api transformer
Last synced: 05 Apr 2025
https://github.com/gpustack/gguf-parser-go
Review/Check GGUF files and estimate the memory usage and maximum tokens per second.
gguf go llama-box llama-cpp stable-diffusion-cpp
Last synced: 19 Apr 2025
https://github.com/gpustack/vox-box
A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.
asr audio-processing openai-compatible-api python speech-to-text stt text-to-speech tts
Last synced: 07 Apr 2025
https://github.com/gpustack/runtime
Provides a unified interface to detect GPU resources and manages GPU workloads.
Last synced: 08 Apr 2026
https://github.com/gpustack/gguf-packer-go
Deliver LLMs of GGUF format via Dockerfile.
Last synced: 16 May 2026
https://github.com/gpustack/community-inference-backends
Community Inference Backends for GPUStack V2
Last synced: 18 Jan 2026