Projects in Awesome Lists tagged with multi-gpu-inference
A curated list of projects in awesome lists tagged with multi-gpu-inference.
https://github.com/inferflow/inferflow
Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).
baichuan2 bloom deepseek falcon gemma internlm llama2 llamacpp llm-inference m2m100 minicpm mistral mixtral mixture-of-experts model-quantization moe multi-gpu-inference phi-2 qwen
Last synced: 07 Apr 2025
https://github.com/jingyunliang/pytorch-multi-gpu-multi-process-testing
A script for PyTorch multi-GPU multi-process testing.
multi-gpu-inference multi-gpu-multi-process multi-gpu-testing pytorch
Last synced: 09 Sep 2025