Projects in Awesome Lists tagged with gpu-cluster
A curated list of projects in awesome lists tagged with gpu-cluster .
https://github.com/microsoft/pai
Resource scheduling and cluster management for AI
ai artificial-intelligence chainer cloud cluster-management cluster-manager gpu gpu-cluster gpu-computing gpu-scheduler jupyter kubernetes machine-learning model-training on-premise pytorch resource-management scheduling tensorflow
Last synced: 02 Oct 2025
https://github.com/Microsoft/pai
Resource scheduling and cluster management for AI
ai artificial-intelligence chainer cloud cluster-management cluster-manager gpu gpu-cluster gpu-computing gpu-scheduler jupyter kubernetes machine-learning model-training on-premise pytorch resource-management scheduling tensorflow
Last synced: 20 Apr 2025
https://github.com/lambdalabsml/distributed-training-guide
Best practices & guides on how to write distributed pytorch training code
cluster cuda deepspeed distributed-training fsdp gpu gpu-cluster kuberentes lambdalabs mpi nccl pytorch sharding slurm
Last synced: 16 May 2025
https://github.com/LambdaLabsML/distributed-training-guide
Best practices & guides on how to write distributed pytorch training code
cluster cuda deepspeed distributed-training fsdp gpu gpu-cluster kuberentes lambdalabs mpi nccl pytorch sharding slurm
Last synced: 08 Mar 2025
https://github.com/cy69855522/ai-power
AI动力(AI Power) GPU云平台使用指南
aipower gpu gpu-cloud gpu-cluster
Last synced: 11 Oct 2025
https://github.com/acm-uiuc/gpu-cluster-images
Docker Images for the GPU Cluster
ai deep-learning docker docker-image gpu-cluster nvidia nvidia-docker
Last synced: 08 May 2026
https://github.com/acm-uiuc/gpu-cluster-backend
backend deep-learning docker gpu gpu-cluster nvidia nvidia-docker
Last synced: 18 Apr 2026
https://github.com/acm-uiuc/gpu-cluster
Root repo for ACM GPU Cluster
ai deep-learning gpu gpu-cluster nvidia
Last synced: 01 Sep 2025
https://github.com/saravanabalagi/mask-gpu
A simple tool to expose only specified number of GPUs with desired memory to Tensorflow
expose-gpu gpu-cluster gpu-management mask-gpu tensorflow-gpu
Last synced: 31 Oct 2025
https://github.com/datavorous/cluster-setup
A clean minimal boilerplate to setup an test bed for experiments.
boilerplate-template gpu-cluster
Last synced: 22 May 2026
https://github.com/acm-uiuc/gpu-cluster-frontend
ai deep-learning frontend gpu gpu-cluster nvidia react
Last synced: 10 May 2025
https://github.com/acm-uiuc/gpu-image-template
Template for custom docker files on the gpu cluster
docker docker-image gpu-cluster nvidia nvidia-docker
Last synced: 05 Dec 2025
https://github.com/bjornmelin/mlops-toolkit
🛠️ Enterprise ML infrastructure and deployment tools. Comprehensive suite for ML lifecycle management with focus on GPU cluster optimization. 📊
deployment dvc gpu-cluster kubeflow mlflow mlops model-registry monitoring python
Last synced: 16 Aug 2025