An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with gpu-cluster

A curated list of projects in awesome lists tagged with gpu-cluster .

https://github.com/lambdalabsml/distributed-training-guide

Best practices & guides on how to write distributed pytorch training code

cluster cuda deepspeed distributed-training fsdp gpu gpu-cluster kuberentes lambdalabs mpi nccl pytorch sharding slurm

Last synced: 16 May 2025

https://github.com/LambdaLabsML/distributed-training-guide

Best practices & guides on how to write distributed pytorch training code

cluster cuda deepspeed distributed-training fsdp gpu gpu-cluster kuberentes lambdalabs mpi nccl pytorch sharding slurm

Last synced: 08 Mar 2025

https://github.com/cy69855522/ai-power

AI动力(AI Power) GPU云平台使用指南

aipower gpu gpu-cloud gpu-cluster

Last synced: 11 Oct 2025

https://github.com/acm-uiuc/gpu-cluster

Root repo for ACM GPU Cluster

ai deep-learning gpu gpu-cluster nvidia

Last synced: 01 Sep 2025

https://github.com/saravanabalagi/mask-gpu

A simple tool to expose only specified number of GPUs with desired memory to Tensorflow

expose-gpu gpu-cluster gpu-management mask-gpu tensorflow-gpu

Last synced: 31 Oct 2025

https://github.com/datavorous/cluster-setup

A clean minimal boilerplate to setup an test bed for experiments.

boilerplate-template gpu-cluster

Last synced: 22 May 2026

https://github.com/acm-uiuc/gpu-image-template

Template for custom docker files on the gpu cluster

docker docker-image gpu-cluster nvidia nvidia-docker

Last synced: 05 Dec 2025

https://github.com/bjornmelin/mlops-toolkit

🛠️ Enterprise ML infrastructure and deployment tools. Comprehensive suite for ML lifecycle management with focus on GPU cluster optimization. 📊

deployment dvc gpu-cluster kubeflow mlflow mlops model-registry monitoring python

Last synced: 16 Aug 2025