An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with distributed-deep-learning

A curated list of projects in awesome lists tagged with distributed-deep-learning .

https://github.com/intel/bigdl

BigDL: Distributed TensorFlow, Keras and PyTorch on Apache Spark/Flink & Ray

analytics-zoo apache-spark bigdl deep-neural-network distributed-deep-learning keras-tensorflow python pytorch scala

Last synced: 14 May 2025

https://github.com/zoranzhao/deepthings

A Portable C Library for Distributed CNN Inference on IoT Edge Clusters

deep-neural-networks distributed-deep-learning edge-computing internet-of-things iot-edge-clusters

Last synced: 22 Sep 2025

https://github.com/ParCIS/Chimera

Chimera: Efficiently Training Large-Scale Neural Networks with Bidirectional Pipelines.

distributed-deep-learning pipeline-parallelism transformers

Last synced: 13 Apr 2025

https://github.com/shigangli/eager-sgd

Eager-SGD is a decentralized asynchronous SGD. It utilizes novel partial collectives operations to accumulate the gradients across all the processes.

distributed-deep-learning gradient-averaging partial-allreduce

Last synced: 13 Aug 2025

https://github.com/shigangli/wagma-sgd

WAGMA-SGD is a decentralized asynchronous SGD based on wait-avoiding group model averaging. The synchronization is relaxed by making the collectives externally-triggerable, namely, a collective can be initiated without requiring that all the processes enter it. It partially reduces the data within non-overlapping groups of process, improving the parallel scalability.

distributed-deep-learning model-averaging partial-allreduce

Last synced: 13 Aug 2025

https://github.com/stefanofioravanzo/distributed-deeplearning-kubernetes

Collection of resources for automatic deployment of distributed deep learning jobs on a Kubernetes cluster

azure-kubernetes-service distributed-deep-learning kubernetes-operator mxnet tensorflow

Last synced: 19 Apr 2026

https://github.com/pierric/mnist-caffe-mpi

mnist, using caffe and openmpi

caffe distributed-deep-learning mnist openmpi

Last synced: 17 Oct 2025

https://github.com/hyunnnchoi/google-t5-fsdp-kubeflow

A foundational repository for setting up distributed training jobs using Kubeflow and PyTorch FSDP.

distributed-deep-learning fsdp kubeflow pytorch

Last synced: 26 Apr 2026