Projects in Awesome Lists tagged with multi-gpu
A curated list of projects in awesome lists tagged with multi-gpu.
https://github.com/confettifx/the-forge
The Forge Cross-Platform Framework PC Windows, Steamdeck (native), Ray Tracing, macOS / iOS, Android, XBOX, PS4, PS5, Switch, Quest 2
android directx directx12 ios linux linux-ubuntu macos metal multi-gpu multi-threading ps4 ps5 ray-tracing shader-translator shaders visibility-buffer vulkan vulkan-api vulkan-sdk xbox
Last synced: 13 May 2025
https://github.com/ConfettiFX/The-Forge
The Forge Cross-Platform Framework PC Windows, Steamdeck (native), Ray Tracing, macOS / iOS, Android, XBOX, PS4, PS5, Switch, Quest 2
android directx directx12 ios linux linux-ubuntu macos metal multi-gpu multi-threading ps4 ps5 ray-tracing shader-translator shaders visibility-buffer vulkan vulkan-api vulkan-sdk xbox
Last synced: 15 Mar 2025
https://github.com/NVIDIA/OpenSeq2Seq
Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
deep-learning float16 language-model mixed-precision multi-gpu multi-node neural-machine-translation seq2seq sequence-to-sequence speech-recognition speech-synthesis speech-to-text tensorflow text-to-speech
Last synced: 19 Jul 2025
https://github.com/nvidia/openseq2seq
Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
deep-learning float16 language-model mixed-precision multi-gpu multi-node neural-machine-translation seq2seq sequence-to-sequence speech-recognition speech-synthesis speech-to-text tensorflow text-to-speech
Last synced: 28 Sep 2025
https://github.com/v-iashin/video_features
Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and TIMM models.
audio-features clip feature-extraction i3d ig65m laion multi-gpu optical-flow parallel pytorch r2plus1d raft resnet s3d swin timm vggish video-features visual-features vit
Last synced: 02 Apr 2025
https://github.com/rbbrdckybk/dream-factory
Multi-threaded GUI manager for mass creation of AI-generated art with support for multiple GPUs.
ai ai-art generative-art image-generation machine-learning multi-gpu multithreaded nvidia-gpu stable-diffusion
Last synced: 24 Mar 2025
https://github.com/seasonsh/docface
Face recognition system for ID photos
biometrics face-recognition face-verification multi-gpu tensorflow
Last synced: 06 Apr 2025
https://github.com/nicklucche/stable-diffusion-nvidia-docker
GPU-ready Dockerfile to run Stability.AI stable-diffusion model v2 with a simple web interface. Includes multi-GPU support.
docker image-generation multi-gpu nvidia-docker stable-diffusion
Last synced: 05 Apr 2025
https://github.com/NickLucche/stable-diffusion-nvidia-docker
GPU-ready Dockerfile to run Stability.AI stable-diffusion model v2 with a simple web interface. Includes multi-GPU support.
docker image-generation multi-gpu nvidia-docker stable-diffusion
Last synced: 12 May 2025
https://github.com/omlins/parallelstencil.jl
Package for writing high-level code for parallel high-performance stencil computations that can be deployed on both GPUs and CPUs
cuda gpu julia multi-gpu multi-xpu parallel staggered-grids stencil stencil-codes xpu
Last synced: 28 Jan 2026
https://github.com/omlins/ParallelStencil.jl
Package for writing high-level code for parallel high-performance stencil computations that can be deployed on both GPUs and CPUs
cuda gpu julia multi-gpu multi-xpu parallel staggered-grids stencil stencil-codes xpu
Last synced: 27 Mar 2025
https://github.com/lattice/quda
QUDA is a library for performing calculations in lattice QCD on GPUs.
c c-plus-plus cuda gpu mpi multi-gpu qcd
Last synced: 15 May 2025
https://github.com/helmholtz-analytics/heat
Distributed tensors and Machine Learning framework with GPU and MPI acceleration in Python
array-api data-analytics data-processing data-science distributed gpu hpc machine-learning massive-datasets mpi mpi4py multi-gpu multi-node-cluster numpy parallelism python pytorch tensors
Last synced: 15 May 2025
https://github.com/tamerthamoqa/facenet-pytorch-glint360k
A PyTorch implementation of the 'FaceNet' paper for training a facial recognition model with Triplet Loss using the glint360k dataset. A pre-trained model using Triplet Loss is available for download.
face-recognition facenet lfw-dataset multi-gpu pretrained-model pytorch triplet-loss vggface2-dataset
Last synced: 02 Feb 2026
https://github.com/bharatsingh430/py-r-fcn-multigpu
Code for training py-faster-rcnn and py-R-FCN on multiple GPUs in caffe
faster-rcnn multi-gpu object-detection
Last synced: 07 May 2025
https://github.com/papuSpartan/stable-diffusion-webui-distributed
Chains stable-diffusion-webui instances together to facilitate faster image generation.
automatic1111 distributed-computing multi-gpu stable-diffusion stable-diffusion-webui stable-diffusion-webui-plugin
Last synced: 28 Mar 2025
https://github.com/eth-cscs/implicitglobalgrid.jl
Almost trivial distributed parallelization of stencil-based GPU and CPU applications on a regular staggered grid
cuda distributed gpu julia julia-mpi-wrapper mpi multi-gpu staggered-grids stencil-codes
Last synced: 04 Apr 2025
https://github.com/projectchrono/dem-engine
A dual-GPU DEM solver with complex grain geometry support
chrono cuda discrete-element-method gpu multi-gpu simulation
Last synced: 06 Apr 2025
https://github.com/rickiepark/deep-learning-with-python-2nd
Code repository for the book "Deep Learning with Python, 2nd Edition" (Korean edition: 케라스 창시자에게 배우는 딥러닝 2판)
cnn deep-learning gan image-augmentation image-classification image-segmentation image-style-transfer keras keras-tuner machine-translation mixed-precision multi-gpu neural-network rnn tensorflow text-classification text-generation time-series tpu transformer
Last synced: 10 Apr 2025
https://github.com/andreped/gradientaccumulator
Accumulated Gradients for TensorFlow 2
accumulated-batch-normalization accumulated-gradients adaptive-gradient-clipping batch-size deep-learning distributed-training float16 gpu gradient-accumulation hacktoberfest huggingface keras memory-constraints mixed-precision multi-gpu tensorflow tensorflow2 tf2 tpu
Last synced: 13 Apr 2025
https://github.com/shamrock-code/shamrock
The Shamrock Framework, an open-source, multi-GPU hydrodynamics framework for astrophysics. Scales seamlessly from laptops to exascale supercomputers, supporting SPH, AMR, and more.
adaptivecpp amr astrophysics fluid-dynamics fluid-simulation-engine mpi multi-gpu oneapi phantom ramses sph sycl zeus
Last synced: 21 Oct 2025
https://github.com/kuixu/keras_multi_gpu
Multi-GPU training for Keras
data-parallelism keras multi-gpu
Last synced: 29 Oct 2025
https://github.com/lupantech/dual-mfa-vqa
Co-attending Regions and Detections for VQA.
aaai attention-mechanism caffe faster-rcnn multi-gpu multi-modal object-detection torch visual-question-answering vqa
Last synced: 13 Oct 2025
https://github.com/miguelcarcamov/gpuvmem
GPU Framework for Radio Astronomical Image Synthesis
alma astronomical-algorithms astronomical-images astrophysics complex-systems cuda gpu gpu-acceleration gpu-computing image-synthesis maximum-entropy multi-gpu optimization-methods radio-imaging radio-interferometry radioastronomy ska vla
Last synced: 14 Apr 2025
https://github.com/dmarnerides/dlt
Deep Learning Toolbox for Torch
deep-neural-networks hpc-facilities multi-gpu slurm-job toolbox torch
Last synced: 04 Jan 2026
https://github.com/18520339/ml-distributed-training
Reduce the training time of CNNs by leveraging multiple GPUs with two approaches, Multi-Worker and Parameter Server training, using TensorFlow 2
distributed distributed-tensorflow distributed-training multi-gpu multi-workers parameter-server tensorflow
Last synced: 15 Apr 2025
https://github.com/previsionio/damavand
Damavand is a quantum circuit simulator. It can run on laptops or High Performance Computing architectures, such as distributed CPU architectures or distributed multi-GPU architectures.
cuda distributed-computing hpc multi-gpu multithreading quantum-computing rust simulator
Last synced: 19 Apr 2025
https://github.com/lebedov/cudamps
Python interface to CUDA Multi-Process Service
Last synced: 05 May 2025
https://github.com/zabir-nabil/darknet-multi-gpu-parallel
Running multiple Darknet models in parallel in a multi-GPU setup
darknet multi-gpu parallel-processing yolo
Last synced: 26 Jul 2025
https://github.com/visionscaper/stateful_multi_gpu
Experimental utility to build stateful RNN models for multi GPU training.
gru keras keras-models keras-tensorflow lstm multi-gpu rnn
Last synced: 22 Jul 2025
https://github.com/zjcv/facenet
[CVPR 2015] FaceNet: A Unified Embedding for Face Recognition and Clustering
facenet mixed-precision-training multi-gpu pytorch zcls
Last synced: 15 Apr 2025
https://github.com/madcato/pytorch-word2vec
word2vec implementation using PyTorch
Last synced: 23 Oct 2025
https://github.com/arthurvasseur/glgpuselect
GLGpuSelect is a cross-platform drop-in replacement for opengl32.dll on Windows and libGL.so on Linux that enables per-application GPU selection
egl gpu gpu-affinity gpu-association gpu-selection multi-gpu opengl opengl-context wgl
Last synced: 13 Jul 2025
https://github.com/raheel-baksh/glint
Glint is a Rust framework designed for creating stateful, graph-based AI systems, enabling efficient multi-step workflows. With features like LLM integration and a graph-based architecture, Glint helps developers build powerful AI solutions with ease. 🐙✨
commitlint face-recognition facenet git go hacktoberfest lfw-dataset multi-gpu opengl pretrained-model pytorch storybook styled-components triplet-loss vggface2-dataset webpack4
Last synced: 23 Jul 2025
https://github.com/farrajota/multi-gpu-torchnet
Train an object classifier using multiple GPUs in Torch7
dbcollection multi-gpu torch7 torchnet
Last synced: 04 Oct 2025
https://github.com/olk/mnist-performance
Performance test of MNIST handwriting using MXNet + TF
classification gluon horovod keras mirrored-strategy mnist model-parallelism multi-gpu multi-gpu-training mxnet python tensorflow
Last synced: 26 Mar 2025
https://github.com/neuraladitya/neural_network_c
Neural Network C is an advanced neural network implementation in pure C, optimized for high performance on CPUs and NVIDIA GPUs.
artificial-intelligence bayesian-optimization c-programming convolutional-neural-networks cuda deep-learning encryption gpu-computing high-performance-computing machine-learning mpi multi-gpu neural-network openmp parallel-computing quantization real-time-monitoring secure-computing tensor-cores transformers
Last synced: 29 Mar 2025