An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with gpu-computing

A curated list of projects in awesome lists tagged with gpu-computing .

https://github.com/catboost/catboost

A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.

big-data catboost categorical-features coreml cuda data-mining data-science decision-trees gbdt gbm gpu gpu-computing gradient-boosting kaggle machine-learning python r tutorial

Last synced: 12 May 2025

https://github.com/nvidia/thrust

[ARCHIVED] The C++ parallel algorithms library. See https://github.com/NVIDIA/cccl

algorithms cpp cpp11 cpp14 cpp17 cpp20 cuda cxx cxx11 cxx14 cxx17 cxx20 gpu gpu-computing nvidia nvidia-hpc-sdk thrust

Last synced: 30 Mar 2025

https://github.com/thrust/thrust

[ARCHIVED] The C++ parallel algorithms library. See https://github.com/NVIDIA/cccl

algorithms cpp cpp11 cpp14 cpp17 cpp20 cuda cxx cxx11 cxx14 cxx17 cxx20 gpu gpu-computing nvidia nvidia-hpc-sdk thrust

Last synced: 17 Mar 2025

https://github.com/NVIDIA/thrust

[ARCHIVED] The C++ parallel algorithms library. See https://github.com/NVIDIA/cccl

algorithms cpp cpp11 cpp14 cpp17 cpp20 cuda cxx cxx11 cxx14 cxx17 cxx20 gpu gpu-computing nvidia nvidia-hpc-sdk thrust

Last synced: 15 Mar 2025

https://github.com/komputeproject/kompute

General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). Blazing fast, mobile-enabled, asynchronous and optimized for advanced GPU data processing usecases. Backed by the Linux Foundation.

cpp deep-learning deep-learning-gpu gpgpu gpu-computing machine-learning machine-learning-gpu python vulkan vulkan-compute vulkan-compute-example vulkan-compute-framework vulkan-compute-tutorial vulkan-demos vulkan-example vulkan-tutorial

Last synced: 13 May 2025

https://github.com/KomputeProject/kompute

General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). Blazing fast, mobile-enabled, asynchronous and optimized for advanced GPU data processing usecases. Backed by the Linux Foundation.

cpp deep-learning deep-learning-gpu gpgpu gpu-computing machine-learning machine-learning-gpu python vulkan vulkan-compute vulkan-compute-example vulkan-compute-framework vulkan-compute-tutorial vulkan-demos vulkan-example vulkan-tutorial

Last synced: 31 Mar 2025

https://github.com/inducer/pycuda

CUDA integration for Python, plus shiny features

array cuda gpu gpu-computing multidimensional-arrays pycuda python scientific-computing

Last synced: 13 May 2025

https://github.com/adaptivecpp/adaptivecpp

Compiler for multiple programming models (SYCL, C++ standard parallelism, HIP/CUDA) for CPUs and GPUs from all vendors: The independent, community-driven compiler for C++-based heterogeneous programming models. Lets applications adapt themselves to all the hardware in the system - even at runtime!

adaptivecpp compiler gpgpu gpu-computing high-performance high-performance-computing hipsycl hpc opensycl stdpar sycl

Last synced: 11 Dec 2025

https://github.com/calebwin/emu

The write-once-run-anywhere GPGPU library for Rust

emu gpgpu gpu gpu-acceleration gpu-computing gpu-programming rust

Last synced: 14 May 2025

https://calebwin.github.io/emu/

The write-once-run-anywhere GPGPU library for Rust

emu gpgpu gpu gpu-acceleration gpu-computing gpu-programming rust

Last synced: 30 Apr 2025

https://github.com/AdaptiveCpp/AdaptiveCpp

Compiler for multiple programming models (SYCL, C++ standard parallelism, HIP/CUDA) for CPUs and GPUs from all vendors: The independent, community-driven compiler for C++-based heterogeneous programming models. Lets applications adapt themselves to all the hardware in the system - even at runtime!

adaptivecpp compiler gpgpu gpu-computing high-performance high-performance-computing hipsycl hpc opensycl stdpar sycl

Last synced: 21 Apr 2025

https://github.com/mratsim/arraymancer

A fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, Cuda and OpenCL backends

autograd automatic-differentiation cuda cudnn deep-learning gpgpu gpu-computing high-performance-computing iot linear-algebra machine-learning matrix-library multidimensional-arrays ndarray neural-networks nim opencl openmp parallel-computing tensor

Last synced: 14 May 2025

https://github.com/beehive-lab/tornadovm

TornadoVM: A practical and efficient heterogeneous programming framework for managed languages

ai cuda gpu-acceleration gpu-computing gpus graalvm java levelzero multi-core opencl parallel-computing parallel-programming spirv

Last synced: 02 Dec 2025

https://github.com/nvidia/matx

An efficient C++17 GPU numerical computing library with Python-like syntax

cuda gpgpu gpu gpu-computing hpc

Last synced: 14 May 2025

https://github.com/NVIDIA/MatX

An efficient C++17 GPU numerical computing library with Python-like syntax

cuda gpgpu gpu gpu-computing hpc

Last synced: 26 Mar 2025

https://github.com/beehive-lab/TornadoVM

TornadoVM: A practical and efficient heterogeneous programming framework for managed languages

ai cuda gpu-acceleration gpu-computing gpus graalvm java levelzero multi-core opencl parallel-computing parallel-programming spirv

Last synced: 04 Apr 2025

https://github.com/mratsim/Arraymancer

A fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, Cuda and OpenCL backends

autograd automatic-differentiation cuda cudnn deep-learning gpgpu gpu-computing high-performance-computing iot linear-algebra machine-learning matrix-library multidimensional-arrays ndarray neural-networks nim opencl openmp parallel-computing tensor

Last synced: 16 Apr 2025

https://mratsim.github.io/Arraymancer/

A fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, Cuda and OpenCL backends

autograd automatic-differentiation cuda cudnn deep-learning gpgpu gpu-computing high-performance-computing iot linear-algebra machine-learning matrix-library multidimensional-arrays ndarray neural-networks nim opencl openmp parallel-computing tensor

Last synced: 08 May 2025

https://github.com/AccelerateHS/accelerate

Embedded language for high-performance array computations

accelerate cuda gpu gpu-computing hacktoberfest haskell llvm parallel-computing

Last synced: 12 May 2025

https://github.com/acceleratehs/accelerate

Embedded language for high-performance array computations

accelerate cuda gpu gpu-computing hacktoberfest haskell llvm parallel-computing

Last synced: 14 May 2025

https://github.com/Langhalsdino/Kubernetes-GPU-Guide

This guide should help fellow researchers and hobbyists to easily automate and accelerate there deep leaning training with their own Kubernetes GPU cluster.

cluster deep-learning distributed-systems gpu gpu-computing guide kubernetes kubernetes-cluster kubernetes-gpu-cluster kubernetes-setup worker-nodes

Last synced: 20 Jul 2025

https://github.com/iot-salzburg/gpu-jupyter

GPU-Jupyter: Your GPU-accelerated JupyterLab with a rich data science toolstack, TensorFlow and PyTorch for your reproducible deep learning experiments.

docker environment gpu-acceleration gpu-computing jupyter jupyter-server jupyterlab pytorch reproducible-research tensorflow

Last synced: 15 May 2025

https://github.com/ComputationalRadiationPhysics/picongpu

Performance-Portable Particle-in-Cell Simulations for the Exascale Era :sparkles:

gpu gpu-computing laser particle-accelerator particle-in-cell physics physics-simulation pic plasma research

Last synced: 26 Mar 2025

https://github.com/googlefonts/compute-shader-101

Sample code for compute shader 101 training

gpu-computing shaders

Last synced: 15 May 2025

https://github.com/AmesingFlank/taichi.js

Modern GPU Compute and Rendering in Javascript

gpu gpu-computing gpu-programming javascript webgpu webgpu-api webgpu-shaders

Last synced: 24 Mar 2025

https://github.com/ccsb-scripps/AutoDock-GPU

AutoDock for GPUs and other accelerators

autodock4 cuda gpu-computing molecular-docking multicore-cpu opencl

Last synced: 21 Nov 2025

https://github.com/smistad/fast

A framework for high-performance medical image processing, neural network inference and visualization

deep-learning digital-pathology gpu-computing image-processing medical-imaging opencl parallel-computing python streaming ultrasound visualization

Last synced: 13 Apr 2025

https://github.com/ccsb-scripps/autodock-gpu

AutoDock for GPUs and other accelerators

autodock4 cuda gpu-computing molecular-docking multicore-cpu opencl

Last synced: 15 May 2025

https://github.com/software-mansion/typegpu

TypeScript library that enhances the WebGPU API, allowing resource management in a type-safe, declarative way.

gpgpu gpu gpu-computing gpu-programming graphics javascript typesafe typescript webgpu webgpu-api wgsl wgsl-shader

Last synced: 14 Jun 2025

https://github.com/trisycl/trisycl

Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group

cpp cpp20 fpga gpu-computing heterogeneous-parallel-programming opencl spir sycl trisycl

Last synced: 15 May 2025

https://github.com/projectphysx/opencl-wrapper

OpenCL is the most powerful programming language ever created. Yet the OpenCL C++ bindings are cumbersome and the code overhead prevents many people from getting started. I created this lightweight OpenCL-Wrapper to greatly simplify OpenCL software development with C++ while keeping functionality and performance.

gpgpu gpgpu-computing gpu gpu-acceleration gpu-computing gpu-programming opencl vector-processor vectorization

Last synced: 16 May 2025

https://github.com/kpet/clvk

Implementation of OpenCL 3.0 on Vulkan

gpu-computing opencl vulkan vulkan-api

Last synced: 09 May 2025

https://github.com/rrze-hpc/gpu-benches

collection of benchmarks to measure basic GPU capabilities

cache gpu-computing micro-benchmarks performance

Last synced: 16 May 2025

https://github.com/RRZE-HPC/gpu-benches

collection of benchmarks to measure basic GPU capabilities

cache gpu-computing micro-benchmarks performance

Last synced: 22 Apr 2025

https://github.com/fastflow/fastflow

FastFlow pattern-based parallel programming framework (formerly on sourceforge)

gpu-computing gpu-programming multicore parallel-algorithm parallel-programming parallelization patterns skeleton-framework

Last synced: 01 Apr 2025

https://github.com/cdeterman/gpuR

R interface to use GPU's

gpgpu gpgpu-computing gpu gpu-computing r

Last synced: 13 Jul 2025

https://github.com/brandondube/prysm

physical optics: integrated modeling, phase retrieval, segmented systems, polynomials and fitting, sequential raytracing...

4d diffraction forbes-polynomial gpu-computing modeling mtf mtf-mapper optics phase-retrieval phasecam propagation psf python q-polynomial raytracing trioptics wavefront wavefront-sensing zernike zygo

Last synced: 04 Apr 2025

https://github.com/BasBuller/PySNN

Efficient Spiking Neural Network framework, built on top of PyTorch for GPU acceleration

deep-learning dynamic gpu-acceleration gpu-computing machine-learning neural-networks python3 pytorch spiking-neural-networks stdp

Last synced: 07 May 2025

https://github.com/shiinamiyuki/akari_render

High Performance CPU/GPU Physically Based Renderer in Rust

blender gpu-computing gpu-raytracing path-guiding path-tracer path-tracing ray-tracing raytracing rust

Last synced: 09 Oct 2025

https://github.com/projectphysx/opencl-benchmark

A small OpenCL benchmark program to measure peak GPU/CPU performance.

bandwidth benchmark benchmarking flops gpgpu gpu gpu-computing high-performance-computing hpc opencl tool tools

Last synced: 04 Apr 2025

https://github.com/rsnemmen/OpenCL-examples

Simple OpenCL examples for exploiting GPU computing

c examples gpu gpu-computing numerical-calculations opencl opencl-device

Last synced: 16 May 2025

https://github.com/zeam-vm/pelemay

Pelemay is a native compiler for Elixir, which generates SIMD instructions. It has a plan to generate for GPU code.

elixir gpu-computing simd-parallelism

Last synced: 06 Apr 2025

https://github.com/p-costa/CaNS

A code for fast, massively-parallel direct numerical simulations (DNS) of canonical flows

cfd computational-fluid-dynamics fluid-dynamics fluid-simulation fortran gpu gpu-computing high-performance-computing turbulence

Last synced: 22 Feb 2025

https://github.com/CaNS-World/CaNS

A code for fast, massively-parallel direct numerical simulations (DNS) of canonical flows

cfd computational-fluid-dynamics fluid-dynamics fluid-simulation fortran gpu gpu-computing high-performance-computing turbulence

Last synced: 14 Mar 2025

https://github.com/Ricks-Lab/gpu-utils

A set of utilities for monitoring and customizing GPU performance

amdgpu boinc einsteinathome gpu-computing gpu-monitoring gpu-settings gpu-utils linux milkyway overclock python3 setiathome

Last synced: 30 Apr 2025

https://github.com/goofit/goofit

Code repository for the massively-parallel framework for maximum-likelihood fits, implemented in CUDA/OpenMP

cuda fitting gpu gpu-computing omp physics root-cern thrust

Last synced: 10 Apr 2025

https://github.com/lachlan2k/phatcrack

Modern web-based distributed hashcracking solution, built on hashcat

distributed-computing golang gpu-computing hacking hashcat hashcracking infosec pentesting security-tools vue

Last synced: 10 Apr 2025

https://github.com/anicetngrt/jiro-nn

A Deep Learning and preprocessing framework in Rust with support for CPU and GPU.

adam classification cuda data-analysis deep-learning dropout gpu gpu-computing machine-learning ml nalgebra neural-networks nn opencl pipelines regression rust sgd

Last synced: 09 Apr 2025

https://github.com/AnicetNgrt/jiro-nn

A Deep Learning and preprocessing framework in Rust with support for CPU and GPU.

adam classification cuda data-analysis deep-learning dropout gpu gpu-computing machine-learning ml nalgebra neural-networks nn opencl pipelines regression rust sgd

Last synced: 25 Sep 2025

https://github.com/GooFit/GooFit

Code repository for the massively-parallel framework for maximum-likelihood fits, implemented in CUDA/OpenMP

cuda fitting gpu gpu-computing omp physics root-cern thrust

Last synced: 08 Apr 2025

https://github.com/intelpython/dpctl

Python SYCL bindings and SYCL-based Python Array API library

dppy gpu gpu-computing intel intel-xpu oneapi python sycl

Last synced: 16 May 2025

https://github.com/IntelPython/dpctl

Python SYCL bindings and SYCL-based Python Array API library

dppy gpu gpu-computing intel intel-xpu oneapi python sycl

Last synced: 04 Apr 2025

https://github.com/barbagroup/petibm

PetIBM - toolbox and applications of the immersed-boundary method on distributed-memory architectures

computational-fluid-dynamics gpu-computing immersed-boundary-method nvidia-amgx petsc

Last synced: 06 Apr 2025

https://github.com/barbagroup/PetIBM

PetIBM - toolbox and applications of the immersed-boundary method on distributed-memory architectures

computational-fluid-dynamics gpu-computing immersed-boundary-method nvidia-amgx petsc

Last synced: 14 Mar 2025

https://github.com/houkensjtu/taichi-fluid

A collection of CFD related resources for Taichi developers.

cfd gpu-computing parallel-computing python taichi

Last synced: 02 Apr 2025