Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists tagged with gpu-computing
A curated list of projects in awesome lists tagged with gpu-computing .
https://github.com/catboost/catboost
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.
big-data catboost categorical-features coreml cuda data-mining data-science decision-trees gbdt gbm gpu gpu-computing gradient-boosting kaggle machine-learning python r tutorial
Last synced: 16 Dec 2024
https://github.com/gyroflow/gyroflow
Video stabilization using gyroscope data
fpv gopro gpu gpu-computing gyroscope insta360 rolling-shutter-undistortion rust sony-alpha-cameras stabilization video video-processing
Last synced: 17 Dec 2024
https://github.com/nvidia/thrust
[ARCHIVED] The C++ parallel algorithms library. See https://github.com/NVIDIA/cccl
algorithms cpp cpp11 cpp14 cpp17 cpp20 cuda cxx cxx11 cxx14 cxx17 cxx20 gpu gpu-computing nvidia nvidia-hpc-sdk thrust
Last synced: 01 Nov 2024
https://github.com/NVIDIA/thrust
[ARCHIVED] The C++ parallel algorithms library. See https://github.com/NVIDIA/cccl
algorithms cpp cpp11 cpp14 cpp17 cpp20 cuda cxx cxx11 cxx14 cxx17 cxx20 gpu gpu-computing nvidia nvidia-hpc-sdk thrust
Last synced: 26 Oct 2024
https://github.com/google/tf-quant-finance
High-performance TensorFlow library for quantitative finance.
finance gpu gpu-computing high-performance high-performance-computing numerical-integration numerical-methods numerical-optimization python quantitative-finance quantlib tensorflow
Last synced: 16 Dec 2024
https://github.com/projectphysx/fluidx3d
The fastest and most memory efficient lattice Boltzmann CFD software, running on all GPUs and CPUs via OpenCL. Free for non-commercial use.
benchmark cfd computational-fluid-dynamics fluid-dynamics fluid-simulation fluid-solver gpgpu gpu gpu-computing high-performance-computing hpc interactive-visualization lattice-boltzmann lbm opencl physics raytracing scientific-computing scientific-visualization simulation
Last synced: 17 Dec 2024
https://github.com/ProjectPhysX/FluidX3D
The fastest and most memory efficient lattice Boltzmann CFD software, running on all GPUs via OpenCL. Free for non-commercial use.
benchmark cfd computational-fluid-dynamics fluid-dynamics fluid-simulation fluid-solver gpgpu gpu gpu-computing high-performance-computing hpc interactive-visualization lattice-boltzmann lbm opencl physics raytracing scientific-computing scientific-visualization simulation
Last synced: 30 Oct 2024
https://github.com/Microsoft/pai
Resource scheduling and cluster management for AI
ai artificial-intelligence chainer cloud cluster-management cluster-manager gpu gpu-cluster gpu-computing gpu-scheduler jupyter kubernetes machine-learning model-training on-premise pytorch resource-management scheduling tensorflow
Last synced: 09 Nov 2024
https://github.com/microsoft/pai
Resource scheduling and cluster management for AI
ai artificial-intelligence chainer cloud cluster-management cluster-manager gpu gpu-cluster gpu-computing gpu-scheduler jupyter kubernetes machine-learning model-training on-premise pytorch resource-management scheduling tensorflow
Last synced: 27 Sep 2024
https://github.com/komputeproject/kompute
General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). Blazing fast, mobile-enabled, asynchronous and optimized for advanced GPU data processing usecases. Backed by the Linux Foundation.
cpp deep-learning deep-learning-gpu gpgpu gpu-computing machine-learning machine-learning-gpu python vulkan vulkan-compute vulkan-compute-example vulkan-compute-framework vulkan-compute-tutorial vulkan-demos vulkan-example vulkan-tutorial
Last synced: 18 Dec 2024
https://github.com/jbush001/nyuziprocessor
GPGPU microprocessor architecture
fpga gpu gpu-computing graphics hardware microprocessor processor-architecture verilog
Last synced: 20 Dec 2024
https://github.com/jbush001/NyuziProcessor
GPGPU microprocessor architecture
fpga gpu gpu-computing graphics hardware microprocessor processor-architecture verilog
Last synced: 25 Oct 2024
https://github.com/KomputeProject/kompute
General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). Blazing fast, mobile-enabled, asynchronous and optimized for advanced GPU data processing usecases. Backed by the Linux Foundation.
cpp deep-learning deep-learning-gpu gpgpu gpu-computing machine-learning machine-learning-gpu python vulkan vulkan-compute vulkan-compute-example vulkan-compute-framework vulkan-compute-tutorial vulkan-demos vulkan-example vulkan-tutorial
Last synced: 01 Nov 2024
https://github.com/inducer/pycuda
CUDA integration for Python, plus shiny features
array cuda gpu gpu-computing multidimensional-arrays pycuda python scientific-computing
Last synced: 17 Dec 2024
https://github.com/sciml/scimlbook
Parallel Computing and Scientific Machine Learning (SciML): Methods and Applications (MIT 18.337J/6.338J)
differential-equations gpu-computing lecture-notes neural-networks neural-ode neural-sde numerical-methods parallelism performance-engineering scientific-machine-learning scientific-simulators sciml stiff-equations
Last synced: 02 Dec 2024
https://github.com/SciML/SciMLBook
Parallel Computing and Scientific Machine Learning (SciML): Methods and Applications (MIT 18.337J/6.338J)
differential-equations gpu-computing lecture-notes neural-networks neural-ode neural-sde numerical-methods parallelism performance-engineering scientific-machine-learning scientific-simulators sciml stiff-equations
Last synced: 13 Nov 2024
https://github.com/coreylowman/dfdx
Deep learning in Rust, with shape checked tensors and neural networks
autodiff autodifferentiation autograd backpropagation cuda cuda-kernels cuda-support cuda-toolkit cudnn deep-learning deep-neural-networks gpu gpu-acceleration gpu-computing machine-learning neural-network rust rust-lang tensor
Last synced: 17 Dec 2024
https://calebwin.github.io/emu/
The write-once-run-anywhere GPGPU library for Rust
emu gpgpu gpu gpu-acceleration gpu-computing gpu-programming rust
Last synced: 11 Nov 2024
https://github.com/calebwin/emu
The write-once-run-anywhere GPGPU library for Rust
emu gpgpu gpu gpu-acceleration gpu-computing gpu-programming rust
Last synced: 27 Oct 2024
https://github.com/bindsnet/bindsnet
Simulation of spiking neural networks (SNNs) using PyTorch.
dynamic gpu-computing machine-learning neurons pytorch reinforcement-learning simulation snn spiking-neural-networks stdp synapse
Last synced: 19 Dec 2024
https://github.com/BindsNET/bindsnet
Simulation of spiking neural networks (SNNs) using PyTorch.
dynamic gpu-computing machine-learning neurons pytorch reinforcement-learning simulation snn spiking-neural-networks stdp synapse
Last synced: 10 Nov 2024
https://github.com/adaptivecpp/adaptivecpp
Implementation of SYCL and C++ standard parallelism for CPUs and GPUs from all vendors: The independent, community-driven compiler for C++-based heterogeneous programming models. Lets applications adapt themselves to all the hardware in the system - even at runtime!
adaptivecpp compiler gpgpu gpu-computing high-performance high-performance-computing hipsycl hpc opensycl stdpar sycl
Last synced: 19 Dec 2024
https://github.com/AdaptiveCpp/AdaptiveCpp
Implementation of SYCL and C++ standard parallelism for CPUs and GPUs from all vendors: The independent, community-driven compiler for C++-based heterogeneous programming models. Lets applications adapt themselves to all the hardware in the system - even at runtime!
adaptivecpp compiler gpgpu gpu-computing high-performance high-performance-computing hipsycl hpc opensycl stdpar sycl
Last synced: 09 Nov 2024
https://github.com/nvidia/cccl
CUDA Core Compute Libraries
accelerated-computing cpp cpp-programming cuda cuda-cpp cuda-kernels cuda-library cuda-programming gpu gpu-acceleration gpu-computing gpu-programming hpc modern-cpp nvidia nvidia-gpu parallel-algorithm parallel-computing parallel-programming
Last synced: 19 Dec 2024
https://github.com/NVIDIA/cccl
CUDA Core Compute Libraries
accelerated-computing cpp cpp-programming cuda cuda-cpp cuda-kernels cuda-library cuda-programming gpu gpu-acceleration gpu-computing gpu-programming hpc modern-cpp nvidia nvidia-gpu parallel-algorithm parallel-computing parallel-programming
Last synced: 19 Nov 2024
https://github.com/nvidia/matx
An efficient C++17 GPU numerical computing library with Python-like syntax
cuda gpgpu gpu gpu-computing hpc
Last synced: 20 Dec 2024
https://github.com/NVIDIA/MatX
An efficient C++17 GPU numerical computing library with Python-like syntax
cuda gpgpu gpu gpu-computing hpc
Last synced: 30 Oct 2024
https://github.com/beehive-lab/tornadovm
TornadoVM: A practical and efficient heterogeneous programming framework for managed languages
ai cuda gpu-acceleration gpu-computing gpus graalvm java levelzero multi-core opencl spirv
Last synced: 19 Dec 2024
https://github.com/beehive-lab/TornadoVM
TornadoVM: A practical and efficient heterogeneous programming framework for managed languages
ai cuda gpu-acceleration gpu-computing gpus graalvm java levelzero multi-core opencl spirv
Last synced: 05 Nov 2024
https://mratsim.github.io/Arraymancer/
A fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, Cuda and OpenCL backends
autograd automatic-differentiation cuda cudnn deep-learning gpgpu gpu-computing high-performance-computing iot linear-algebra machine-learning matrix-library multidimensional-arrays ndarray neural-networks nim opencl openmp parallel-computing tensor
Last synced: 14 Nov 2024
https://github.com/mratsim/arraymancer
A fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, Cuda and OpenCL backends
autograd automatic-differentiation cuda cudnn deep-learning gpgpu gpu-computing high-performance-computing iot linear-algebra machine-learning matrix-library multidimensional-arrays ndarray neural-networks nim opencl openmp parallel-computing tensor
Last synced: 21 Dec 2024
https://github.com/mratsim/Arraymancer
A fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, Cuda and OpenCL backends
autograd automatic-differentiation cuda cudnn deep-learning gpgpu gpu-computing high-performance-computing iot linear-algebra machine-learning matrix-library multidimensional-arrays ndarray neural-networks nim opencl openmp parallel-computing tensor
Last synced: 08 Nov 2024
https://github.com/stotko/stdgpu
stdgpu: Efficient STL-like Data Structures on the GPU
cpp cpp17 cpp20 cuda data-structures gpgpu gpu gpu-acceleration gpu-computing hip modern-cpp openmp rocm stl stl-containers stl-like
Last synced: 20 Dec 2024
https://github.com/luxcorerender/luxcore
LuxCore source repository
3d-graphics bidirectional-path-tracing cuda gpu-computing luxcorerender luxrender opencl optix path-tracing pathtracer ray ray-tracer ray-tracing raytracer raytracing rtx visualization
Last synced: 19 Dec 2024
https://github.com/uncomplicate/neanderthal
Fast Clojure Matrix Library
api clojure clojure-library cuda gpgpu gpu gpu-computing high-performance-computing java matrix matrix-calculations matrix-factorization matrix-functions matrix-multiplication opencl vectorization
Last synced: 18 Dec 2024
https://github.com/acceleratehs/accelerate
Embedded language for high-performance array computations
accelerate cuda gpu gpu-computing hacktoberfest haskell llvm parallel-computing
Last synced: 18 Dec 2024
https://github.com/AccelerateHS/accelerate
Embedded language for high-performance array computations
accelerate cuda gpu gpu-computing hacktoberfest haskell llvm parallel-computing
Last synced: 18 Nov 2024
https://github.com/Langhalsdino/Kubernetes-GPU-Guide
This guide should help fellow researchers and hobbyists to easily automate and accelerate there deep leaning training with their own Kubernetes GPU cluster.
cluster deep-learning distributed-systems gpu gpu-computing guide kubernetes kubernetes-cluster kubernetes-gpu-cluster kubernetes-setup worker-nodes
Last synced: 27 Nov 2024
https://github.com/eyalroz/cuda-api-wrappers
Thin, unified, C++-flavored wrappers for the CUDA APIs
api-wrapper cuda cuda-api-wrappers cuda-device cuda-driver cuda-driver-api cuda-programming cuda-runtime-api cuda-toolkit gpgpu gpgpu-computing gpu gpu-computing gpu-memory modern-cpp
Last synced: 09 Nov 2024
https://github.com/zszazi/deep-learning-in-cloud
List of Deep Learning Cloud Providers
artificial-intelligence cloud cloud-gpus deep-learning deeplearning gpu gpu-computing machine-learning mlops
Last synced: 10 Dec 2024
https://github.com/luxcorerender/blendluxcore
Blender Integration for LuxCore
3d-graphics blender blender-addon gpu-computing luxcorerender opencl ray-tracing raytracer visualization
Last synced: 20 Dec 2024
https://github.com/ComputationalRadiationPhysics/picongpu
Performance-Portable Particle-in-Cell Simulations for the Exascale Era :sparkles:
gpu gpu-computing laser particle-accelerator particle-in-cell physics physics-simulation pic plasma research
Last synced: 30 Oct 2024
https://github.com/googlefonts/compute-shader-101
Sample code for compute shader 101 training
Last synced: 16 Dec 2024
https://github.com/AmesingFlank/taichi.js
Modern GPU Compute and Rendering in Javascript
gpu gpu-computing gpu-programming javascript webgpu webgpu-api webgpu-shaders
Last synced: 28 Oct 2024
https://github.com/smistad/fast
A framework for high-performance medical image processing, neural network inference and visualization
deep-learning digital-pathology gpu-computing image-processing medical-imaging opencl parallel-computing python streaming ultrasound visualization
Last synced: 20 Dec 2024
https://github.com/trisycl/trisycl
Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group
cpp cpp20 fpga gpu-computing heterogeneous-parallel-programming opencl spir sycl trisycl
Last synced: 21 Dec 2024
https://github.com/ginkgo-project/ginkgo
Numerical linear algebra software package
cuda dpcpp gpu-computing hip hpc krylov-methods linear-algebra oneapi openmp preconditioning sparse-linear-systems spmv
Last synced: 20 Dec 2024
https://github.com/ccsb-scripps/autodock-gpu
AutoDock for GPUs and other accelerators
autodock4 cuda gpu-computing molecular-docking multicore-cpu opencl
Last synced: 21 Dec 2024
https://github.com/juliagpu/kernelabstractions.jl
Heterogeneous programming in Julia
gpu-computing heterogeneous-parallel-programming julia julia-package julialang
Last synced: 20 Dec 2024
https://github.com/JuliaGPU/KernelAbstractions.jl
Heterogeneous programming in Julia
gpu-computing heterogeneous-parallel-programming julia julia-package julialang
Last synced: 05 Nov 2024
https://github.com/uncomplicate/bayadera
High-performance Bayesian Data Analysis on the GPU in Clojure
bayesian bayesian-data-analysis bayesian-inference clojure clojure-library cuda gpu gpu-acceleration gpu-computing high-performance-computing machine-learning markov-chain-monte-carlo mcmc opencl statistics
Last synced: 17 Dec 2024
https://github.com/kpet/clvk
Implementation of OpenCL 3.0 on Vulkan
gpu-computing opencl vulkan vulkan-api
Last synced: 16 Nov 2024
https://github.com/projectphysx/opencl-wrapper
OpenCL is the most powerful programming language ever created. Yet the OpenCL C++ bindings are cumbersome and the code overhead prevents many people from getting started. I created this lightweight OpenCL-Wrapper to greatly simplify OpenCL software development with C++ while keeping functionality and performance.
gpgpu gpgpu-computing gpu gpu-acceleration gpu-computing gpu-programming opencl vector-processor vectorization
Last synced: 15 Dec 2024
https://github.com/andrewmilson/ministark
🏃♂️💨 GPU accelerated STARK prover built on @arkworks-rs
apple-silicon arkworks arkworks-rs crypto cryptography fft finite-fields gpu gpu-acceleration gpu-computing gpu-programming m1 metal optimization polynomials rust starks virtual-machine zero-knowledge zkstarks
Last synced: 09 Nov 2024
https://github.com/favreau/Sol-R
Open-Source CUDA/OpenCL Speed Of Light Ray-tracer
3d 3d-graphics-engine cuda gpgpu gpu-acceleration gpu-computing graphics-engine interactive opencl path-tracing pathtracing ray-tracing raytracer raytracing raytracing-engine realtime-rendering rendering science virtual-reality vr
Last synced: 12 Nov 2024
https://github.com/kerneltuner/kernel_tuner
Kernel Tuner
auto-tuning autotuning c cplusplus cuda cuda-kernels gpu gpu-computing kernel-tuner machine-learning opencl opencl-kernels optimization python software-development testing
Last synced: 20 Dec 2024
https://github.com/fastflow/fastflow
FastFlow pattern-based parallel programming framework (formerly on sourceforge)
gpu-computing gpu-programming multicore parallel-algorithm parallel-programming parallelization patterns skeleton-framework
Last synced: 02 Nov 2024
https://github.com/uncomplicate/clojurecl
ClojureCL is a Clojure library for parallel computations with OpenCL.
amd-opencl clojure clojure-library gpu-computing high-performance high-performance-computing intel nvidia opencl parallel-computations
Last synced: 18 Dec 2024
https://github.com/baggepinnen/montecarlomeasurements.jl
Propagation of distributions by Monte-Carlo sampling: Real number types with uncertainty represented by samples.
error-analysis error-propagation gpu-acceleration gpu-computing monte-carlo monte-carlo-sampling monte-carlo-simulation numeric-types particle-filter physical-quantities probability-distributions robust-optimization uncertainties uncertainty-propagation uncertainty-sampling
Last synced: 21 Nov 2024
https://github.com/baggepinnen/MonteCarloMeasurements.jl
Propagation of distributions by Monte-Carlo sampling: Real number types with uncertainty represented by samples.
error-analysis error-propagation gpu-acceleration gpu-computing monte-carlo monte-carlo-sampling monte-carlo-simulation numeric-types particle-filter physical-quantities probability-distributions robust-optimization uncertainties uncertainty-propagation uncertainty-sampling
Last synced: 30 Oct 2024
https://github.com/cdeterman/gpuR
R interface to use GPU's
gpgpu gpgpu-computing gpu gpu-computing r
Last synced: 22 Nov 2024
https://github.com/brandondube/prysm
physical optics: integrated modeling, phase retrieval, segmented systems, polynomials and fitting, sequential raytracing...
4d diffraction forbes-polynomial gpu-computing modeling mtf mtf-mapper optics phase-retrieval phasecam propagation psf python q-polynomial raytracing trioptics wavefront wavefront-sensing zernike zygo
Last synced: 15 Dec 2024
https://github.com/mfem/pymfem
Python wrapper for MFEM
fem finite-elements gpu-computing hpc parallel-computing python scientific-computing swig
Last synced: 14 Dec 2024
https://github.com/denosaurs/netsaur
Powerful Powerful Machine Learning library with GPU, CPU and WASM backends
ai artificial-intelligence deep-learning deep-neural-networks deno edge-computing gpu-acceleration gpu-computing hacktoberfest machine-learning ml neural-network rust safetensors serverless typescript wasm webassembly webgpu
Last synced: 18 Dec 2024
https://github.com/BasBuller/PySNN
Efficient Spiking Neural Network framework, built on top of PyTorch for GPU acceleration
deep-learning dynamic gpu-acceleration gpu-computing machine-learning neural-networks python3 pytorch spiking-neural-networks stdp
Last synced: 14 Nov 2024
https://github.com/lnstadrum/beatmup
Beatmup: image and signal processing library
android gpu gpu-computing image-processing linux opengl opengl-es raspberry-pi signal-processing windows
Last synced: 16 Dec 2024
https://github.com/ROCm/Tensile
Stretching GPU performance for GEMMs and tensor contractions.
amd assembly auto-tuning blas dnn gemm gpu gpu-acceleration gpu-computing hip machine-learning matrix-multiplication neural-networks opencl python radeon tensor-contraction tensors
Last synced: 30 Nov 2024
https://github.com/rocm/tensile
Stretching GPU performance for GEMMs and tensor contractions.
amd assembly auto-tuning blas dnn gemm gpu gpu-acceleration gpu-computing hip machine-learning matrix-multiplication neural-networks opencl python radeon tensor-contraction tensors
Last synced: 21 Dec 2024
https://github.com/te42kyfo/gpu-benches
collection of benchmarks to measure basic GPU capabilities
cache gpu-computing micro-benchmarks performance
Last synced: 09 Nov 2024
https://github.com/rsnemmen/OpenCL-examples
Simple OpenCL examples for exploiting GPU computing
c examples gpu gpu-computing numerical-calculations opencl opencl-device
Last synced: 19 Nov 2024
https://github.com/zeam-vm/pelemay
Pelemay is a native compiler for Elixir, which generates SIMD instructions. It has a plan to generate for GPU code.
elixir gpu-computing simd-parallelism
Last synced: 16 Dec 2024
https://github.com/CaNS-World/CaNS
A code for fast, massively-parallel direct numerical simulations (DNS) of canonical flows
cfd computational-fluid-dynamics fluid-dynamics fluid-simulation fortran gpu gpu-computing high-performance-computing turbulence
Last synced: 25 Oct 2024
https://github.com/uncomplicate/clojurecuda
Clojure library for CUDA development
clojure clojure-library cuda cuda-development gpu-acceleration gpu-computing high-performance java
Last synced: 21 Dec 2024
https://github.com/zjin-lcf/HeCBench
benchmark cuda gpu-computing hip hpc-applications openmp scientific-computing sycl test-driven-development
Last synced: 05 Nov 2024
https://github.com/projectphysx/opencl-benchmark
A small OpenCL benchmark program to measure peak GPU/CPU performance.
bandwidth benchmark benchmarking flops gpgpu gpu gpu-computing high-performance-computing hpc opencl tool tools
Last synced: 21 Dec 2024
https://github.com/nixon-voxell/GPUClothSimulationInUnity
Trying to replicate what this legend did: https://youtu.be/kCGHXlLR3l8
cloth cloth-simulation compute-shader compute-shaders gpu-accelerated-library gpu-cloth-simulation gpu-computing pbd position-based-dynamics
Last synced: 14 Nov 2024
https://github.com/acceleratehs/accelerate-llvm
LLVM backend for Accelerate
accelerate compiler cuda gpu gpu-computing hacktoberfest haskell llvm parallel-computing
Last synced: 15 Dec 2024
https://github.com/Ricks-Lab/gpu-utils
A set of utilities for monitoring and customizing GPU performance
amdgpu boinc einsteinathome gpu-computing gpu-monitoring gpu-settings gpu-utils linux milkyway overclock python3 setiathome
Last synced: 12 Nov 2024
https://github.com/GooFit/GooFit
Code repository for the massively-parallel framework for maximum-likelihood fits, implemented in CUDA/OpenMP
cuda fitting gpu gpu-computing omp physics root-cern thrust
Last synced: 06 Nov 2024
https://github.com/anicetngrt/jiro-nn
A Deep Learning and preprocessing framework in Rust with support for CPU and GPU.
adam classification cuda data-analysis deep-learning dropout gpu gpu-computing machine-learning ml nalgebra neural-networks nn opencl pipelines regression rust sgd
Last synced: 03 Dec 2024
https://github.com/AnicetNgrt/jiro-nn
A Deep Learning and preprocessing framework in Rust with support for CPU and GPU.
adam classification cuda data-analysis deep-learning dropout gpu gpu-computing machine-learning ml nalgebra neural-networks nn opencl pipelines regression rust sgd
Last synced: 24 Sep 2024
https://github.com/barbagroup/petibm
PetIBM - toolbox and applications of the immersed-boundary method on distributed-memory architectures
computational-fluid-dynamics gpu-computing immersed-boundary-method nvidia-amgx petsc
Last synced: 06 Nov 2024
https://github.com/barbagroup/PetIBM
PetIBM - toolbox and applications of the immersed-boundary method on distributed-memory architectures
computational-fluid-dynamics gpu-computing immersed-boundary-method nvidia-amgx petsc
Last synced: 25 Oct 2024
https://github.com/radiantone/entangle
A lightweight (serverless) native python parallel processing framework based on simple decorators and call graphs.
artificial-intelligence containers dataflow dataflow-engine decorator-composition devops gpu gpu-computing hpc parallel parallel-processes parallel-workflows python3 scripting supercomputing workflow-composition workflow-managers
Last synced: 20 Nov 2024
https://github.com/intelpython/dpctl
Python SYCL bindings and SYCL-based Python Array API library
dppy gpu gpu-computing intel intel-xpu oneapi python sycl
Last synced: 15 Dec 2024
https://github.com/IntelPython/dpctl
Python SYCL bindings and SYCL-based Python Array API library
dppy gpu gpu-computing intel intel-xpu oneapi python sycl
Last synced: 05 Nov 2024
https://github.com/Heteroflow/Heteroflow
Concurrent CPU-GPU Programming using Task Models
cpu-gpu-scheduling cuda gpu gpu-acceleration gpu-computing gpu-programming heterogeneous-computing heterogeneous-parallel-programming heterogeneous-systems multithreaded multithreading task-parallelism
Last synced: 02 Nov 2024
https://github.com/houkensjtu/taichi-fluid
A collection of CFD related resources for Taichi developers.
cfd gpu-computing parallel-computing python taichi
Last synced: 03 Nov 2024
https://github.com/etaler/Etaler
A flexable HTM (Hierarchical Temporal Memory) framework with full GPU support.
gpu-computing hierarchical-temporal-memory machine-intelligence machine-learning opencl tensor
Last synced: 20 Nov 2024
https://github.com/slai-labs/get-beam
Run GPU inference and training jobs on serverless infrastructure that scales with you.
artificial-intelligence cloud-computing cost-optimization data-science deep-learning distributed-computing gpu-acceleration gpu-computing hpc llm-serving llm-training machine-learning ml-infrastructure mlops python serverless serverless-architectures
Last synced: 09 Nov 2024
https://github.com/larsgeb/m1-gpu-cpp
Metal Shading Language on Apple M1's GPU for scientific C++.
clang cpp cpp17 gpu-acceleration gpu-computing m1-mac metal metal-cpp objective-c scientific-computing
Last synced: 27 Oct 2024
https://github.com/f0nzie/rTorch.old
PyTorch bindings for R
deep-learning gpu-computing machine-learning-library neural-networks python r-package rstats
Last synced: 22 Nov 2024
https://github.com/gavinlyonsrepo/raspberrypi_tempmon
System monitoring program for Raspberry pi single board computers written in Python 3.
arm cpu cpu-monitoring cpu-temperature desktop-notifications gpu-computing graph-mode logfile logging python raspberry raspberry-pi raspberry-pi-3 raspberrypi raspbian rpi ssmtp stress temperature-monitoring tempertaure
Last synced: 17 Dec 2024
https://github.com/ashvardanian/parallelreductionsbenchmark
Thrust, CUB, TBB, AVX2, CUDA, OpenCL, OpenMP, SyCL - all it takes to sum a lot of numbers fast!
cuda glsl gpgpu gpu gpu-acceleration gpu-computing halide hpc intel nvidia opencl openmp parallel simd stl sycl tbb thrust vulkan
Last synced: 26 Oct 2024
https://paragroup.github.io/WindFlow/
A C++17 Data Stream Processing Parallel Library for Multicores and GPUs
cuda gpu gpu-acceleration gpu-computing gpu-programming multi-core multicore multithreading parallel-computing parallel-patterns parallel-programming parallelism sliding-windows stream stream-api stream-processing streaming streaming-api streaming-data streams
Last synced: 18 Nov 2024
https://github.com/xmartlabs/cuda-calculator
Online CUDA Occupancy Calculator
cuda gpgpu gpu gpu-computing gpu-kernels gpu-programming kernel nvidia occupancy
Last synced: 23 Oct 2024
https://github.com/node-3d/3d-core-raub
An extensible Node.js 3D core for desktop applications
3d gl glfw gpu gpu-computing graphics image javascript js native node-3d opengl pixijs simulation threejs vao vbo webgl window
Last synced: 12 Nov 2024
https://github.com/mil-tokyo/sushi2
Matrix Library for JavaScript
gpu-computing javascript matrix-library webcl
Last synced: 18 Nov 2024