Projects in Awesome Lists tagged with high-performance-computing
A curated list of projects in awesome lists tagged with high-performance-computing.
https://github.com/taskflow/taskflow
A General-purpose Task-parallel Programming System using Modern C++
concurrent-programming cuda-programming gpu-programming heterogeneous-parallel-programming high-performance-computing multi-threading multicore-programming multithreading parallel parallel-computing parallel-programming taskflow taskparallelism threadpool work-stealing
Last synced: 14 May 2025
https://github.com/netflix/metaflow
Build, Manage and Deploy AI/ML Systems
agents ai aws azure data-science datascience gcp generative-ai high-performance-computing kubernetes llm llmops machine-learning ml ml-infrastructure ml-platform mlops model-management python
Last synced: 09 Sep 2025
https://github.com/Netflix/metaflow
Build and manage real-life ML, AI, and data science projects with ease!
ai aws azure data-science datascience gcp high-performance-computing kubernetes machine-learning ml ml-infrastructure ml-platform mlops model-management productivity python r r-package reproducible-research rstats
Last synced: 13 Mar 2025
https://github.com/google/tf-quant-finance
High-performance TensorFlow library for quantitative finance.
finance gpu gpu-computing high-performance high-performance-computing numerical-integration numerical-methods numerical-optimization python quantitative-finance quantlib tensorflow
Last synced: 14 May 2025
https://github.com/projectphysx/fluidx3d
The fastest and most memory efficient lattice Boltzmann CFD software, running on all GPUs and CPUs via OpenCL. Free for non-commercial use.
benchmark cfd computational-fluid-dynamics fluid-dynamics fluid-simulation fluid-solver gpgpu gpu gpu-computing high-performance-computing hpc interactive-visualization lattice-boltzmann lbm opencl physics raytracing scientific-computing scientific-visualization simulation
Last synced: 13 May 2025
https://github.com/ProjectPhysX/FluidX3D
The fastest and most memory efficient lattice Boltzmann CFD software, running on all GPUs and CPUs via OpenCL. Free for non-commercial use.
benchmark cfd computational-fluid-dynamics fluid-dynamics fluid-simulation fluid-solver gpgpu gpu gpu-computing high-performance-computing hpc interactive-visualization lattice-boltzmann lbm opencl physics raytracing scientific-computing scientific-visualization simulation
Last synced: 26 Mar 2025
https://github.com/parallel101/course
High-Performance Parallel Programming and Optimization - Course Slides
course cpp cpp17 high-performance-computing parallel-computing slides
Last synced: 29 Apr 2025
https://github.com/alpa-projects/alpa
Training and serving large-scale neural networks with auto parallelization.
alpa auto-parallelization compiler deep-learning distributed-computing distributed-training high-performance-computing jax llm machine-learning
Last synced: 22 Feb 2025
https://github.com/bshoshany/thread-pool
BS::thread_pool: a fast, lightweight, modern, and easy-to-use C++17 / C++20 / C++23 thread pool library
concurrency cplusplus cplusplus-17 cplusplus-20 cplusplus-23 cpp cpp17 cpp20 cpp20-modules cpp23 easy-to-use high-performance high-performance-computing multithreading parallel scientific-computing thread-pool threading threadpool
Last synced: 14 May 2025
https://github.com/flame/blis
BLAS-like Library Instantiation Software Framework
blas blas-libraries blis high-performance high-performance-computing hpc linear-algebra linear-algebra-library matrix matrix-calculations matrix-functions matrix-library matrix-multiplication optimization
Last synced: 25 Feb 2025
https://github.com/kokkos/kokkos
Kokkos C++ Performance Portability Programming Ecosystem: The Programming Model - Parallel Execution and Memory Abstraction
abstraction c-plus-plus high-performance-computing hpsf kokkos parallel-computing programming-model
Last synced: 13 May 2025
https://github.com/boinc/boinc
Open-source software for volunteer computing and grid computing.
android boinc c-plus-plus citizen-science distributed-computing grid-computing hacktoberfest high-performance-computing high-throughput-computing java kotlin php science scientific-computing volunteer-computing
Last synced: 14 May 2025
https://github.com/BOINC/boinc
Open-source software for volunteer computing and grid computing.
android boinc c-plus-plus citizen-science distributed-computing grid-computing hacktoberfest high-performance-computing high-throughput-computing java kotlin php science scientific-computing volunteer-computing
Last synced: 16 Apr 2025
https://github.com/chapel-lang/chapel
A Productive Parallel Programming Language
chapel compiler concurrency distributed-computing gpu high-performance-computing hpc language open-source parallel parallel-computing performance productive programming-language scientific-computing
Last synced: 14 May 2025
https://github.com/mfem/mfem
Lightweight, general, scalable C++ library for finite element methods
amr computational-science fem finite-elements high-order high-performance-computing hpc math-physics parallel-computing radiuss scientific-computing
Last synced: 12 Dec 2025
https://github.com/hermit-os/hermit-rs
Hermit for Rust.
cloud-computing high-performance-computing operating-system operating-systems osdev rust rust-lang unikernel virtualization
Last synced: 16 May 2025
https://github.com/maratyszcza/nnpack
Acceleration package for neural networks on multi-core CPUs
convolutional-layers cpu fast-fourier-transform high-performance high-performance-computing inference matrix-multiplication multithreading neural-network neural-networks simd winograd-transform
Last synced: 15 May 2025
https://github.com/Maratyszcza/NNPACK
Acceleration package for neural networks on multi-core CPUs
convolutional-layers cpu fast-fourier-transform high-performance high-performance-computing inference matrix-multiplication multithreading neural-network neural-networks simd winograd-transform
Last synced: 18 Mar 2025
https://github.com/adaptivecpp/adaptivecpp
Compiler for multiple programming models (SYCL, C++ standard parallelism, HIP/CUDA) for CPUs and GPUs from all vendors: The independent, community-driven compiler for C++-based heterogeneous programming models. Lets applications adapt themselves to all the hardware in the system - even at runtime!
adaptivecpp compiler gpgpu gpu-computing high-performance high-performance-computing hipsycl hpc opensycl stdpar sycl
Last synced: 11 Dec 2025
https://github.com/AdaptiveCpp/AdaptiveCpp
Compiler for multiple programming models (SYCL, C++ standard parallelism, HIP/CUDA) for CPUs and GPUs from all vendors: The independent, community-driven compiler for C++-based heterogeneous programming models. Lets applications adapt themselves to all the hardware in the system - even at runtime!
adaptivecpp compiler gpgpu gpu-computing high-performance high-performance-computing hipsycl hpc opensycl stdpar sycl
Last synced: 21 Apr 2025
https://github.com/mratsim/arraymancer
A fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, CUDA and OpenCL backends
autograd automatic-differentiation cuda cudnn deep-learning gpgpu gpu-computing high-performance-computing iot linear-algebra machine-learning matrix-library multidimensional-arrays ndarray neural-networks nim opencl openmp parallel-computing tensor
Last synced: 14 May 2025
https://github.com/ropensci/drake
An R-focused pipeline toolkit for reproducibility and high-performance computing
data-science drake high-performance-computing makefile peer-reviewed pipeline r r-package reproducibility reproducible-research ropensci rstats workflow
Last synced: 13 May 2025
https://github.com/trilinos/trilinos
Primary repository for the Trilinos Project
c-plus-plus high-performance-computing hpc hpsf sandia-national-laboratories scientific-computing snl-science-libs trilinos
Last synced: 14 May 2025
https://github.com/hermit-os/kernel
A Rust-based, lightweight unikernel.
cloud-computing high-performance-computing kernel operating-system operating-systems osdev rust rust-lang unikernels virtualization
Last synced: 14 May 2025
https://mratsim.github.io/Arraymancer/
A fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, CUDA and OpenCL backends
autograd automatic-differentiation cuda cudnn deep-learning gpgpu gpu-computing high-performance-computing iot linear-algebra machine-learning matrix-library multidimensional-arrays ndarray neural-networks nim opencl openmp parallel-computing tensor
Last synced: 08 May 2025
https://github.com/mratsim/Arraymancer
A fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, CUDA and OpenCL backends
autograd automatic-differentiation cuda cudnn deep-learning gpgpu gpu-computing high-performance-computing iot linear-algebra machine-learning matrix-library multidimensional-arrays ndarray neural-networks nim opencl openmp parallel-computing tensor
Last synced: 16 Apr 2025
https://github.com/sail-sg/envpool
C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.
atari-games box2d cpp17 dm-control dm-env gym high-performance-computing lock-free-queue mujoco parallel-processing pybind11 reinforcement-learning reinforcement-learning-environments robotics threadpool vizdoom
Last synced: 15 May 2025
https://github.com/uncomplicate/neanderthal
Fast Clojure Matrix Library
api clojure clojure-library cuda gpgpu gpu gpu-computing high-performance-computing java matrix matrix-calculations matrix-factorization matrix-functions matrix-multiplication opencl vectorization
Last synced: 14 May 2025
https://github.com/liu-xiandong/how_to_optimize_in_gpu
This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sgemv, sgemm, etc. The performance of these kernels is basically at or near the theoretical limit.
elementwise gpu-acceleration high-performance-computing hpc reduce sgemm sgemv
Last synced: 03 Oct 2025
https://github.com/ropensci/targets
Function-oriented Make-like declarative workflows for R
data-science high-performance-computing make peer-reviewed pipeline r r-package r-targetopia reproducibility reproducible-research rstats targets workflow
Last synced: 13 May 2025
https://github.com/mateogianolio/vectorious
Linear algebra in TypeScript.
blas high-performance-computing javascript linear-algebra linear-algebra-library machine-learning matrix typescript vector
Last synced: 15 May 2025
https://github.com/openmc-dev/openmc
OpenMC Monte Carlo Code
computational-physics high-performance-computing monte-carlo-simulation neutron-transport neutronics nuclear-data nuclear-energy nuclear-engineering nuclear-fusion openmc particle-transport photon-transport radiation-transport
Last synced: 18 Dec 2025
https://github.com/Liu-xiandong/How_to_optimize_in_GPU
This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sgemv, sgemm, etc. The performance of these kernels is basically at or near the theoretical limit.
elementwise gpu-acceleration high-performance-computing hpc reduce sgemm sgemv
Last synced: 14 May 2025
https://github.com/precice/precice
A coupling library for partitioned multi-physics simulations, including, but not restricted to fluid-structure interaction and conjugate heat transfer simulations.
calculix co-simulation code-aster computer-aided-engineering conjugate-heat-transfer coupling cpp dealii fenics fluent fluid-structure-interaction high-performance-computing multi-physics multiphysics openfoam precice research-and-development simulation su2
Last synced: 14 May 2025
https://github.com/MarioSieg/magnetron
(WIP) A small but powerful, homemade PyTorch from scratch.
artificial-intelligence cpp cuda high-performance-computing machine-learning neuronal-network python pytorch research-project tensorflow tiny
Last synced: 15 Sep 2025
https://github.com/zanellia/prometeo
An experimental Python-to-C transpiler and domain specific language for embedded high-performance computing
c compiler domain-specific-language embedded-systems high-performance-computing hpc python python-to-c source-to-source static-analysis static-typing transcompiler transpiler
Last synced: 16 May 2025
https://github.com/austinksmith/hamsters.js
100% Vanilla Javascript Multithreading & Parallel Execution Library
concurrent-programming future-proofing high-performance-computing multithreaded multithreading nodejs-server parallel-processing performance react-native-app task-processor task-runner threadpool throughput throughput-performance web-application webworkers worker worker-pool worker-threads
Last synced: 14 May 2025
https://github.com/Geant4/geant4
Geant4 toolkit for the simulation of the passage of particles through matter - NIM A 506 (2003) 250-303
computational-physics geometry hadron-physics high-energy-physics high-performance-computing medical-physics monte-carlo-simulation multiphysics-simulation neutron-transport nuclear-data particle-tracking particle-transport photon-transport radiation-transport scientific-computing shielding visualization
Last synced: 09 Jul 2025
https://github.com/austinksmith/Hamsters.js
100% Vanilla Javascript Multithreading & Parallel Execution Library
concurrent-programming future-proofing high-performance-computing multithreaded multithreading nodejs-server parallel-processing performance react-native-app task-processor task-runner threadpool throughput throughput-performance web-application webworkers worker worker-pool worker-threads
Last synced: 06 Apr 2025
https://github.com/llnl/sundials
Official development repository for SUNDIALS - a SUite of Nonlinear and DIfferential/ALgebraic equation Solvers. Pull requests are welcome for bug fixes and minor changes.
dae-solver high-performance-computing hpc math-physics nonlinear-equation-solver ode-solver parallel-computing radiuss scientific-computing sensitivity-analysis solver time-integration
Last synced: 15 May 2025
https://github.com/brucefan1983/GPUMD
Graphics Processing Units Molecular Dynamics
cuda gpu gpumd heat-transport high-performance-computing machine-learning machine-learning-potential molecular-dynamics molecular-dynamics-simulation natural-evolution-strategies neural-network neuroevolution phonon physics-simulation simulation
Last synced: 04 May 2025
https://github.com/mariosieg/magnetron
(WIP) A small but powerful, homemade PyTorch from scratch.
artificial-intelligence cpp cuda high-performance-computing machine-learning neuronal-network python pytorch research-project tensorflow tiny
Last synced: 08 Apr 2025
https://github.com/spcl/dace
DaCe - Data Centric Parallel Programming
cuda fpga high-level-synthesis high-performance-computing programming-language vivado-hls
Last synced: 14 May 2025
https://github.com/developerpaul123/thread-pool
A modern, fast, lightweight thread pool library based on C++20
c-plus-plus concurrency cplusplus cpp cpp20 cpp20-library fast header-only high-performance high-performance-computing modern-cpp performance thread thread-pool thread-pool-implementations threading threadpool threads
Last synced: 15 May 2025
https://github.com/pypr/pysph
A framework for Smoothed Particle Hydrodynamics in Python
cython fluid-simulation framework gas-dynamics high-performance-computing message-passing-interface opencl python-library scientific-computing smoothed-particle-hydrodynamics solid-mechanics
Last synced: 08 Apr 2025
https://github.com/DeveloperPaul123/thread-pool
A modern, fast, lightweight thread pool library based on C++20
c-plus-plus concurrency cplusplus cpp cpp20 cpp20-library fast header-only high-performance high-performance-computing modern-cpp performance thread thread-pool thread-pool-implementations threading threadpool threads
Last synced: 08 May 2025
https://github.com/neuronsimulator/nrn
NEURON Simulator
high-performance-computing neuron neuroscience simulation
Last synced: 14 May 2025
https://github.com/cselab/aphros
Finite volume solver for incompressible multiphase flows with surface tension. Foaming flows in complex geometries.
cfd chemical-engineering fluid high-performance-computing multiphase-flow paraview simulation surface-tension
Last synced: 14 Mar 2025
https://github.com/mpi4jax/mpi4jax
Zero-copy MPI communication of JAX arrays, for turbo-charged HPC applications in Python
gpu high-performance-computing jax jit mpi parallel-computing xla
Last synced: 21 Oct 2025
https://github.com/uncomplicate/bayadera
High-performance Bayesian Data Analysis on the GPU in Clojure
bayesian bayesian-data-analysis bayesian-inference clojure clojure-library cuda gpu gpu-acceleration gpu-computing high-performance-computing machine-learning markov-chain-monte-carlo mcmc opencl statistics
Last synced: 09 Apr 2025
https://github.com/GraphIt-DSL/graphit
GraphIt - A High-Performance Domain Specific Language for Graph Analytics
code-generation compiler domain-specific-language graph-analytics graph-computing high-performance-computing m machine-learning parallel-computing
Last synced: 04 May 2025
https://github.com/sciml/surrogates.jl
Surrogate modeling and optimization for scientific machine learning (SciML)
automatic-differentiation differential-equations high-performance-computing julia optimization scientific-machine-learning sciml surrogate surrogate-based-optimization surrogate-models surrogates
Last synced: 15 May 2025
https://github.com/philipturner/metal-flash-attention
FlashAttention (Metal Port)
artificial-intelligence attention-mechanism high-performance-computing metal software-engineering stable-diffusion transformer-models
Last synced: 25 Mar 2025
https://github.com/huggingface/datablations
Scaling Data-Constrained Language Models
gpt high-performance-computing language-models large-language-models llms scaling-laws
Last synced: 14 Oct 2025
https://github.com/SciML/Surrogates.jl
Surrogate modeling and optimization for scientific machine learning (SciML)
automatic-differentiation differential-equations high-performance-computing julia optimization scientific-machine-learning sciml surrogate surrogate-based-optimization surrogate-models surrogates
Last synced: 04 May 2025
https://github.com/mrshaw01/software-engineer
A curated learning repository focused on High-Performance Computing (HPC) — covering fundamentals to advanced topics in CUDA, MPI, C++, and Python-C++ interoperability.
cpp cuda high-performance-computing hip python
Last synced: 16 Jul 2025
https://github.com/dionhaefner/pyhpc-benchmarks
A suite of benchmarks for CPU and GPU performance of the most popular high-performance libraries for Python
benchmarks cupy gpu high-performance-computing jax parallel-computing python pytorch tensorflow
Last synced: 12 Apr 2025
https://github.com/QMCPACK/qmcpack
Main repository for QMCPACK, an open-source production level many-body ab initio Quantum Monte Carlo code for computing the electronic structure of atoms, molecules, and solids with full performance portable GPU support
c-plus-plus cuda electronic-structure gpu high-performance-computing hpc mpi oneapi quantum-chemistry quantum-monte-carlo rocm
Last synced: 26 Mar 2025
https://github.com/ornladios/adios2
Next generation of ADIOS developed in the Exascale Computing Program
adios cmake ecp exascale exascale-computing hdf5 high-performance-computing hpc io
Last synced: 21 Oct 2025
https://github.com/curvineio/curvine
High-performance distributed cache system, built in Rust.
ai ai-infra bigdata cache-storage cloud-native hdfs high-performance-computing io rust s3 shuffle spark train-acceleration
Last synced: 11 Aug 2025
https://github.com/zero-one-group/geni
A Clojure dataframe library that runs on Spark
big-data clojure clojure-library clojure-repl data-engineering data-science dataframe distributed-computing high-performance-computing machine-learning parallel-computing spark
Last synced: 04 Apr 2025
https://github.com/Xiangyu-Hu/SPHinXsys
SPHinXsys provides C++ APIs for engineering simulation and optimization. It aims at complex systems driven by fluid, structure, multi-body dynamics and beyond. The multi-physics library is based on a unique and unified computational framework by which strong coupling has been achieved for all involved physics.
computer-aided-engineering cpp finite-volume-method fluid-dynamics fluid-structure-interaction gpu high-performance-computing multi-physics multi-platforms multiphysics-coupling research-and-development smoothed-particle-hydrodynamics solid-dynamics sycl
Last synced: 04 Apr 2025
https://github.com/mratsim/laser
The HPC toolbox: fused matrix multiplication, convolution, data-parallel strided tensor primitives, OpenMP facilities, SIMD, JIT Assembler, CPU detection, state-of-the-art vectorized BLAS for floats and integers
assembler blas compiler-optimization convolution deep-learning gemm high-performance-computing jit matrix-multiplication openmp parallel runtime-cpu-detection simd tensor
Last synced: 08 Apr 2025
https://github.com/uncomplicate/clojurecl
ClojureCL is a Clojure library for parallel computations with OpenCL.
amd-opencl clojure clojure-library gpu-computing high-performance high-performance-computing intel nvidia opencl parallel-computations
Last synced: 13 Apr 2025
https://github.com/Trinkle23897/Fast-Poisson-Image-Editing
A fast Poisson image editing implementation that can use a multi-core CPU or a GPU to handle high-resolution image input.
cpp cuda high-performance-computing image-processing jacobi-iteration jacobi-method mpi numpy openmp parallel-computing poisson-image-editing pybind11 python
Last synced: 02 Apr 2025
https://github.com/trinkle23897/fast-poisson-image-editing
A fast Poisson image editing implementation that can use a multi-core CPU or a GPU to handle high-resolution image input.
cpp cuda high-performance-computing image-processing jacobi-iteration jacobi-method mpi numpy openmp parallel-computing poisson-image-editing pybind11 python
Last synced: 05 Apr 2025
https://github.com/sciml/nonlinearsolve.jl
High-performance and differentiation-enabled nonlinear solvers (Newton methods), bracketed rootfinding (bisection, Falsi), with sparsity and Newton-Krylov support.
bracketing deep-equilibrium-models differential-equations equilibrium factorization high-performance-computing julia newton-krylov newton-method newton-raphson nonlinear-equations scientific-machine-learning sciml sparse-matrices sparse-matrix steady-state
Last synced: 14 May 2025
https://github.com/SciML/NonlinearSolve.jl
High-performance and differentiation-enabled nonlinear solvers (Newton methods), bracketed rootfinding (bisection, Falsi), with sparsity and Newton-Krylov support.
bracketing deep-equilibrium-models differential-equations equilibrium factorization high-performance-computing julia newton-krylov newton-method newton-raphson nonlinear-equations scientific-machine-learning sciml sparse-matrices sparse-matrix steady-state
Last synced: 04 May 2025
https://github.com/df308/x9
High-performance message passing library
high-frequency-trading high-performance-computing low-latency ultra-low-latency
Last synced: 09 Aug 2025
https://github.com/flame/libflame
High-performance object-based library for DLA computations
flame high-performance high-performance-computing lapack linear-algebra linear-algebra-library matrix-computations matrix-functions matrix-library
Last synced: 01 Aug 2025
https://github.com/hongbo-miao/hongbomiao.com
A personal research and development (R&D) lab that facilitates the sharing of knowledge.
aerospace cloud-native computational-fluid-dynamics computer-vision continuous-machine-learning distributed-tracing embedded graphql high-performance-computing infrastructure-as-code kubernetes llm matlab mlops national-instruments neural-network robot-operating-system rust service-mesh veristand
Last synced: 15 May 2025
https://github.com/iqusoft/intel-qs
High-performance simulator of quantum circuits
cloud-computing high-performance-computing intel-quantum-simulator quantum-circuits quantum-computing
Last synced: 16 Apr 2025
https://github.com/ECP-copa/Cabana
Performance-portable library for particle-based simulations
co-design exascale exascale-computing high-performance-computing hpc kokkos particles
Last synced: 28 Mar 2025
https://github.com/ceed/libceed
CEED Library: Code for Efficient Extensible Discretizations
api ceed cuda ecp exascale-computing gpu high-order high-performance-computing hpc julia linear-algebra
Last synced: 15 May 2025
https://github.com/CEED/libCEED
CEED Library: Code for Efficient Extensible Discretizations
api ceed cuda ecp exascale-computing gpu high-order high-performance-computing hpc julia linear-algebra
Last synced: 07 May 2025
https://github.com/hermit-os/libhermit
HermitCore: A C-based, lightweight unikernel
cloud-computing high-performance-computing kernel multi-kernel operating-system osdev unikernel virtualization
Last synced: 30 Mar 2025
https://github.com/r-lib/mirai
mirai - Minimalist Async Evaluation Framework for R
async asynchronous-tasks concurrency distributed-computing high-performance-computing parallel-computing r
Last synced: 05 Apr 2025
https://github.com/intel/intel-qs
High-performance simulator of quantum circuits
cloud-computing high-performance-computing intel-quantum-simulator quantum-circuits quantum-computing
Last synced: 02 Apr 2025
https://github.com/esa/torchquad
Numerical integration in arbitrary dimensions on the GPU using PyTorch / TF / JAX
automatic-differentiation gpu high-performance-computing integration machine-learning monte-carlo-integration multidimensional-integration numerical-integration python pytorch torchquad vegas vegas-enhanced
Last synced: 15 May 2025
https://github.com/dlr-amr/t8code
Parallel algorithms and data structures for tree-based adaptive mesh refinement (AMR) with arbitrary element shapes.
adaptive-mesh-refinement high-performance-computing hpc mesh modeling mpi parallel parallel-computing simulation
Last synced: 16 May 2025
https://github.com/projectphysx/opencl-benchmark
A small OpenCL benchmark program to measure peak GPU/CPU performance.
bandwidth benchmark benchmarking flops gpgpu gpu gpu-computing high-performance-computing hpc opencl tool tools
Last synced: 04 Apr 2025
https://github.com/tikv/minstant
Performant time measuring in Rust
high-performance high-performance-computing timing tsc
Last synced: 12 Apr 2025
https://github.com/springer13/hptt
High-Performance Tensor Transpose library
high-performance-computing multidimensional-arrays tensor tensor-transposition tensors transposition
Last synced: 09 Jul 2025
https://github.com/CaNS-World/CaNS
A code for fast, massively-parallel direct numerical simulations (DNS) of canonical flows
cfd computational-fluid-dynamics fluid-dynamics fluid-simulation fortran gpu gpu-computing high-performance-computing turbulence
Last synced: 14 Mar 2025
https://github.com/p-costa/CaNS
A code for fast, massively-parallel direct numerical simulations (DNS) of canonical flows
cfd computational-fluid-dynamics fluid-dynamics fluid-simulation fortran gpu gpu-computing high-performance-computing turbulence
Last synced: 22 Feb 2025
https://github.com/hao-lh/the-books-making-you-better
A list of timeless classic books that help you understand not only how things work, but also when and why they work that way.
bayesian-inference computer-architecture computer-vision deep-learning high-performance-computing linear-algebra machine-learning probabilistic-graphical-models reinforcement-learning statistical-learning
Last synced: 15 Apr 2025
https://github.com/librapid/librapid
A highly optimised C++ library for mathematical applications and neural networks.
array cpp cpp20 cpp23 cuda gpu high-performance-computing library matrix multidimensional-arrays multithreading parallel-programming pypy pypy3 python python3 simd
Last synced: 08 Oct 2025
https://github.com/lanl/vpic
Vector Particle-In-Cell (VPIC) Project
high-performance high-performance-computing hpc hpc-applications particle-in-cell
Last synced: 05 Oct 2025
https://github.com/DLR-AMR/t8code
Parallel algorithms and data structures for tree-based adaptive mesh refinement (AMR) with arbitrary element shapes.
adaptive-mesh-refinement high-performance-computing hpc mesh modeling mpi parallel parallel-computing simulation
Last synced: 09 Sep 2025
https://github.com/LibRapid/librapid
A highly optimised C++ library for mathematical applications and neural networks.
array cpp cpp20 cpp23 cuda gpu high-performance-computing library matrix multidimensional-arrays multithreading parallel-programming pypy pypy3 python python3 simd
Last synced: 01 Aug 2025
https://github.com/mschubert/clustermq
R package to send function calls as jobs on LSF, SGE, Slurm, PBS/Torque, or each via SSH
cluster high-performance-computing lsf r-package sge slurm ssh
Last synced: 15 May 2025
https://github.com/arborx/arborx
Performance-portable geometric search library
bounding-volume-hierarchy c-plus-plus clustering cpp cuda dbscan distributed gpu hdbscan high-performance-computing hpc knn-search kokkos mpi nearest-neighbors parallel
Last synced: 10 Apr 2025
https://github.com/ropensci/tarchetypes
Archetypes for targets and pipelines
data-science high-performance-computing peer-reviewed pipeline r r-package r-targetopia reproducibility rstats targets workflow
Last synced: 16 May 2025
https://github.com/kahypar/mt-kahypar
Mt-KaHyPar (Multi-Threaded Karlsruhe Hypergraph Partitioner) is a shared-memory multilevel graph and hypergraph partitioner equipped with parallel implementations of techniques used in the best sequential partitioning algorithms. Mt-KaHyPar can partition extremely large hypergraphs very fast and with high quality.
algorithm-engineering graph-algorithms graph-partitioning graphs high-performance-computing hypergraph hypergraph-partitioning hypergraphs parallel-computing partitioning partitioning-algorithms shared-memory tbb
Last synced: 04 Apr 2025
https://github.com/shikokuchuo/mirai
mirai - Minimalist Async Evaluation Framework for R
asynchronous-tasks concurrency cran distributed-computing high-performance-computing parallel-programming promises r r-package rstats
Last synced: 29 Mar 2025
https://github.com/parthenon-hpc-lab/parthenon
Parthenon AMR infrastructure
amr high-performance-computing kokkos parthenon
Last synced: 21 Oct 2025
https://github.com/pranabdas/espresso
Notes and tutorials on Density Functional Theory calculation using Quantum ESPRESSO.
density-functional-theory dft first-principles-calculations high-performance-computing hpc materials-modelling quantum-espresso tutorial wannier
Last synced: 07 May 2025
https://github.com/carstenbauer/threadpinning.jl
Readily pin Julia threads to CPU-threads
high-performance-computing julia multithreading thread-affinities
Last synced: 07 Sep 2025
https://github.com/aronszanto/sLSM-Tree
High-Performance C++ Data System
big-data data-system high-performance high-performance-computing lsm-tree multithreading skiplist
Last synced: 16 May 2025