Projects in Awesome Lists tagged with hpc
A curated list of projects in awesome lists tagged with hpc .
https://github.com/julialang/julia
The Julia Programming Language
hacktoberfest hpc julia julia-language julialang machine-learning numerical programming-language science scientific
Last synced: 09 Feb 2026
https://github.com/JuliaLang/julia
The Julia Programming Language
hacktoberfest hpc julia julia-language julialang machine-learning numerical programming-language science scientific
Last synced: 14 Mar 2025
https://github.com/hpcaitech/colossalai
Making large AI models cheaper, faster and more accessible
ai big-model data-parallelism deep-learning distributed-computing foundation-models heterogeneous-training hpc inference large-scale model-parallelism pipeline-parallelism
Last synced: 09 Sep 2025
https://github.com/hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
ai big-model data-parallelism deep-learning distributed-computing foundation-models heterogeneous-training hpc inference large-scale model-parallelism pipeline-parallelism
Last synced: 19 Mar 2025
https://github.com/volcano-sh/volcano
A Cloud Native Batch System (Project under CNCF)
ai batch-systems bigdata gene golang hpc kubernetes machine-learning serving training
Last synced: 31 Jan 2026
https://github.com/arrayfire/arrayfire
ArrayFire: a general purpose GPU library.
arrayfire c c-plus-plus cpp cuda gpgpu gpu hpc opencl performance scientific-computing
Last synced: 13 May 2025
https://github.com/spack/spack
A flexible package manager that supports multiple versions, configurations, platforms, and compilers.
build-tools hpc hpsf linux macos package-manager python radiuss scientific-computing spack windows
Last synced: 22 Feb 2026
https://github.com/projectphysx/fluidx3d
The fastest and most memory efficient lattice Boltzmann CFD software, running on all GPUs and CPUs via OpenCL. Free for non-commercial use.
benchmark cfd computational-fluid-dynamics fluid-dynamics fluid-simulation fluid-solver gpgpu gpu gpu-computing high-performance-computing hpc interactive-visualization lattice-boltzmann lbm opencl physics raytracing scientific-computing scientific-visualization simulation
Last synced: 13 May 2025
https://github.com/ProjectPhysX/FluidX3D
The fastest and most memory efficient lattice Boltzmann CFD software, running on all GPUs and CPUs via OpenCL. Free for non-commercial use.
benchmark cfd computational-fluid-dynamics fluid-dynamics fluid-simulation fluid-solver gpgpu gpu gpu-computing high-performance-computing hpc interactive-visualization lattice-boltzmann lbm opencl physics raytracing scientific-computing scientific-visualization simulation
Last synced: 26 Mar 2025
https://github.com/nextflow-io/nextflow
A DSL for data-driven computational pipelines
aws bioinformatics cloud dataflow docker groovy hello hpc nextflow pipeline pipeline-framework reproducible-research reproducible-science sge singularity singularity-containers slurm workflow-engine
Last synced: 13 May 2025
https://github.com/apptainer/singularity
Singularity has been renamed to Apptainer as part of us moving the project to the Linux Foundation. This repo has been persisted as a snapshot right before the changes.
cloud-native container containers hpc linux parallel portability portable reproducible reproducible-science rootless-containers science singularity singularity-container
Last synced: 15 Jan 2026
https://github.com/open-mpi/ompi
Open MPI main development repository
c fortran hacktoberfest hpc mpi openmpi
Last synced: 13 May 2025
https://github.com/flame/blis
BLAS-like Library Instantiation Software Framework
blas blas-libraries blis high-performance high-performance-computing hpc linear-algebra linear-algebra-library matrix matrix-calculations matrix-functions matrix-library matrix-multiplication optimization
Last synced: 25 Feb 2025
https://github.com/chapel-lang/chapel
a Productive Parallel Programming Language
chapel compiler concurrency distributed-computing gpu high-performance-computing hpc language open-source parallel parallel-computing performance productive programming-language scientific-computing
Last synced: 14 May 2025
https://github.com/mfem/mfem
Lightweight, general, scalable C++ library for finite element methods
amr computational-science fem finite-elements high-order high-performance-computing hpc math-physics parallel-computing radiuss scientific-computing
Last synced: 12 Dec 2025
https://github.com/ashvardanian/BenchmarkingTutorial
Playing around "Less Slow" coding practices in C++ 20, C, CUDA, PTX, & Assembly, from numerics & SIMD to coroutines, ranges, exception handling, networking and user-space IO
assembly assembly-language avx512 benchmark coroutines cpp cpp-programming cpp17 cpp20 cuda gcc google-benchmark hpc io-uring linux-kernel llvm ptx ranges tutorial tutorials
Last synced: 26 Jun 2025
https://github.com/boostorg/compute
A C++ GPU Computing Library for OpenCL
boost c-plus-plus compute cpp gpgpu gpu hpc opencl performance
Last synced: 17 Dec 2025
https://github.com/nvidia/cccl
CUDA Core Compute Libraries
accelerated-computing cpp cpp-programming cuda cuda-cpp cuda-kernels cuda-library cuda-programming gpu gpu-acceleration gpu-computing gpu-programming hpc modern-cpp nvidia nvidia-gpu parallel-algorithm parallel-computing parallel-programming
Last synced: 05 Feb 2026
https://github.com/adaptivecpp/adaptivecpp
Compiler for multiple programming models (SYCL, C++ standard parallelism, HIP/CUDA) for CPUs and GPUs from all vendors: The independent, community-driven compiler for C++-based heterogeneous programming models. Lets applications adapt themselves to all the hardware in the system - even at runtime!
adaptivecpp compiler gpgpu gpu-computing high-performance high-performance-computing hipsycl hpc opensycl stdpar sycl
Last synced: 20 Apr 2026
https://boostorg.github.io/compute/
A C++ GPU Computing Library for OpenCL
boost c-plus-plus compute cpp gpgpu gpu hpc opencl performance
Last synced: 30 Apr 2025
https://github.com/AdaptiveCpp/AdaptiveCpp
Compiler for multiple programming models (SYCL, C++ standard parallelism, HIP/CUDA) for CPUs and GPUs from all vendors: The independent, community-driven compiler for C++-based heterogeneous programming models. Lets applications adapt themselves to all the hardware in the system - even at runtime!
adaptivecpp compiler gpgpu gpu-computing high-performance high-performance-computing hipsycl hpc opensycl stdpar sycl
Last synced: 21 Apr 2025
https://github.com/su2code/su2
SU2: An Open-Source Suite for Multiphysics Simulation and Design
c-plus-plus cfd flow fluid fluid-dynamics hpc opensource optimization physics python simulation
Last synced: 14 May 2025
https://indigo-dc.github.io/udocker/
A basic user tool to execute simple docker containers in batch or interactive systems without root privileges.
batch chroot containers deep-hybrid-datacloud docker docker-containers emulation eosc-hub fakechroot grid hpc indigo proot root-privileges runc user
Last synced: 07 May 2025
https://github.com/indigo-dc/udocker
A basic user tool to execute simple docker containers in batch or interactive systems without root privileges.
batch chroot containers deep-hybrid-datacloud docker docker-containers emulation eosc-hub fakechroot grid hpc indigo proot root-privileges runc user
Last synced: 10 Apr 2025
https://github.com/nvidia/matx
An efficient C++17 GPU numerical computing library with Python-like syntax
cuda gpgpu gpu gpu-computing hpc
Last synced: 04 Mar 2026
https://github.com/openucx/ucx
Unified Communication X (mailing list - https://elist.ornl.gov/mailman/listinfo/ucx-group)
aries c c-plus-plus cray drivers gemini hacktoberfest hpc infiniband iwarp mpi networking openshmem pgas rdma roce shared-memory shmem tcp-ip verbs
Last synced: 24 May 2026
https://github.com/apptainer/apptainer
Apptainer: Application containers for Linux
apptainer containers hpc linux rootless-containers science singularity singularity-container
Last synced: 13 May 2025
https://github.com/NVIDIA/MatX
An efficient C++17 GPU numerical computing library with Python-like syntax
cuda gpgpu gpu gpu-computing hpc
Last synced: 26 Mar 2025
https://github.com/trilinos/trilinos
Primary repository for the Trilinos Project
c-plus-plus high-performance-computing hpc hpsf sandia-national-laboratories scientific-computing snl-science-libs trilinos
Last synced: 14 May 2025
https://github.com/NVIDIA/cccl
CUDA Core Compute Libraries
accelerated-computing cpp cpp-programming cuda cuda-cpp cuda-kernels cuda-library cuda-programming gpu gpu-acceleration gpu-computing gpu-programming hpc modern-cpp nvidia nvidia-gpu parallel-algorithm parallel-computing parallel-programming
Last synced: 14 May 2025
https://github.com/su2code/SU2
SU2: An Open-Source Suite for Multiphysics Simulation and Design
c-plus-plus cfd flow fluid fluid-dynamics hpc opensource optimization physics python simulation
Last synced: 14 Mar 2025
https://github.com/jfalcou/eve
Expressive Vector Engine - SIMD in C++ Goes Brrrr
aarch64 altivec avx avx2 cpp cpp-library hpc neon simd simd-library simd-parallelism simd-programming sse2 ssse3
Last synced: 02 Jul 2025
https://github.com/kubernetes-retired/kube-batch
A batch scheduler of kubernetes for high performance workload, e.g. AI/ML, BigData, HPC
bigdata hpc k8s-sig-scheduling kubernetes machine-learning
Last synced: 29 Sep 2025
https://github.com/liu-xiandong/how_to_optimize_in_gpu
This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sgemv, sgemm, etc. The performance of these kernels is basically at or near the theoretical limit.
elementwise gpu-acceleration high-performance-computing hpc reduce sgemm sgemv
Last synced: 03 Oct 2025
https://github.com/gunrock/gunrock
Programmable CUDA/C++ GPU Graph Analytics
algorithm algorithms cpp cuda cxx essentials gnn gpu graph graph-algorithms graph-analytics graph-engine graph-neural-networks graph-primitives graph-processing gunrock hpc parallel-computing sparse-matrix
Last synced: 18 Jan 2026
https://github.com/futureverse/future
:rocket: R package: future: Unified Parallel and Distributed Processing in R for Everyone
asynchronous cran distributed-computing futures hpc hpc-clusters parallel-computing parallel-processing parallelization programming promises r
Last synced: 12 Dec 2025
https://github.com/raftlib/raftlib
The RaftLib C++ library, streaming/dataflow concurrency via C++ iostream-like operators
c-plus-plus cmake dataflow dataflow-programming dataflow-structure dataflows dsl hpc ipc machine opencv parallel pthreads qthread-library qthreads raftlib runtime streaming thread thread-library
Last synced: 16 May 2025
https://github.com/broadinstitute/cromwell
Scientific workflow engine designed for simplicity & scalability. Trivially transition between one off use cases to massive scale production environments
application bioinformatics cloud containers docker executor ga4gh hpc scala wdl workflow workflow-description-language workflow-execution
Last synced: 27 Mar 2025
https://github.com/RaftLib/RaftLib
The RaftLib C++ library, streaming/dataflow concurrency via C++ iostream-like operators
c-plus-plus cmake dataflow dataflow-programming dataflow-structure dataflows dsl hpc ipc machine opencv parallel pthreads qthread-library qthreads raftlib runtime streaming thread thread-library
Last synced: 15 Mar 2025
https://github.com/sylabs/singularity
SingularityCE is the Community Edition of Singularity, an open source container platform designed to be simple, fast, and secure.
Last synced: 14 May 2025
https://github.com/envmodules/modules
Environment Modules: provides dynamic modification of a user's environment
environment environment-modules hpc module modulefiles shell tcl
Last synced: 11 Mar 2026
https://github.com/agnostiqhq/covalent
Pythonic tool for orchestrating machine-learning/high performance/quantum-computing workflows in heterogeneous compute environments.
covalent data-pipeline data-science deep-learning hacktoberfest hpc hpc-applications machine-learning machinelearning machinelearning-python orchestration parallelization pipelines python quantum quantum-computing quantum-machine-learning workflow workflow-automation workflow-management
Last synced: 14 May 2025
https://github.com/arrayfire/arrayfire-rust
Rust wrapper for ArrayFire
arrayfire cuda gpgpu gpu hpc opencl rust rust-bindings
Last synced: 15 May 2025
https://github.com/Liu-xiandong/How_to_optimize_in_GPU
This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sgemv, sgemm, etc. The performance of these kernels is basically at or near the theoretical limit.
elementwise gpu-acceleration high-performance-computing hpc reduce sgemm sgemv
Last synced: 14 May 2025
https://github.com/hypre-space/hypre
Parallel solvers for sparse linear systems featuring multigrid methods.
hpc library math-physics radiuss
Last synced: 21 Oct 2025
https://github.com/chrisvoncsefalvay/learn-julia-the-hard-way
Learn Julia the hard way!
data-science hpc julia julia-language julialang language learning learning-by-doing learning-julia scientific-computing statistics technical-computing
Last synced: 16 May 2025
https://github.com/AgnostiqHQ/covalent
Pythonic tool for orchestrating machine-learning/high performance/quantum-computing workflows in heterogeneous compute environments.
covalent data-pipeline data-science deep-learning hacktoberfest hpc hpc-applications machine-learning machinelearning machinelearning-python orchestration parallelization pipelines python quantum quantum-computing quantum-machine-learning workflow workflow-automation workflow-management
Last synced: 30 Mar 2025
https://cea-hpc.github.io/modules
Environment Modules: provides dynamic modification of a user's environment
environment environment-modules hpc module modulefiles shell tcl
Last synced: 14 Mar 2025
https://github.com/romeric/Fastor
A lightweight high performance tensor algebra framework for modern C++
fpga hpc multidimensional-arrays simd small-blas tensor-contraction tensors
Last synced: 27 Apr 2025
https://github.com/sslotin/amh-code
Complete implementations from "Algorithms for Modern Hardware"
algorithms computer-science hpc performance
Last synced: 04 Apr 2025
https://github.com/uxlfoundation/onemath
oneAPI Math Library (oneMath)
api blas cpu cuda dpcpp gpu hpc intel math-libraries oneapi onemkl parallel-computing parallel-programming performance rng
Last synced: 15 May 2025
https://github.com/warewulf/warewulf
Warewulf is a stateless and diskless container operating system provisioning system for large clusters of bare metal and/or virtual systems.
clusters containers hpc provisioning stateless warewulf
Last synced: 05 May 2026
https://github.com/zanellia/prometeo
An experimental Python-to-C transpiler and domain specific language for embedded high-performance computing
c compiler domain-specific-language embedded-systems high-performance-computing hpc python python-to-c source-to-source static-analysis static-typing transcompiler transpiler
Last synced: 16 May 2025
https://github.com/nndeploy/nndeploy
nndeploy是一款模型端到端部署框架。以多端推理以及基于有向无环图模型部署为基础,致力为用户提供跨平台、简单易用、高性能的模型部署体验。
ascend easy-to-use hpc mnn model-deployment multi-inference openvino out-of-box-model parallel rknn tensorrt yolo
Last synced: 14 Dec 2025
https://github.com/llnl/sundials
Official development repository for SUNDIALS - a SUite of Nonlinear and DIfferential/ALgebraic equation Solvers. Pull requests are welcome for bug fixes and minor changes.
dae-solver high-performance-computing hpc math-physics nonlinear-equation-solver ode-solver parallel-computing radiuss scientific-computing sensitivity-analysis solver time-integration
Last synced: 15 May 2025
https://github.com/tacc/lmod
Lmod: An Environment Module System based on Lua, Reads TCL Modules, Supports a Software Hierarchy
environment-modules hpc lmod lua tacc xsede
Last synced: 12 Feb 2026
https://github.com/lablup/backend.ai
Backend.AI is a streamlined, container-based computing cluster platform that hosts popular computing/ML frameworks and diverse programming languages, with pluggable heterogeneous accelerator support including CUDA GPU, ROCm GPU, TPU, IPU and other NPUs.
api backendai cloud-computing containers distributed-computing docker documentation hpc monitoring paas python
Last synced: 02 Apr 2026
https://github.com/openhackathons-org/gpubootcamp
This repository consists for gpu bootcamp material for HPC and AI
ai4hpc cuda data-science deep-learning deepstream gpu hpc machine-learning mpi openacc openmp rapidsai
Last synced: 27 Mar 2025
https://github.com/easybuilders/easybuild
EasyBuild - building software with ease
hacktoberfest hpc linux python scientific-software
Last synced: 17 Mar 2026
https://github.com/visit-dav/visit
VisIt - Visualization and Data Analysis for Mesh-based Scientific Data
data-analysis data-viz hpc python radiuss scientific-computing scientific-visualization visualization
Last synced: 14 Jan 2026
https://github.com/ashvardanian/less_slow.cpp
Learning how to write "Less Slow" code in C++ 20, C 99, CUDA, PTX, & Assembly, from numerics & SIMD to coroutines, ranges, exception handling, networking and user-space IO
assembly assembly-language avx512 benchmark coroutines cpp cpp-programming cpp17 cpp20 cuda gcc google-benchmark hpc io-uring linux-kernel llvm ptx ranges tutorial tutorials
Last synced: 08 Apr 2025
https://github.com/TACC/Lmod
Lmod: An Environment Module System based on Lua, Reads TCL Modules, Supports a Software Hierarchy
environment-modules hpc lmod lua tacc xsede
Last synced: 17 Jul 2025
https://github.com/haptork/easyLambda
distributed dataflows with functional list operations for data processing with C++14
cpp14 dataflow-programming distributed-computing functional-programming hpc mpi parallel
Last synced: 15 Mar 2025
https://github.com/haptork/easylambda
distributed dataflows with functional list operations for data processing with C++14
cpp14 dataflow-programming distributed-computing functional-programming hpc mpi parallel
Last synced: 06 Apr 2025
https://github.com/nvidia/hpc-container-maker
HPC Container Maker
containers docker hpc singularity
Last synced: 14 May 2025
https://github.com/ginkgo-project/ginkgo
Numerical linear algebra software package
cuda dpcpp gpu-computing hip hpc krylov-methods linear-algebra oneapi openmp preconditioning sparse-linear-systems spmv
Last synced: 15 May 2025
https://github.com/NVIDIA/hpc-container-maker
HPC Container Maker
containers docker hpc singularity
Last synced: 14 Mar 2025
https://github.com/luispedro/jug
Parallel programming with Python
hpc parallel-computing python python-2 python-3 workflow workflow-engine
Last synced: 14 May 2025
https://github.com/oracle/coherence
Oracle Coherence Community Edition
caching cloud clustering coherence data-grid distributed hpc imdg in-memory java kv-store microservices polyglot scalability
Last synced: 24 Dec 2025
https://github.com/arrayfire/arrayfire-python
Python bindings for ArrayFire: A general purpose GPU library.
arrayfire cuda gpgpu gpu hpc opencl python python-bindings
Last synced: 02 Apr 2025
https://github.com/blitzpp/blitz
Blitz++ Multi-Dimensional Array Library for C++
array array-manipulations blitz cpp-library high-performance hpc multi-dimensional-array numerical-calculations numerical-computation numerics partial-evaluators scientific-computing template-metaprogramming tensor vector
Last synced: 21 Oct 2025
https://github.com/it4innovations/hyperqueue
Scheduler for sub-node tasks for HPC systems with batch scheduling
distributed-computing hpc rust task-graph
Last synced: 15 May 2025
https://github.com/ParRes/Kernels
This is a set of simple programs that can be used to explore the features of a parallel platform.
c c-plus-plus coarray-fortran fortran2008 hpc julia kokkos mpi openacc opencl openmp parallel parallel-programming pgas python3 shmem sycl threading
Last synced: 01 Apr 2025
https://github.com/easybuilders/easybuild-easyconfigs
A collection of easyconfig files that describe which software to build using which build options with EasyBuild.
hpc linux python scientific-software
Last synced: 13 May 2025
https://github.com/gem/oq-engine
OpenQuake Engine: a software for Seismic Hazard and Risk Analysis
cluster earthquakes hazard hazard-assessment hpc openquake openquake-engine psha python risk risk-analysis risk-assessment scientific-computing seismic
Last synced: 10 Mar 2026
https://github.com/llnl/Umpire
An application-focused API for memory management on NUMA & GPU architectures
blt cpp gpu hpc memory-management portability radiuss
Last synced: 31 Mar 2026
https://github.com/It4innovations/hyperqueue
Scheduler for sub-node tasks for HPC systems with batch scheduling
distributed-computing hpc rust task-graph
Last synced: 01 Apr 2025
https://github.com/juliaparallel/mpi.jl
MPI wrappers for Julia
hpc julia julia-language microsoft-mpi mpi mpich openmpi
Last synced: 14 May 2025
https://github.com/alpaka-group/alpaka
Abstraction Library for Parallel Kernel Acceleration :llama:
cpp cpp17 cuda gpu header-only heterogeneous-parallel-programming hip hpc openacc openmp rocm tbb
Last synced: 15 May 2025
https://github.com/Nek5000/Nek5000
our classic
anl cfd flow fluid high-order hpc navier-stokes sem spectral
Last synced: 14 Mar 2025
https://github.com/llnl/caliper
Caliper is an instrumentation and performance profiling library
annotation-apis caliper cpp hpc instrumentation performance performance-analysis performance-monitoring radiuss trace
Last synced: 14 May 2025
https://github.com/nek5000/nekrs
our next generation fast and scalable CFD code
cfd exascale gpu high-order hpc turbulence
Last synced: 19 Feb 2026
https://github.com/llnl/umpire
An application-focused API for memory management on NUMA & GPU architectures
blt cpp gpu hpc memory-management portability radiuss
Last synced: 15 May 2025
https://github.com/giovtorres/slurm-docker-cluster
A Slurm cluster using docker-compose
docker-compose hpc slurm slurm-cluster
Last synced: 24 Feb 2026
https://github.com/pipefunc/pipefunc
Lightweight fast function pipeline (DAG) creation in pure Python for scientific workflows 🕸️🧪
dag hpc parallel-computing pipeline-framework pipelines reproducible-research slurm workflow-engine
Last synced: 16 Dec 2025
https://github.com/LLNL/Umpire
An application-focused API for memory management on NUMA & GPU architectures
blt cpp gpu hpc memory-management portability radiuss
Last synced: 11 May 2025
https://github.com/uob-hpc/babelstream
STREAM, for lots of devices written in many programming models
benchmark cuda gpgpu gpu hpc kokkos memory-bandwidth openacc opencl openmp parallel-processing raja sycl
Last synced: 21 Oct 2025
https://github.com/LLNL/Caliper
Caliper is an instrumentation and performance profiling library
annotation-apis caliper cpp hpc instrumentation performance performance-analysis performance-monitoring radiuss trace
Last synced: 08 May 2025
https://github.com/cnuernber/dtype-next
A Clojure library designed to aid in the implementation of high performance algorithms and systems.
Last synced: 16 May 2025