An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with hpc

A curated list of projects in awesome lists tagged with hpc.

https://github.com/volcano-sh/volcano

A Cloud Native Batch System (Project under CNCF)

ai batch-systems bigdata gene golang hpc kubernetes machine-learning serving training

Last synced: 31 Jan 2026

https://github.com/arrayfire/arrayfire

ArrayFire: a general purpose GPU library.

arrayfire c c-plus-plus cpp cuda gpgpu gpu hpc opencl performance scientific-computing

Last synced: 13 May 2025

https://github.com/spack/spack

A flexible package manager that supports multiple versions, configurations, platforms, and compilers.

build-tools hpc hpsf linux macos package-manager python radiuss scientific-computing spack windows

Last synced: 22 Feb 2026

https://github.com/diku-dk/futhark

:boom::computer::boom: A data-parallel functional programming language

boom compiler cuda futhark gpgpu gpu hacktoberfest hpc language opencl

Last synced: 05 May 2026

https://github.com/apptainer/singularity

Singularity has been renamed to Apptainer as part of us moving the project to the Linux Foundation. This repo has been persisted as a snapshot right before the changes.

cloud-native container containers hpc linux parallel portability portable reproducible reproducible-science rootless-containers science singularity singularity-container

Last synced: 15 Jan 2026

https://github.com/open-mpi/ompi

Open MPI main development repository

c fortran hacktoberfest hpc mpi openmpi

Last synced: 13 May 2025

https://github.com/zml/zml

Any model. Any hardware. Zero compromise. Built with @ziglang / @openxla / MLIR / @bazelbuild

ai bazel hpc inference xla zig

Last synced: 12 Apr 2025

https://github.com/ChunelFeng/CGraph

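A commonly used C++ DAG framework: a general-purpose, cross-platform parallel computing framework based on flow graphs, with no third-party dependencies, listed in awesome-cpp. Stars, forks, and discussion are welcome.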

ai cpp dag graph hpc pipeline pybind11 python taskflow threadpool workflow

Last synced: 18 Mar 2025

https://github.com/ashvardanian/BenchmarkingTutorial

Playing around with "Less Slow" coding practices in C++ 20, C, CUDA, PTX, & Assembly, from numerics & SIMD to coroutines, ranges, exception handling, networking and user-space IO

assembly assembly-language avx512 benchmark coroutines cpp cpp-programming cpp17 cpp20 cuda gcc google-benchmark hpc io-uring linux-kernel llvm ptx ranges tutorial tutorials

Last synced: 26 Jun 2025

https://github.com/boostorg/compute

A C++ GPU Computing Library for OpenCL

boost c-plus-plus compute cpp gpgpu gpu hpc opencl performance

Last synced: 17 Dec 2025

https://github.com/adaptivecpp/adaptivecpp

Compiler for multiple programming models (SYCL, C++ standard parallelism, HIP/CUDA) for CPUs and GPUs from all vendors: The independent, community-driven compiler for C++-based heterogeneous programming models. Lets applications adapt themselves to all the hardware in the system - even at runtime!

adaptivecpp compiler gpgpu gpu-computing high-performance high-performance-computing hipsycl hpc opensycl stdpar sycl

Last synced: 20 Apr 2026

https://github.com/dtolm/vkfft

Vulkan/CUDA/HIP/OpenCL/Level Zero/Metal Fast Fourier Transform library

c2r convolution cuda dct fft hip hpc levelzero metal opencl r2c r2r vulkan

Last synced: 14 May 2025

https://boostorg.github.io/compute/

A C++ GPU Computing Library for OpenCL

boost c-plus-plus compute cpp gpgpu gpu hpc opencl performance

Last synced: 30 Apr 2025

https://github.com/AdaptiveCpp/AdaptiveCpp

Compiler for multiple programming models (SYCL, C++ standard parallelism, HIP/CUDA) for CPUs and GPUs from all vendors: The independent, community-driven compiler for C++-based heterogeneous programming models. Lets applications adapt themselves to all the hardware in the system - even at runtime!

adaptivecpp compiler gpgpu gpu-computing high-performance high-performance-computing hipsycl hpc opensycl stdpar sycl

Last synced: 21 Apr 2025

https://github.com/DTolm/VkFFT

Vulkan/CUDA/HIP/OpenCL/Level Zero/Metal Fast Fourier Transform library

c2r convolution cuda dct fft hip hpc levelzero metal opencl r2c r2r vulkan

Last synced: 26 Mar 2025

https://github.com/su2code/su2

SU2: An Open-Source Suite for Multiphysics Simulation and Design

c-plus-plus cfd flow fluid fluid-dynamics hpc opensource optimization physics python simulation

Last synced: 14 May 2025

https://indigo-dc.github.io/udocker/

A basic user tool to execute simple docker containers in batch or interactive systems without root privileges.

batch chroot containers deep-hybrid-datacloud docker docker-containers emulation eosc-hub fakechroot grid hpc indigo proot root-privileges runc user

Last synced: 07 May 2025

https://github.com/indigo-dc/udocker

A basic user tool to execute simple docker containers in batch or interactive systems without root privileges.

batch chroot containers deep-hybrid-datacloud docker docker-containers emulation eosc-hub fakechroot grid hpc indigo proot root-privileges runc user

Last synced: 10 Apr 2025

https://github.com/nvidia/matx

An efficient C++17 GPU numerical computing library with Python-like syntax

cuda gpgpu gpu gpu-computing hpc

Last synced: 04 Mar 2026

https://github.com/openucx/ucx

Unified Communication X (mailing list - https://elist.ornl.gov/mailman/listinfo/ucx-group)

aries c c-plus-plus cray drivers gemini hacktoberfest hpc infiniband iwarp mpi networking openshmem pgas rdma roce shared-memory shmem tcp-ip verbs

Last synced: 24 May 2026

https://github.com/NVIDIA/MatX

An efficient C++17 GPU numerical computing library with Python-like syntax

cuda gpgpu gpu gpu-computing hpc

Last synced: 26 Mar 2025

https://github.com/su2code/SU2

SU2: An Open-Source Suite for Multiphysics Simulation and Design

c-plus-plus cfd flow fluid fluid-dynamics hpc opensource optimization physics python simulation

Last synced: 14 Mar 2025

https://github.com/jfalcou/eve

Expressive Vector Engine - SIMD in C++ Goes Brrrr

aarch64 altivec avx avx2 cpp cpp-library hpc neon simd simd-library simd-parallelism simd-programming sse2 ssse3

Last synced: 02 Jul 2025

https://github.com/kubernetes-retired/kube-batch

A batch scheduler for Kubernetes for high-performance workloads, e.g. AI/ML, BigData, HPC

bigdata hpc k8s-sig-scheduling kubernetes machine-learning

Last synced: 29 Sep 2025

https://github.com/liu-xiandong/how_to_optimize_in_gpu

This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sgemv, sgemm, etc. The performance of these kernels is basically at or near the theoretical limit.

elementwise gpu-acceleration high-performance-computing hpc reduce sgemm sgemv

Last synced: 03 Oct 2025

https://github.com/futureverse/future

:rocket: R package: future: Unified Parallel and Distributed Processing in R for Everyone

asynchronous cran distributed-computing futures hpc hpc-clusters parallel-computing parallel-processing parallelization programming promises r

Last synced: 12 Dec 2025

https://github.com/broadinstitute/cromwell

Scientific workflow engine designed for simplicity & scalability. Trivially transition from one-off use cases to massive-scale production environments

application bioinformatics cloud containers docker executor ga4gh hpc scala wdl workflow workflow-description-language workflow-execution

Last synced: 27 Mar 2025

https://github.com/sylabs/singularity

SingularityCE is the Community Edition of Singularity, an open source container platform designed to be simple, fast, and secure.

containers hpc linux

Last synced: 14 May 2025

https://github.com/envmodules/modules

Environment Modules: provides dynamic modification of a user's environment

environment environment-modules hpc module modulefiles shell tcl

Last synced: 11 Mar 2026

https://github.com/arrayfire/arrayfire-rust

Rust wrapper for ArrayFire

arrayfire cuda gpgpu gpu hpc opencl rust rust-bindings

Last synced: 15 May 2025

https://github.com/Liu-xiandong/How_to_optimize_in_GPU

This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sgemv, sgemm, etc. The performance of these kernels is basically at or near the theoretical limit.

elementwise gpu-acceleration high-performance-computing hpc reduce sgemm sgemv

Last synced: 14 May 2025

https://github.com/hypre-space/hypre

Parallel solvers for sparse linear systems featuring multigrid methods.

hpc library math-physics radiuss

Last synced: 21 Oct 2025

https://cea-hpc.github.io/modules

Environment Modules: provides dynamic modification of a user's environment

environment environment-modules hpc module modulefiles shell tcl

Last synced: 14 Mar 2025

https://github.com/romeric/Fastor

A lightweight high performance tensor algebra framework for modern C++

fpga hpc multidimensional-arrays simd small-blas tensor-contraction tensors

Last synced: 27 Apr 2025

https://github.com/sslotin/amh-code

Complete implementations from "Algorithms for Modern Hardware"

algorithms computer-science hpc performance

Last synced: 04 Apr 2025

https://github.com/warewulf/warewulf

Warewulf is a stateless and diskless container operating system provisioning system for large clusters of bare metal and/or virtual systems.

clusters containers hpc provisioning stateless warewulf

Last synced: 05 May 2026

https://github.com/zanellia/prometeo

An experimental Python-to-C transpiler and domain specific language for embedded high-performance computing

c compiler domain-specific-language embedded-systems high-performance-computing hpc python python-to-c source-to-source static-analysis static-typing transcompiler transpiler

Last synced: 16 May 2025

https://github.com/pmodels/mpich

Official MPICH Repository

c fortran hpc mpi

Last synced: 07 Apr 2026

https://github.com/nndeploy/nndeploy

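nndeploy is an end-to-end model deployment framework. Built on multi-backend inference and directed-acyclic-graph-based model deployment, it aims to give users a cross-platform, easy-to-use, high-performance model deployment experience.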

ascend easy-to-use hpc mnn model-deployment multi-inference openvino out-of-box-model parallel rknn tensorrt yolo

Last synced: 14 Dec 2025

https://github.com/llnl/sundials

Official development repository for SUNDIALS - a SUite of Nonlinear and DIfferential/ALgebraic equation Solvers. Pull requests are welcome for bug fixes and minor changes.

dae-solver high-performance-computing hpc math-physics nonlinear-equation-solver ode-solver parallel-computing radiuss scientific-computing sensitivity-analysis solver time-integration

Last synced: 15 May 2025

https://github.com/tacc/lmod

Lmod: An Environment Module System based on Lua, Reads TCL Modules, Supports a Software Hierarchy

environment-modules hpc lmod lua tacc xsede

Last synced: 12 Feb 2026

https://github.com/lablup/backend.ai

Backend.AI is a streamlined, container-based computing cluster platform that hosts popular computing/ML frameworks and diverse programming languages, with pluggable heterogeneous accelerator support including CUDA GPU, ROCm GPU, TPU, IPU and other NPUs.

api backendai cloud-computing containers distributed-computing docker documentation hpc monitoring paas python

Last synced: 02 Apr 2026

https://github.com/openhackathons-org/gpubootcamp

This repository consists of GPU bootcamp material for HPC and AI

ai4hpc cuda data-science deep-learning deepstream gpu hpc machine-learning mpi openacc openmp rapidsai

Last synced: 27 Mar 2025

https://github.com/open-mpi/hwloc

Hardware locality (hwloc)

c hardware hpc locality topology

Last synced: 09 Feb 2026

https://github.com/PySlurm/pyslurm

Python Interface to Slurm

cluster cython hpc python slurm

Last synced: 01 May 2025

https://github.com/easybuilders/easybuild

EasyBuild - building software with ease

hacktoberfest hpc linux python scientific-software

Last synced: 17 Mar 2026

https://github.com/visit-dav/visit

VisIt - Visualization and Data Analysis for Mesh-based Scientific Data

data-analysis data-viz hpc python radiuss scientific-computing scientific-visualization visualization

Last synced: 14 Jan 2026

https://github.com/ashvardanian/less_slow.cpp

Learning how to write "Less Slow" code in C++ 20, C 99, CUDA, PTX, & Assembly, from numerics & SIMD to coroutines, ranges, exception handling, networking and user-space IO

assembly assembly-language avx512 benchmark coroutines cpp cpp-programming cpp17 cpp20 cuda gcc google-benchmark hpc io-uring linux-kernel llvm ptx ranges tutorial tutorials

Last synced: 08 Apr 2025

https://github.com/TACC/Lmod

Lmod: An Environment Module System based on Lua, Reads TCL Modules, Supports a Software Hierarchy

environment-modules hpc lmod lua tacc xsede

Last synced: 17 Jul 2025

https://github.com/haptork/easyLambda

distributed dataflows with functional list operations for data processing with C++14

cpp14 dataflow-programming distributed-computing functional-programming hpc mpi parallel

Last synced: 15 Mar 2025

https://github.com/haptork/easylambda

distributed dataflows with functional list operations for data processing with C++14

cpp14 dataflow-programming distributed-computing functional-programming hpc mpi parallel

Last synced: 06 Apr 2025

https://github.com/luispedro/jug

Parallel programming with Python

hpc parallel-computing python python-2 python-3 workflow workflow-engine

Last synced: 14 May 2025

https://github.com/arrayfire/arrayfire-python

Python bindings for ArrayFire: A general purpose GPU library.

arrayfire cuda gpgpu gpu hpc opencl python python-bindings

Last synced: 02 Apr 2025

https://github.com/libocca/occa

Portable and vendor neutral framework for parallel programming on heterogeneous platforms.

c cpp cuda dpcpp fortran gpgpu gpu hip hpc jit metal multithreading oneapi opencl openmp sycl

Last synced: 04 Apr 2025

https://github.com/it4innovations/hyperqueue

Scheduler for sub-node tasks for HPC systems with batch scheduling

distributed-computing hpc rust task-graph

Last synced: 15 May 2025

https://github.com/ParRes/Kernels

This is a set of simple programs that can be used to explore the features of a parallel platform.

c c-plus-plus coarray-fortran fortran2008 hpc julia kokkos mpi openacc opencl openmp parallel parallel-programming pgas python3 shmem sycl threading

Last synced: 01 Apr 2025

https://github.com/easybuilders/easybuild-easyconfigs

A collection of easyconfig files that describe which software to build using which build options with EasyBuild.

hpc linux python scientific-software

Last synced: 13 May 2025

https://github.com/llnl/Umpire

An application-focused API for memory management on NUMA & GPU architectures

blt cpp gpu hpc memory-management portability radiuss

Last synced: 31 Mar 2026

https://github.com/It4innovations/hyperqueue

Scheduler for sub-node tasks for HPC systems with batch scheduling

distributed-computing hpc rust task-graph

Last synced: 01 Apr 2025

https://github.com/alpaka-group/alpaka

Abstraction Library for Parallel Kernel Acceleration :llama:

cpp cpp17 cuda gpu header-only heterogeneous-parallel-programming hip hpc openacc openmp rocm tbb

Last synced: 15 May 2025

https://github.com/llnl/caliper

Caliper is an instrumentation and performance profiling library

annotation-apis caliper cpp hpc instrumentation performance performance-analysis performance-monitoring radiuss trace

Last synced: 14 May 2025

https://github.com/nek5000/nekrs

Our next-generation fast and scalable CFD code

cfd exascale gpu high-order hpc turbulence

Last synced: 19 Feb 2026

https://github.com/llnl/umpire

An application-focused API for memory management on NUMA & GPU architectures

blt cpp gpu hpc memory-management portability radiuss

Last synced: 15 May 2025

https://github.com/giovtorres/slurm-docker-cluster

A Slurm cluster using docker-compose

docker-compose hpc slurm slurm-cluster

Last synced: 24 Feb 2026

https://github.com/pipefunc/pipefunc

Lightweight fast function pipeline (DAG) creation in pure Python for scientific workflows 🕸️🧪

dag hpc parallel-computing pipeline-framework pipelines reproducible-research slurm workflow-engine

Last synced: 16 Dec 2025

https://github.com/pyccel/pyccel

Python extension language using accelerators

dsl fortran hpc mpi openacc openmp python python3 sympy transpiler

Last synced: 26 Feb 2026

https://github.com/LLNL/Umpire

An application-focused API for memory management on NUMA & GPU architectures

blt cpp gpu hpc memory-management portability radiuss

Last synced: 11 May 2025

https://github.com/uob-hpc/babelstream

STREAM, for lots of devices written in many programming models

benchmark cuda gpgpu gpu hpc kokkos memory-bandwidth openacc opencl openmp parallel-processing raja sycl

Last synced: 21 Oct 2025

https://github.com/LLNL/Caliper

Caliper is an instrumentation and performance profiling library

annotation-apis caliper cpp hpc instrumentation performance performance-analysis performance-monitoring radiuss trace

Last synced: 08 May 2025

https://github.com/NERSC/shifter

Shifter - Linux Containers for HPC

containers hpc linux

Last synced: 20 Apr 2025

https://github.com/cnuernber/dtype-next

A Clojure library designed to aid in the implementation of high performance algorithms and systems.

clojure graal-native hpc

Last synced: 16 May 2025