An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with openmp

A curated list of projects in awesome lists tagged with openmp .

https://github.com/openwall/john

John the Ripper jumbo - advanced offline password cracker, which supports hundreds of hash and cipher types, and runs on many operating systems, CPUs, GPUs, and even some FPGAs

assembler c cracker crypt fpga gpgpu gpu hash john jtr mpi opencl openmp password ripper simd

Last synced: 16 Dec 2025

https://github.com/magnumripper/johntheripper

John the Ripper jumbo - advanced offline password cracker, which supports hundreds of hash and cipher types, and runs on many operating systems, CPUs, GPUs, and even some FPGAs

assembler c cracker crypt fpga gpgpu gpu hash john jtr mpi opencl openmp password ripper simd

Last synced: 09 Mar 2025

https://github.com/mratsim/arraymancer

A fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, Cuda and OpenCL backends

autograd automatic-differentiation cuda cudnn deep-learning gpgpu gpu-computing high-performance-computing iot linear-algebra machine-learning matrix-library multidimensional-arrays ndarray neural-networks nim opencl openmp parallel-computing tensor

Last synced: 14 May 2025

https://mratsim.github.io/Arraymancer/

A fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, Cuda and OpenCL backends

autograd automatic-differentiation cuda cudnn deep-learning gpgpu gpu-computing high-performance-computing iot linear-algebra machine-learning matrix-library multidimensional-arrays ndarray neural-networks nim opencl openmp parallel-computing tensor

Last synced: 08 May 2025

https://github.com/mratsim/Arraymancer

A fast, ergonomic and portable tensor library in Nim with a deep learning focus for CPU, GPU and embedded devices via OpenMP, Cuda and OpenCL backends

autograd automatic-differentiation cuda cudnn deep-learning gpgpu gpu-computing high-performance-computing iot linear-algebra machine-learning matrix-library multidimensional-arrays ndarray neural-networks nim opencl openmp parallel-computing tensor

Last synced: 16 Apr 2025

https://github.com/kratosmultiphysics/kratos

Kratos Multiphysics (A.K.A Kratos) is a framework for building parallel multi-disciplinary simulation software. Modularity, extensibility and HPC are the main objectives. Kratos has BSD license and is written in C++ with extensive Python interface.

bsd-license c-plus-plus dem fem kratos kratos-multiphysics mpi multi-platform multiphysics numerical-methods openmp parallel-computing python

Last synced: 13 May 2025

https://kratosmultiphysics.github.io/Kratos/

Kratos Multiphysics (A.K.A Kratos) is a framework for building parallel multi-disciplinary simulation software. Modularity, extensibility and HPC are the main objectives. Kratos has BSD license and is written in C++ with extensive Python interface.

bsd-license c-plus-plus dem fem kratos kratos-multiphysics mpi multi-platform multiphysics numerical-methods openmp parallel-computing python

Last synced: 23 Apr 2025

https://github.com/KratosMultiphysics/Kratos

Kratos Multiphysics (A.K.A Kratos) is a framework for building parallel multi-disciplinary simulation software. Modularity, extensibility and HPC are the main objectives. Kratos has BSD license and is written in C++ with extensive Python interface.

bsd-license c-plus-plus dem fem kratos kratos-multiphysics mpi multi-platform multiphysics numerical-methods openmp parallel-computing python

Last synced: 14 Mar 2025

https://github.com/0voice/developkit_set

2021年最新总结,值得推荐的c/c++开源框架与库。持续更新中。

boost cpp libevent mongodb nginx ntyco openmp

Last synced: 05 May 2025

https://github.com/faasm/faasm

High-performance stateful serverless runtime based on WebAssembly

cpp faas kubernetes mpi openmp serverless wasi wasm webassembly

Last synced: 29 Dec 2025

https://github.com/ddemidov/amgcl

C++ library for solving large sparse linear systems with algebraic multigrid method

amg c-plus-plus cpp cuda gpgpu linear-solvers mpi multigrid opencl openmp scientific-computing sparse-linear-systems

Last synced: 14 May 2025

https://github.com/mogill/ems

Extended Memory Semantics - Persistent shared object memory and parallelism for Node.js and Python

ems extended-memory-semantics javascript json json-data multithreading non-volatile-memory openmp parallel persistent-data persistent-data-structure persistent-memory python shared-memory

Last synced: 21 Mar 2025

https://github.com/mratsim/weave

A state-of-the-art multithreading runtime: message-passing based, fast, scalable, ultra-low overhead

data-parallelism fork-join message-passing multithreading openmp parallelism runtime scheduler task-parallelism task-scheduler threadpool work-stealing

Last synced: 05 Apr 2025

https://github.com/openhackathons-org/gpubootcamp

This repository consists for gpu bootcamp material for HPC and AI

ai4hpc cuda data-science deep-learning deepstream gpu hpc machine-learning mpi openacc openmp rapidsai

Last synced: 27 Mar 2025

https://github.com/libocca/occa

Portable and vendor neutral framework for parallel programming on heterogeneous platforms.

c cpp cuda dpcpp fortran gpgpu gpu hip hpc jit metal multithreading oneapi opencl openmp sycl

Last synced: 04 Apr 2025

https://github.com/ParRes/Kernels

This is a set of simple programs that can be used to explore the features of a parallel platform.

c c-plus-plus coarray-fortran fortran2008 hpc julia kokkos mpi openacc opencl openmp parallel parallel-programming pgas python3 shmem sycl threading

Last synced: 01 Apr 2025

https://github.com/alpaka-group/alpaka

Abstraction Library for Parallel Kernel Acceleration :llama:

cpp cpp17 cuda gpu header-only heterogeneous-parallel-programming hip hpc openacc openmp rocm tbb

Last synced: 15 May 2025

https://github.com/ekondis/mixbench

A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP)

benchmark cuda gpu hip opencl openmp sycl

Last synced: 04 Apr 2025

https://github.com/pyccel/pyccel

Python extension language using accelerators

dsl fortran hpc mpi openacc openmp python python3 sympy transpiler

Last synced: 26 Mar 2025

https://github.com/uob-hpc/babelstream

STREAM, for lots of devices written in many programming models

benchmark cuda gpgpu gpu hpc kokkos memory-bandwidth openacc opencl openmp parallel-processing raja sycl

Last synced: 21 Oct 2025

https://github.com/sbeamer/gapbs

GAP Benchmark Suite

benchmark bfs graph-algorithms openmp

Last synced: 15 May 2025

https://github.com/UoB-HPC/BabelStream

STREAM, for lots of devices written in many programming models

benchmark cuda gpgpu gpu hpc kokkos memory-bandwidth openacc opencl openmp parallel-processing raja sycl

Last synced: 21 Apr 2025

https://github.com/bcgsc/abyss

:microscope: Assemble large genomes using short reads

assembler bioinformatics bloom-filter c-plus-plus genome mpi openmp scaffold science

Last synced: 08 Apr 2025

https://github.com/spoutn1k/mcmap

Pixel-art map visualizer for Minecraft. Maps are drawn from an isometric perspective.

linux macos minecraft minecraft-map openmp pixel-art qt threading windows

Last synced: 17 Nov 2025

https://github.com/mratsim/laser

The HPC toolbox: fused matrix multiplication, convolution, data-parallel strided tensor primitives, OpenMP facilities, SIMD, JIT Assembler, CPU detection, state-of-the-art vectorized BLAS for floats and integers

assembler blas compiler-optimization convolution deep-learning gemm high-performance-computing jit matrix-multiplication openmp parallel runtime-cpu-detection simd tensor

Last synced: 08 Apr 2025

https://github.com/trinkle23897/fast-poisson-image-editing

A fast poisson image editing implementation that can utilize multi-core CPU or GPU to handle a high-resolution image input.

cpp cuda high-performance-computing image-processing jacobi-iteration jacobi-method mpi numpy openmp parallel-computing poisson-image-editing pybind11 python

Last synced: 05 Apr 2025

https://github.com/Trinkle23897/Fast-Poisson-Image-Editing

A fast poisson image editing implementation that can utilize multi-core CPU or GPU to handle a high-resolution image input.

cpp cuda high-performance-computing image-processing jacobi-iteration jacobi-method mpi numpy openmp parallel-computing poisson-image-editing pybind11 python

Last synced: 02 Apr 2025

https://github.com/ashvardanian/forkunion

Lower-latency OpenMP-style minimalistic scoped thread-pool designed for 'Fork-Join' parallelism in Rust and C++, avoiding memory allocations, mutexes, CAS-primitives, and false-sharing on the hot path 🍴

arm atomics compare-and-swap concurrency memory-model mpi multithreading openmp parallel-computing parallel-stl parallelism rayon thread-pool threadpool

Last synced: 16 Dec 2025

https://github.com/wepe/dive-into-ml-system

Dive into machine learning system, start from reinventing the wheel.

ctypes eigen machine-learning-systems openmp

Last synced: 04 Sep 2025

https://github.com/rocm/aomp

AOMP is an open source Clang/LLVM based compiler with added support for the OpenMP® API on Radeon™ GPUs. Use this repository for releases, issues, documentation, packaging, and examples.

amd clang fortran-compiler llvm openmp rocm

Last synced: 16 May 2025

https://github.com/rocm/gpufort

GPUFORT: S2S translation tool for CUDA Fortran and Fortran+X in the spirit of hipify

cuda cuda-fortran fortran gpgpu gpu hip interoperability openacc openmp rocm

Last synced: 21 Jun 2025

https://github.com/ROCm/gpufort

GPUFORT: S2S translation tool for CUDA Fortran and Fortran+X in the spirit of hipify

cuda cuda-fortran fortran gpgpu gpu hip interoperability openacc openmp rocm

Last synced: 11 Mar 2025

https://github.com/roastduck/FreeTensor

A language and compiler for irregular tensor programs.

ast automatic-differentiation code-generation cuda gpu jit openmp tensor

Last synced: 11 Apr 2025

https://github.com/pennylaneai/pennylane-lightning

The Lightning plugin ecosystem provides fast quantum state-vector and tensor network simulators written in C++ for use with PennyLane.

cuda distributed-computing gpu hpc mpi openmp parallel quantum-computing quantum-machine-learning rocm

Last synced: 15 May 2025

https://github.com/PennyLaneAI/pennylane-lightning

The Lightning plugin ecosystem provides fast quantum state-vector and tensor network simulators written in C++ for use with PennyLane.

cuda distributed-computing gpu hpc mpi openmp parallel quantum-computing quantum-machine-learning rocm

Last synced: 11 May 2025

https://github.com/kanyun-inc/ytk-mp4j

Ytk-mp4j is a fast, user-friendly, cross-platform, multi-process, multi-thread collective message passing java library which includes gather, scatter, allgather, reduce-scatter, broadcast, reduce, allreduce communications for distributed machine learning.

allreduce broadcast machine-learning messaging-library mpi openmp reduce

Last synced: 06 May 2025

https://github.com/parallel-rust-cpp/shortcut-comparison

Performance comparison of parallel Rust and C++

cpp openmp parallel rayon rust simd

Last synced: 22 Jul 2025

https://github.com/prip-lab/MSU-LatentAFIS

A system for identifying latent fingerprints. Created at Michigan State University by Anil K. Jain, Kai Cao, Dinh-Luan Nguyen, and Cori Tymoszek.

afis feature-extraction fingerprint-matching latent-fingerprints minutiae openmp opensource parallel-programming templates tensorflow

Last synced: 12 Jul 2025

https://github.com/ashvardanian/parallelreductionsbenchmark

Thrust, CUB, TBB, AVX2, AVX-512, CUDA, OpenCL, OpenMP, Metal - all it takes to sum a lot of numbers fast!

apple avx512 cuda glsl gpgpu gpu gpu-acceleration gpu-computing hpc intel metal nvidia opencl openmp parallel simd stl tbb thrust

Last synced: 06 Apr 2025

https://github.com/eomii/rules_ll

An Upstream Clang/LLVM-based toolchain for contemporary C++ and heterogeneous programming

bazel bleeding-edge build-system clang clang-tidy cpp cuda gpu-programming hermetic hip llvm nix openmp remote-caching remote-execution reproducible sanitizers

Last synced: 06 Apr 2025

https://github.com/ttsiodras/mandelbrotsse

Real-time Mandelbrot zoom via SSE, AVX, OpenMP, CUDA, XaoS...

avx cuda openmp sse

Last synced: 18 Oct 2025

https://github.com/pierrebarbera/epa-ng

Massively parallel phylogenetic placement of genetic sequences

bioinformatics mpi mpi-io openmp phylogenetics placement taxonomic-classification

Last synced: 13 Apr 2025

https://github.com/ashvardanian/fork_union

Low(est?)-latency OpenMP-style minimalistic scoped thread-pool designed for 'Fork-Join' parallelism in Rust and C++, avoiding memory allocations, mutexes, CAS-primitives, and false-sharing on the hot path 🍴

arm atomics compare-and-swap concurrency memory-model mpi multithreading openmp parallel-computing parallel-stl parallelism rayon thread-pool threadpool

Last synced: 13 Jun 2025

https://github.com/rocm/rocalution

Next generation library for iterative sparse solvers for ROCm platform

cplusplus cuda fortran mpi opencl openmp solver sparse

Last synced: 05 Apr 2025

https://github.com/bsc-pm/mcxx

Mercurium is a C/C++/Fortran source-to-source compilation infrastructure aimed at fast prototyping developed by the Programming Models group at the Barcelona Supercomputing Center

compiler ompss openmp source-to-source

Last synced: 22 Apr 2025

https://github.com/shadensmith/splatt

The Surprisingly ParalleL spArse Tensor Toolkit.

cpd machine-learning mpi openmp parallel tensor

Last synced: 06 Apr 2025

https://github.com/llnl/dataracebench

Data race benchmark suite for evaluating OpenMP correctness tools aimed to detect data races.

benchmark data-race microbenchmarks openmp

Last synced: 29 Apr 2025

https://github.com/pypr/compyle

Execute a subset of Python on HPC platforms

cuda cython high-performance-computing opencl openmp python transpile

Last synced: 04 Apr 2025

https://github.com/ShadenSmith/splatt

The Surprisingly ParalleL spArse Tensor Toolkit.

cpd machine-learning mpi openmp parallel tensor

Last synced: 16 May 2025

https://github.com/openwall/php_mt_seed

PHP mt_rand() seed cracker

cracker mersenne-twister mt openmp php rand random seed simd

Last synced: 09 Apr 2025

https://github.com/utwente-fmt/vercors

The VerCors verification toolset for verifying parallel and concurrent software

c concurrency java opencl openmp parallelism separation-logic software-analysis vercors-toolset verification viper

Last synced: 27 Feb 2025

https://github.com/mperlet/matrix_multiplication

Parallel Matrix Multiplication Using OpenMP, Phtreads, and MPI

c matrices matrix-multiplication mpi openmp parallel-computing pthreads thread

Last synced: 05 Mar 2025

https://github.com/edawson/rkmh

Classify sequencing reads using MinHash.

bioinformatics kmer minhash mutations nanopore openmp

Last synced: 24 Apr 2025

https://github.com/bsc-pm/bots

Barcelona OpenMP Task Suite is a collection of applications that allow to test OpenMP tasking implementations and compare its behaviour under certain circumstances: task tiedness, throttle and cut-offs mechanisms, single/multiple task generators, etc.

benchmark ompss openmp testing

Last synced: 26 Apr 2025

https://github.com/markus-goetz/hpdbscan

Highly parallel DBSCAN (HPDBSCAN)

clustering data-analytics dbscan mpi openmp python-bindings

Last synced: 22 Oct 2025

https://github.com/RajdeepKonwar/stockast

Predict stock market pricing over 180 minutes using Black-Scholes stochastic modeling and parallel Monte-Carlo simulations.

black-scholes c-plus-plus monte-carlo-simulation openmp parallel prediction-model predictive-analytics predictive-modeling stock-data stock-market stock-prediction stock-price-prediction stock-prices thread

Last synced: 30 Mar 2025

https://github.com/AFD-Illinois/kharma

Kokkos-based High-Accuracy Relativistic Magnetohydrodynamics with AMR

cuda gpu grmhd hip kokkos mhd openmp sycl

Last synced: 04 Apr 2025

https://github.com/kimwalisch/primesum

🚀 Sum of the primes below x

cpp math number-theory openmp prime-numbers

Last synced: 20 Mar 2025

https://github.com/2b-t/lb-t

Multi-threaded object-oriented computational fluid dynamics (CFD) code for three-dimensional incompressible fluid flow on a regular grid with the incompressible lattice-Boltzmann method (LBM) for general lattices (D2Q9, D3Q15, D3Q19, D3Q27, ...) with A-A pattern for indexing - programmed in C++17, multi-threaded with OpenMP

3d aa-pattern bgk cfd computational-fluid-dynamics d3q19 d3q27 fluid fluid-dynamics fluid-simulation high-performance-computing hpc incompressible incompressible-flow lattice-boltzmann lbm multi-threading openmp three-dimensional trt

Last synced: 13 Jul 2025

https://github.com/fandreuz/parallel-kd-tree

Parallel k-d tree with C++17, MPI and OpenMP

cpp data-structures hpc mpi openmp

Last synced: 10 Jul 2025

https://github.com/douglasdavis/pygram11

Simple and fast histogramming in Python accelerated with OpenMP.

calculating-histograms conda-forge cpp14 histogram histograms macos numpy openmp pybind11 pypi python python3 weights

Last synced: 28 Jul 2025

https://github.com/ucl/research-computing-with-cpp

UCL-ARC *C++ for Research* course

c-plus-plus course gpu mpi openmp ucl

Last synced: 15 Apr 2025

https://github.com/ashvardanian/cuda-python-starter-kit

Parallel Computing starter project to build GPU & CPU kernels in CUDA & C++ and call them from Python without a single line of CMake using PyBind11

cmake cuda cuda-programming hip hpc matrix-multiplication openmp parallel-computing parallel-programming pybind pybind11 python starter-kit starter-template tutorial

Last synced: 13 Jul 2025

https://github.com/lanl/libquo

Dynamic execution environments for coupled, thread-heterogeneous MPI+X applications

hardware-locality hpc hwloc mpi mpi-applications mpi-plus-x mpix openmp pthreads threading

Last synced: 12 Apr 2025

https://github.com/illuhad/hipCPU

Implementation of AMD HIP for CPUs

cuda gpgpu hip hpc openmp openmp-parallelization

Last synced: 21 Apr 2025

https://github.com/illuhad/hipcpu

Implementation of AMD HIP for CPUs

cuda gpgpu hip hpc openmp openmp-parallelization

Last synced: 16 Apr 2025

https://github.com/ferdikoomen/capture3

C++ research project to learn more about cameras, image processing, color spaces, OpenCV and multi‑threading.

cpp cpp14 gphoto2 opencv openmp qt5

Last synced: 29 Apr 2025

https://github.com/soumik12345/radium

Radium is a Ray Tracing Engine written in C++ that runs on the CPU using shared-memory multiprocessing

computer-graphics cpp monte-carlo multiprocessing openmp raytracing russian-roulette

Last synced: 16 Jun 2025

https://github.com/ema2159/equirectangular-cubemaptransform

OpenCV with CUDA and OpenMP implementations for transforming equirectangular images to cube maps and vice versa

cubemap-to-equirectangular cuda equirectangular-to-cubemap opencv openmp

Last synced: 15 Apr 2025

https://github.com/yoch/sparse-som

Efficient Self-Organizing Map for Sparse Data

algorithm neural-nets openmp python self-organizing-map som sparse-data

Last synced: 06 Apr 2025

https://github.com/geoschem/gcclassic

This repository contains the "superproject" wrapper for the "Classic" configuration of the GEOS-Chem model of atmospheric chemistry and composition.

aerosols atmospheric-chemistry atmospheric-composition atmospheric-modeling aws carbon-cycle climate cloud-computing fortran greenhouse-gases hg openmp particulate-matter scientific-computing

Last synced: 27 Apr 2025

https://github.com/vmagnin/exploring_coarrays

Let's explore the modern Fortran features for parallel programming (coarrays, images, collective subroutines...)

coarray-fortran coarrays fortran openmp

Last synced: 30 Mar 2025