Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Projects in Awesome Lists tagged with gpu-computing

A curated list of projects in awesome lists tagged with gpu-computing .

https://github.com/rocm/hipblaslt

hipBLASLt is a library that provides general matrix-matrix operations with a flexible API and extends functionalities beyond a traditional BLAS library

amd assembly blas gemm gpu-computing hip machine-learning matrix-multiplication rocm

Last synced: 22 Dec 2024

https://github.com/open-atmos/PySDM

Pythonic particle-based (super-droplet) warm-rain/aqueous-chemistry cloud microphysics package with box, parcel & 1D/2D prescribed-flow examples in Python, Julia and Matlab

atmospheric-modelling atmospheric-physics cuda gpu gpu-computing monte-carlo-simulation numba nvrtc particle-system physics-simulation pint pypi-package python research simulation thrust

Last synced: 05 Nov 2024

https://github.com/open-atmos/pysdm

Pythonic particle-based (super-droplet) warm-rain/aqueous-chemistry cloud microphysics package with box, parcel & 1D/2D prescribed-flow examples in Python, Julia and Matlab

atmospheric-modelling atmospheric-physics cuda gpu gpu-computing monte-carlo-simulation numba nvrtc particle-system physics-simulation pint pypi-package python research simulation thrust

Last synced: 18 Dec 2024

https://github.com/ritchieng/fractional_differencing_gpu

Rapid large-scale fractional differencing with NVIDIA RAPIDS and GPU to minimize memory loss while making a time series stationary. 6x-400x speed up over CPU implementation.

cudf fractional-differencing gpu-computing hpc-applications nvidia python rapids stationarity time-series

Last synced: 22 Oct 2024

https://github.com/gunrock/loops

🎃 GPU load-balancing library for regular and irregular computations.

cuda gpu gpu-computing hpc load-balancing parallel

Last synced: 11 Nov 2024

https://github.com/rbaygildin/learn-gpgpu

Algorithms implemented in CUDA + resources about GPGPU

cublas cuda curand gpgpu gpu gpu-computing image-processing nvidia opencl parallel-computing pycuda

Last synced: 19 Nov 2024

https://github.com/ProjectPhysX/PTXprofiler

A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.

cuda gpu gpu-acceleration gpu-computing gpu-programming hpc nvidia nvidia-cuda nvidia-gpu opencl profiler ptx ptx-utils roofline-model sycl

Last synced: 05 Nov 2024

https://github.com/projectphysx/ptxprofiler

A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.

cuda gpu gpu-acceleration gpu-computing gpu-programming hpc nvidia nvidia-cuda nvidia-gpu opencl profiler ptx ptx-utils roofline-model sycl

Last synced: 08 Nov 2024

https://github.com/reignstudios/orbital-framework

Graphics / Video, Audio and Input frameworks. (Agnostic / Portable / Easy / Powerful / Fast)

agnostic api audio csharp direct3d directx dotnet framework gles gpu gpu-computing graphics input legacy opengl orbital-framework portable video vulkan

Last synced: 06 Nov 2024

https://github.com/juliaastrosim/astronbodysim.jl

Unitful and differentiable gravitational N-body simulation code in Julia

astronomy astrophysics autodiff gpu-computing julia nbody-gravity-simulation simulation unitful

Last synced: 03 Dec 2024

https://github.com/intelpython/dpep

Data Parallel Extensions for Python*

gpu-computing heterogeneous-computing numba numpy python

Last synced: 15 Nov 2024

https://github.com/IntelPython/DPEP

Data Parallel Extensions for Python*

gpu-computing heterogeneous-computing numba numpy python

Last synced: 09 Nov 2024

https://github.com/tcoppex/aer-engine

:aquarius: An OpenGL 4.3 / C++ 11 rendering engine oriented towards animation.

animation computergraphics cplusplus-11 engine gpu-computing opengl

Last synced: 12 Oct 2024

https://github.com/lichtso/vulkanfft

Fast Fourier Transform using the Vulkan API

c cli fft fft-library gpu-computing vulkan vulkan-library

Last synced: 07 Nov 2024

https://github.com/pkestene/euler2d_cudafortran

2nd order Godunov solver for 2d Euler equations written in CUDA Fortran and stdpar (standard paralelism)

cea conservation-laws cuda cuda-fortran euler-equations fortran gpu gpu-computing hydrodynamics nvfortran nvhpc stdpar

Last synced: 18 Dec 2024

https://github.com/llnl/care

CHAI and RAJA provide an excellent base on which to build portable codes. CARE expands that functionality, adding new features such as loop fusion capability and a portable interface for many numerical algorithms. It provides all the basics for anyone wanting to write portable code.

gpu gpu-acceleration gpu-computing gpu-programming hpc hpc-applications portability portable portable-apps portable-class-library portable-executable portable-library portable-object portableapps radiuss

Last synced: 11 Nov 2024

https://github.com/rodrgo/OpenPH

Parallel reduction of boundary matrices for Persistent Homology with CUDA

cuda gpu-computing numerical-computation parallel-computing persistent-homology topological-data-analysis

Last synced: 12 Nov 2024

https://github.com/johnh2o2/cuvarbase

Python library for fast time-series analysis on CUDA GPUs

cuda fourier-methods gpu gpu-computing lomb-scargle-periodogram nfft python python-3 time-series

Last synced: 02 Nov 2024

https://github.com/sparselinearalgebra/spla

An open-source generalized sparse linear algebra library with vendor-agnostic GPUs accelerated computations

cplusplus cpp gpgpu gpu-computing graph-algorithms high-performance-computing linear-algebra opencl parallel python python-bindings sparse-matrix

Last synced: 29 Nov 2024

https://github.com/illuhad/spatialcl

Library for the GPU-accelerated spatial indexing and processing of particles in 2D and 3D with OpenCL. Currently offers trees based on space-filling-curves.

gpu gpu-computing library opencl particle spatial-index tree

Last synced: 28 Nov 2024

https://github.com/changkun/gopherchina2023gogpu

Talk "Go on GPU" in GopherChina 2023

go golang gpu gpu-acceleration gpu-computing

Last synced: 09 Nov 2024

https://github.com/goki/vgpu

Vulkan GPU Framework for Graphics and Compute in Go, now developed at https://github.com/cogentcore/core/tree/main/vgpu

go golang gpu gpu-computing phong-lighting vulkan

Last synced: 15 Dec 2024

https://github.com/gnikoloff/hwoa-rang-gpu

micro webGPU rendering & compute library

3d computer-graphics gpu-computing webgpu wglsl

Last synced: 02 Nov 2024

https://github.com/pkestene/ms-hpc-ai-gpu

resources pour le cours d'introduction à la programmation des GPUs du mastère spécialisé HPC-AI

cuda deep-learning gpu gpu-computing machine-learning physics-informed-neural-networks pinn pinns

Last synced: 18 Dec 2024

https://github.com/sergioskar/convolutional-neural-network

Computer vision framework based on deep learning and GPU programming

computer-vision convolutional-neural-network deep-learning gpu-computing opencl

Last synced: 20 Nov 2024

https://github.com/lkedward/lbm2d_opencl

D2Q9 BGK Lattice Boltzmann code written in OpenCL and Modern Fortran.

fortran gpu-computing lattice-boltzmann modern-fortran opencl

Last synced: 08 Nov 2024

https://github.com/epranka/gpucomputationrender-three

GPUComputationRender module for Three.js with ES6 compatibility

es6 gpu-computing shaders threejs typescript

Last synced: 11 Oct 2024

https://github.com/owlbarn/owl_symbolic

Connect Owl with other accelerators and numerical frameworks with symbolic maths

algebra gpu-computing machine-learning neural-networks numerical onnx scientific-computing symbolic-math

Last synced: 13 Nov 2024

https://github.com/paolosalvatori/aks-gpu

This project shows how to add a GPU-enabled node pool to an existing AKS cluster and how to autoscale and monitor GPU-enabled worker nodes

aks aks-cluster azure dcgm-exporter gpu gpu-computing gpu-container grafana grafana-dashboard prometheus prometheus-metrics

Last synced: 18 Oct 2024

https://github.com/upsj/gpu_selection

Parallel selection on GPUs

algorithms gpgpu gpu-computing quickselect

Last synced: 27 Nov 2024

https://github.com/juliawgpu/wgpunative.jl

WGPU bindings (Alias for WGPU_jll without JLLWrappers)

gpu gpu-computing graphics-library julia webgpu wgpu

Last synced: 07 Nov 2024

https://github.com/shadyboukhary/gpu-research-fft-openacc-cuda

Case studies constitute a modern interdisciplinary and valuable teaching practice which plays a critical and fundamental role in the development of new skills and the formation of new knowledge. This research studies the behavior and performance of two interdisciplinary and widely adopted scientific kernels, a Fast Fourier Transform and Matrix Multiplication. Both routines are implemented in the two current most popular many-core programming models CUDA and OpenACC. A Fast Fourier Transform (FFT) samples a signal over a period of time and divides it into its frequency components, computing the Discrete Fourier Transform (DFT) of a sequence. Unlike the traditional approach to computing a DFT, FFT algorithms reduce the complexity of the problem from O(n2) to O(nLog2n). Matrix multiplication is a cornerstone routine in Mathematics, Artificial Intelligence and Machine Learning. This research also shows that the nature of the problem plays a crucial role in determining what many-core model will provide the highest benefit in performance.

acceleration cuda fast-fourier-transform fft gpu-acceleration gpu-computing gpu-programming nvcc openacc parallel-computing pgi pgi-compiler radix-2

Last synced: 09 Nov 2024

https://github.com/illuhad/teralens

The fastest gravitational (quasar) microlensing code on the planet. A parallel Barnes-Hut tree code optimized for GPUs, written in OpenCL

astronomy astrophysics gpu gpu-computing gravitational-lensing high-performance high-performance-computing microlensing opencl

Last synced: 28 Nov 2024

https://github.com/skyzh/julia.metal

🎇 Render Julia Sets in real-time with Metal API on macOS

gpu-computing julia-set mathematics metalkit swift xcode

Last synced: 11 Oct 2024

https://github.com/egecetin/videostabilization

Video stabilization is a software-based approach (using OpenCV and its CUDA backend) in real-time to eliminating environmental effects (wind, heavy vehicle etc.) and enhance the visual performance that degrade video streaming quality.

glut gpu-computing opencv real-time-processing video-processing video-stabilization

Last synced: 15 Oct 2024

https://github.com/vmchale/ats-stats

Demonstration of GPU-accelerated statistical functions in ATS

ats futhark gpu-acceleration gpu-computing statistics

Last synced: 10 Nov 2024

https://github.com/pkestene/cuda-proj-tmpl

A minimal cmake based project skeleton for developping a CUDA application

cea cmake cuda gpu gpu-computing parallel-computing parallel-programming template

Last synced: 18 Dec 2024

https://github.com/phronmophobic/clj-webgpu

A clojure wrapper for webgpu-native

clojure gpu gpu-computing webgpu

Last synced: 12 Oct 2024

https://github.com/bokutotu/zenu

Deep Learning Framework Written in Rust

ai autograd blas cublas cuda cudnn deep-learning deep-neural-networks gpu-computing hpc rust

Last synced: 15 Dec 2024

https://github.com/1duo/mpi-nccl-tests

MPI + NCCL tests with GPU Direct RDMA.

distributed-systems gpu-computing mpi

Last synced: 30 Oct 2024

https://github.com/simbleau/vgpu

vgpu is a barebones prototype engine for interactive, hardware-accelerated vector graphics written in rust. It focuses on scalability, response time, and simulation.

gpu-computing path-rendering rendering-engine simulation vector-graphics

Last synced: 27 Oct 2024

https://github.com/sbl-sdsc/df-parallel

Comparison of Dataframe libraries for parallel processing of large tabular files on CPU and GPU.

cuda-toolkit dask dask-cudf dask-dataframes dataframes gpu-computing parallel-processing pyspark-dataframes rapidsai

Last synced: 09 Dec 2024

https://github.com/juliawgpu/wgpucompute.jl

Compute shaders interface for WGPU from julia

compute gpu gpu-computing gpu-programming julia-lang machine-learning shader wgpu

Last synced: 07 Nov 2024

https://github.com/goki/gosl

Go as a shader language: converts Go code to SPIR-V via HLSL

go golang gpu gpu-computing hlsl hlsl-shader shaders

Last synced: 15 Dec 2024

https://github.com/philiparvidsson/gpu-raytracing

Simple real-time raytracing demo running as a fragment shader on the computer's graphics card.

equation gpu gpu-acceleration gpu-computing hlsl intersection math ray raytracer raytracing real-time rendering sphere sphere-surface

Last synced: 01 Dec 2024

https://github.com/say4n/metal.compute

a verbose example on using metal with C++ to perform arbitrary compute on GPUs

apple apple-metal apple-silicon gpu-computing mac macos metal metal-cpp

Last synced: 08 Nov 2024

https://github.com/shapelets/al-juarismi

This is a chatbot to execute Khiva algorithms using natural language.

analytics chatbot gpu-computing python timeseries voice-assistant

Last synced: 13 Nov 2024

https://github.com/mpezeshki/theano_tile_coding

A tile coder in theano for Reinforcement Learning tasks

deep-learning gpu-computing reinforcement-learning theano tile-coding

Last synced: 07 Nov 2024

https://github.com/egororachyov/spbench

Benchmark for sparse linear algebra libraries for CPU and GPU platforms.

benchmark cpp cpu cuda gpu-computing graphblas opencl sparse-matrices

Last synced: 19 Nov 2024

https://github.com/jjfumero/fastr-gpu

Just In Time Compiler for GPUs for R applications.

gpgpu gpu-computing graalvm jit-compiler opencl r-language runtime truffle

Last synced: 17 Nov 2024

https://github.com/BrosnanYuen/RayBNN_Raytrace

Ray tracing library using GPUs, CPUs, and FPGAs via CUDA, OpenCL, and oneAPI

arrayfire cuda gpu gpu-computing opencl parallel parallel-computing ray ray-tracing raybnn raylib raytracer raytracing rust

Last synced: 05 Nov 2024

https://github.com/brosnanyuen/raybnn_raytrace

Ray tracing library using GPUs, CPUs, and FPGAs via CUDA, OpenCL, and oneAPI

arrayfire cuda gpu gpu-computing opencl parallel parallel-computing ray ray-tracing raybnn raylib raytracer raytracing rust

Last synced: 13 Nov 2024

https://github.com/brosnanyuen/raybnn_diffeq

Differential Equation Solver using GPUs, CPUs, and FPGAs via CUDA, OpenCL, and oneAPI

arrayfire cuda differential differential-equations gpu gpu-computing opencl parallel parallel-computing parallel-programming raybnn rust

Last synced: 13 Nov 2024

https://github.com/kai-kj/microcompute

A small library for gpu computing

c glsl gpgpu gpu gpu-computing gpu-programming lua luajit opengl

Last synced: 15 Nov 2024

https://github.com/meetps/me-766

Assignment Solutions to course ME766 High Performance Scientific Computing.

cuda gpu-computing opencl openmp parallel-computing

Last synced: 09 Nov 2024

https://github.com/senresearch/liteqtl.jl

A lightweight Julia package for eQTL genome scans near real-time.

eqtl genome-scan gpu-computing julia linearmodel

Last synced: 02 Dec 2024

https://github.com/babak2/optimizedsum

Optimized Parallel Sum program demonstrating CPU vs GPU performance

cuda cuda-programming gpu-acceleration gpu-computing gpu-parallelism visual-studio

Last synced: 06 Dec 2024

https://github.com/lawmurray/gpu-gemm

CUDA kernel for matrix-matrix multiplication on Nvidia GPUs, using a Hilbert curve to improve L2 cache utilization.

cplusplus cuda cuda-kernels cuda-programming gpu gpu-computing gpu-programming matrix-multiplication numerical-methods scientific-computing

Last synced: 01 Nov 2024

https://github.com/brosnanyuen/raybnn_optimizer

Gradient Descent Optimizers and Genetic Algorithms using GPUs, CPUs, and FPGAs via CUDA, OpenCL, and oneAPI

arrayfire cuda genetic-algorithm genetic-algorithms gpu gpu-computing gradient gradient-descent parallel parallel-computing raybnn rust

Last synced: 23 Oct 2024

https://github.com/intelpython/sample-data-parallel-extensions

Sample data parallel extensions built with oneAPI DPC++

cython dpctl gpu-computing intel-xpu oneapi pybind11 python

Last synced: 15 Nov 2024

https://github.com/koldim2001/jupiterlabcv

Готовый сервис JupiterLab для разработок в сфере компьютерного зрения

computer-vision docker docker-compose gpu-computing jupyter-lab jupyter-notebook jupyterlab

Last synced: 19 Dec 2024

https://github.com/skyzh/conway.metal

Conway's Game of Life with Metal Computing

conway-game conway-s-game-of-life gpu-computing metalkit swift xcode

Last synced: 11 Nov 2024

https://github.com/lmlk-seal/llmodel

Private LLModel GUI Chat allows users to interact with a local large language model (LLM) using the Llama library and GGUF models.

ai assistant chatbot code cpu gguf gpu-computing gui llama llamacpp llm model models offline privacy private prompt python simple virtual-assistant-ai

Last synced: 22 Dec 2024

https://github.com/k-ye/metalscan

Blelloch's exclusive scan implemented in Metal

gpu gpu-computing metal swift

Last synced: 16 Nov 2024

https://github.com/fwd/nano-pow

Lightweight Proof of Work Server for Nano Currency.

gpu-computing gpu-monetization nanocurrency proof-of-work

Last synced: 10 Nov 2024

https://github.com/dernasherbrezon/rpi-fma

Test FMA on RaspberryPI CPU&GPU

fma gpu gpu-computing opencl raspberrypi

Last synced: 14 Nov 2024

https://github.com/adamelkholyy/nemo

Fork for running Whisper transcriptions with Nemo diarization on University of Exeter's ISCA Supercomputer. Includes custom configuration (env.yaml) and logging for HPC operations.

asr gpu-computing hpc-clusters

Last synced: 20 Nov 2024

https://github.com/daelsepara/hipmandelbrot

GPU Implementation of Mandelbrot Fractal Generator with Benchmarking

amd cuda fractal gpu gpu-compute gpu-computing hip mandelbrot parallel-computing rocm sdk

Last synced: 07 Nov 2024

https://github.com/mortafix/quickshift

A working implementation of Quickshift algorithm in CUDA, GPU-compatible.

cuda gpu-computing quickshift

Last synced: 14 Nov 2024

https://github.com/nooridan/blahutarimoto

pyTorch implementation of blahut-arimoto alogirthm to compute the channel capacity of a peak power constrained channel

additive-gaussian-processes channel-cpacity gpu-computing information-theory pytorch

Last synced: 21 Dec 2024

https://github.com/kapshaul/llm-finetune-vuln-detection

Fine-tuning a Large Language Model (LLM) for code vulnerability detection using QLoRA, a method that quantizes the model to 4-bit floats and incorporates adapters for fine-tuning.

deep-learning fine-tuning gpu-computing large-language-model llm lora machine-learning nlp-machine-learning qlora quantization secrutiy

Last synced: 18 Nov 2024

https://github.com/abhaskumarsinha/keras-blenderbot

A Keras implementation of Blenderbot, a state-of-the-art open-domain chatbot model, supporting TensorFlow, PyTorch, and JAX backends seamlessly.

attention-mechanism chatbot-framework gpu-computing jax keras keras-models language-model python3 pytorch tensorflow

Last synced: 21 Nov 2024

https://github.com/le-ander/msc_bioinfo-experimental_design

Using information theory to inform experimental design with GPU acceleration. Computing group project as part of the MSc in Bioinformatics and Theorectical Systems Biology at Imperial College London 2016/2017.

cuda experimental-design gpu-computing information-theory pycuda systems-biology

Last synced: 04 Dec 2024