An open API service indexing awesome lists of open source software.

CUDA

CUDA® is a parallel computing platform and programming model developed by NVIDIA for general computing on graphical processing units (GPUs). With CUDA, developers are able to dramatically speed up computing applications by harnessing the power of GPUs.

https://github.com/voschezang/holographic-projector-simulations

Optimizations of Simulations of Holographic Projectors using CUDA

cuda gpu holography parallel-computing photonics

Last synced: 16 May 2026

https://github.com/zalo/matmul_cuda

A simple learning example for CUDA

cuda

Last synced: 07 Jul 2025

https://github.com/maltsev-andrey/julia_set_cuda

High-performance Julia set fractal computation in pure CUDA C, achieving 2.78 billion pixels/second on Tesla P100. Demonstrates GPU kernel programming, memory optimization, and massive parallelization (16M+ threads)."

cuda fractals gpu-programming high-performance-computing nvidia parallel-computing science visualization

Last synced: 03 Nov 2025

https://github.com/rbuj-uoc/m1.209

PAC 1, PAC 2, PAC 3 i PAC 4 de l'assignatura Computació d'altes prestacions del MUEI

cuda mpi openmp sge

Last synced: 21 May 2026

https://github.com/maxenceleguery/jare

3D Render engine accelerated with CUDA

3d cuda engine raytracing

Last synced: 21 May 2026

https://github.com/sbstndb/nbody_k

A simple 3D naïve NBody simulation using Kokkos enabling CUDA or OpenMP backend

cuda kokkos nbody openmp simulation

Last synced: 21 May 2026

https://github.com/naidezhujimo/cuda-learning-just-record-the-learning-process-

just record the learning process,There are notes,Welcome to learn.

cuda

Last synced: 26 Mar 2025

https://github.com/shermanlo77/poisson_icing

Gibbs sampling on the Poisson-Ising model. The Poisson-Ising model is a 2D image of Poisson distributed random variables but has a dependency on their four neighbours. This causes the Poisson random variables to be similar (or dissimilar) to their neighbours.

cuda cupy gibbs-sampling gpu ising-model mcmc monte-carlo poisson poisson-ising

Last synced: 21 May 2026

https://github.com/bjornmelin/ml-algorithm-playground

🧪 Core ML algorithm implementations with GPU acceleration. Featuring optimized implementations across various libraries with comprehensive analysis. 📈

algorithms cuda gpu-computing lightgbm machine-learning python scikit-learn xgboost

Last synced: 13 May 2026

https://github.com/dasbd72/nthu-ipc-2022

National Tsing Hua University - Introduction to Parallel Computing - 2022

cuda cuda-programming hpc mpi openmp pthreads

Last synced: 30 Mar 2025

https://github.com/daelsepara/hipnewton

GPU Implementation of Newton Fractal Generator with Benchmarking

amd cuda fractal gpu gpu-compute gpu-computing hip newton parallel-computing rocm sdk

Last synced: 03 May 2026

https://github.com/anne-andresen/autoencoder_3d_c_cuda

3D Autoencoder training in raw C/CUDA

3d autoencoder c cuda nifti

Last synced: 28 Apr 2026

https://github.com/fedesky25/hpc-project-2024

Project for the 2024 course of HPC: generator of streamplot of complex-valued functions

complex-numbers cuda openmp

Last synced: 30 Mar 2025

https://github.com/cs550-epfl/review

Review of the paper A Formal Analysis of the NVIDIA PTX Memory Consistency Model

cuda formal-verification gpu memory-consistency ptx simt

Last synced: 30 Mar 2025

https://github.com/td99/ai-sandbox

A collection of AI tools and prototypes.

ai cuda docker image-generation-ai nvidia python

Last synced: 08 Apr 2026

https://github.com/sevilze/folderesque

Python Script to process and upscale images in specified folders using RRDB models.

cuda esrgan scripts upscaler

Last synced: 02 Mar 2026

https://github.com/belrbez/ship-graphic-qt-qml-cuda-c

Client-Server application for Rocket driving in QML graphics

c client-server cpp cuda qml qt5 rocket

Last synced: 08 Apr 2026

https://github.com/kenwuqianghao/c4ai-cuda-birds

Homework assignments for C4AI Beginners in Research-Driven Studies

cuda machine-learning pytorch

Last synced: 18 Apr 2026

https://github.com/TheodoreAI/monte-carlo-simulator

CUDA application for Monte Carlo simulation is used to determine the range of outcomes for a series of parameters, each of which has a probability distribution showing how likely each option is to happen. This is using CUDA.

cuda gpu-computing monte-carlo-simulation parallel-computing

Last synced: 06 Oct 2025

https://github.com/juntyr/necsim-rust-docs

Documentation of the spatially explicit biodiversity simulation necsim-rust

biodiversity cuda docs mpi necsim rust simulation

Last synced: 14 May 2026

https://github.com/marius311/cudadistributedtools.jl

A set of utility tools for multi-GPU + multi-process workflows

cuda distributed julia

Last synced: 01 May 2026

https://github.com/uefi-code/bachelorgraduationdesign

I developed a PyTorch_For_PoorGuys framework and Let it train LLM on NVIDIA GeForce 2080Ti GPU as my Bachelor's Graduation Design Project

chatbot cuda gpu hacking large-language-models pytorch

Last synced: 03 May 2026

https://github.com/hshshshshsh12e/gpumkat

Gpumkat is a shader debugger for metal which is designed to do what instruments can't do

alternative api control cuda darwin debugger debugging gpumkat macos management profiler release shaders threads

Last synced: 14 Apr 2026

https://github.com/sergeipapina/color2graycuda

color to gray image conversion nvidia CUDA kernel implementation using make or cmake to compile and link

cmake cuda cuda-kernels cuda-programming link makefile nvidia

Last synced: 06 Apr 2025

https://github.com/kanttouchthis/cuda_schem

script for voxelization of 3d models to minecraft .schem schematics with texture support powered by numba cuda.

cuda minecraft numba voxelization

Last synced: 07 Oct 2025

https://github.com/lanceberge/cuda-newton-fractals

Parallelize and visualize the Newton Iteration

cpp cuda mathematical-modelling visualization

Last synced: 16 May 2026

https://github.com/AndreasKaratzas/orin

Setting up the NVIDIA Jetson Orin Nano Developer Kit

cuda cudnn jetpack6 nvidia-jetson nvidia-sdkmanager orin-nano

Last synced: 25 Feb 2025

https://github.com/adesoji1/youtubesummaryai

Python script for YouTube summary. The service should summarize an YouTube video by url. It should works for long video and for different languages.

cuda googleapi python3 speech-recognition transformers youtube-api-v3 youtube-dl

Last synced: 04 Apr 2025

https://github.com/dreoporto/tensorflow-gpu-docker

An example project to run TensorFlow with CUDA-enabled GPU acceleration using Windows, Docker and WSL2.

artificial-intelligence cuda deep-learning docker docker-compose jupyter machine-learning nvidia-docker python windows wsl2

Last synced: 27 Jan 2026

https://github.com/alkaifaftab000/autonomous-maze-solver

Building an Autonomous Maze Solver using reinforcement learning to train agents for decision-making in dynamic grid-based environments

agent criticism cuda gymnasium-environment maze-solving-bot pytorch reinforcement-learning reward-functions

Last synced: 12 Apr 2026

https://github.com/f14-bertolotti/torchess

cuda torch extension for a chess engine

chess cuda torch

Last synced: 01 May 2026

https://github.com/larygwil/cuda-samples-old

nvidia cuda samples old (5.0 - 7.5)

cuda nvidia

Last synced: 03 May 2026

https://github.com/artheioupfat/mini-gpt-wiki

Projet visant à créer un mini LLM entraîné sur des données Wikipédia et à interagir avec lui via une interface Streamlit.

cuda gpt language llm model mps nlp pytorch scraping streamlit transformer wikipedia

Last synced: 08 Oct 2025

https://github.com/thanduriel/cuda_hip_comparison

performance study of atomics on GPUs

atomics cuda hip

Last synced: 09 Oct 2025

https://github.com/yash-1335/qwen600

🚀 Build a fast inference engine for the QWEN3-0.6B model using CUDA, optimizing performance with minimal dependencies for efficient learning and practice.

cuda cuda-programming gpu llamacpp llm llm-inference qwen qwen3 transformer

Last synced: 16 May 2026

https://github.com/enesdoruk/opencv-cpp

Opencv CPP tutorials

computer-vision cpp cuda opencv

Last synced: 09 Oct 2025

https://github.com/tylerfaulkner/n-body_simulation

CUDA N-Body Gravitational Simulation with rendering in Python with MatPlotLib

cuda simulation

Last synced: 20 May 2026

https://github.com/bd2720/accesspatterns

Comparing chunked vs. striped memory access patterns for CPU and GPU code using the CUDA toolkit in C.

c cache cuda cuda-toolkit performance-analysis performance-testing profiling

Last synced: 16 May 2026

https://github.com/ojaswithag/opencv-doc

OpenCV ile görüntü ve video işleme, makine öğrenmesi ve proje uygulamaları için Türkçe kapsamlı bir rehber. 🐙 Adım adım kod örnekleriyle öğrenin ve projeler geliştirin.

arm-architecture cuda cuda-support deployment django docker-image docker-images heroku image-processing javascript nodejs nvidia opencv-contrib opencv3 production python scanner tutorial

Last synced: 08 Apr 2026

https://github.com/mradovic38/pycuda-simulated-annealing

Simulated annealing process for finding the 'minimum energy' of an image.

cuda image-energy parallel-computing parallel-programming pycuda python simulated-annealing

Last synced: 09 Oct 2025

https://github.com/naetherm/derelictcurand

Dynamic bindings to the CuRAND library for the D Programming Language.

cuda curand d derelict dlang

Last synced: 27 Mar 2025

https://github.com/skyguy126/cuda-learnings

Collection of personal CUDA learnings.

cuda

Last synced: 10 Oct 2025

https://github.com/bhavinpatel4199/image-processing-with-opencv-and-cuda-on-google-colab

This repository demonstrates image processing using OpenCV with CUDA for GPU acceleration on Google Colab. It includes basics like displaying and manipulating images, alongside advanced techniques using CUDA to enhance performance. Ideal for learning GPU-accelerated image processing in Python.

computer-vision cuda google-colab gpu-acceleration high-performance-computing image-processing opencv pixel-manupulation

Last synced: 19 Jan 2026

https://github.com/yangfengzzz/tardis

Travel space and time by using autodiff and codegen

autodiff codegen cuda

Last synced: 03 May 2026

https://github.com/zcemycl/distributecompute

Parallel Computing and Distributed Computing with C++ threads, Python threads+asyncio+multiprocessing and Spark, and Cuda.

asyncio boost cpp cuda global-interpreter-lock jthread multiprocessing python spark thread

Last synced: 14 Apr 2026