Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
CUDA
![](https://explore-feed.github.com/topics/cuda/cuda.png)
CUDA® is a parallel computing platform and programming model developed by NVIDIA for general computing on graphics processing units (GPUs). With CUDA, developers can dramatically speed up computing applications by harnessing the power of GPUs.
- GitHub: https://github.com/topics/cuda
- Wikipedia: https://en.wikipedia.org/wiki/CUDA
- Created by: NVIDIA
- Released: June 23, 2007
- Related Topics: nvcc
- Last updated: 2025-02-13 00:07:16 UTC
https://github.com/moshidev/acap
Coursework for the subject Arquitectura y Computación de Altas Prestaciones (High-Performance Architecture and Computing)
cuda homework-assignments mpi pthreads
Last synced: 05 Feb 2025
https://github.com/patriciobcs/mini-aevol
Parallel implementation of a reduced version of the Aevol simulator
Last synced: 20 Jan 2025
https://github.com/phantom7knight/cuda-fusion
This project is for learning CUDA to better understand how the GPU works.
cuda cuda-programming gpgpu gpu
Last synced: 08 Feb 2025
https://github.com/senli1073/docker-gpu-monitor
A lightweight GPU monitor designed for real-time web-based viewing of GPU server status.
container cuda docker flask gpu gpu-monitoring linux memory-usage nvidia-smi web
Last synced: 06 Feb 2025
https://github.com/alan-cooney/python-cuda-starter-template
Python CUDA Starter Template
Last synced: 06 Feb 2025
https://github.com/fikri-rouzan/cuda-c-program-part-2
CUDA C program from NVIDIA course.
Last synced: 05 Feb 2025
https://github.com/shineiarakawa/particle-stabilizer
A C++ and CUDA-based program for simulating the motion of particles.
Last synced: 13 Jan 2025
https://github.com/fikri-rouzan/cuda-c-program-part-1
CUDA C program from NVIDIA course.
Last synced: 05 Feb 2025
https://github.com/adesoji1/visis_backend_assessment_submission-adesoji
Create a backend API to handle book information requests, and summary generation.
bart cache cuda data-extraction fastapi flask hugging-face hugging-face-hub llama postman-api python3 pytorch spacy sqlite3-database swagger-api tensorboard-visualizations transformer ubuntu2304
Last synced: 22 Dec 2024
https://github.com/sebp/vscode-sycl-dpcpp-cuda
Sample project to use the VS Code Remote - Containers extension to develop SYCL applications for NVIDIA GPUs using the oneAPI DPC++ compiler.
cuda dpcpp fedora gpu-computing podman sycl vscode
Last synced: 08 Feb 2025
https://github.com/malolm/football-player-detection-with-yolov8
Football player detection YOLOv8 fine-tuning
cuda jupyterlab python3 yolov8-detection
Last synced: 06 Feb 2025
https://github.com/daviddavo/19gpu
Short exercises for GPU at Complutense University of Madrid. Mirror from GitLab
accelerator cuda gpu-programming
Last synced: 23 Jan 2025
https://github.com/blazekill/hello-cuda
Cpp + Vcpkg + CUDA + VsCode starter project.
Last synced: 01 Jan 2025
https://github.com/fikri-rouzan/cuda-c-program-part-3
CUDA C program from NVIDIA course.
Last synced: 05 Feb 2025
https://github.com/thalesmg/haskell-accelerate-parconc
Example and benchmark of Accelerate-HS from Parallel and Concurrent Programming in Haskell
accelerate cuda gpu-computing haskell parallel-computing
Last synced: 08 Feb 2025
https://github.com/muhamadajiw/parallel-matrix-inversion
A parallel program for matrix inversion using MPI, OpenMP, and CUDA
Last synced: 17 Jan 2025
https://github.com/viktor-akusoff/chernabogpy
ChernabogPy is a Python package for visualizing gravitational distortions caused by black holes using nonlinear ray tracing.
cuda gpu physics-simulation python3 relativity-of-space-and-time torch
Last synced: 12 Jan 2025
https://github.com/jonyandunh/avatargeneratorgan
A simple Generative Adversarial Network for generating avatars.
avatar-generator cuda gan pytorch
Last synced: 14 Jan 2025
https://github.com/parlaynu/inference-tvm
Export ONNX to ApacheTVM and run inference in containerized environments.
apache-tvm cuda docker jetson-nano onnx raspberrypi4 x86-64
Last synced: 28 Jan 2025
https://github.com/rugleb/cuda
A simple example of a program that uses parallel GPU computing on an NVIDIA graphics card using CUDA technology
Last synced: 23 Dec 2024
https://github.com/mathiasotnes/gemm
General Matrix Multiplication (GEMM) optimization in CUDA.
Last synced: 31 Jan 2025
https://github.com/saadarazzaq/cuda-device-info
Check whether CUDA is correctly configured on your Windows machine 🖥️
Last synced: 23 Jan 2025
https://github.com/i-m-iron-man/abmax
Abmax is an agent-based modelling framework in Jax, focused on dynamic population size
abm agent agent-based agent-based-modeling agent-based-simulation agents cuda jax python
Last synced: 13 Jan 2025
https://github.com/dpetrosy/fractal
This project is a Fractal Visualizer developed in C++ with SFML and CUDA.
burning-ship cmake cmakelists cpp cpp-programming cpp-project cuda cuda-opengl cuda-programming fractal fractal-generation fractal-visualization julia mandelbox mandelbrot opengl opengl-project sfml sfml-library tricorn
Last synced: 21 Jan 2025
https://github.com/dongskie43/nlp-engineering-hub
📚 Enterprise NLP systems and LLM applications. Features custom language model implementations, distributed training pipelines, and efficient inference systems. 🔤
cuda gpu-optimization huggingface huggingface-transformers langchain language-models large-language-models nlp openai python transformers
Last synced: 03 Feb 2025
https://github.com/denyskryvytskyi/capgemini-cuda
CUDA implementation of vector addition, matrix multiplication, reduction and sorting
bitonic-sort cpp cuda cuda-kernels gpgpu matrix matrix-multiplication matrix-multiplication-parallel matrix-transpose nvidia nvidia-cuda nvidia-gpu reduction-dimension sort sorting-algorithms-implemented vector vector-addition vectorization
Last synced: 10 Feb 2025
https://github.com/tianzonglin/cloud-control-gui
A tool to compute, visualize, analyse and drag points (high-dimensional data)
cuda interaction-design visualization
Last synced: 30 Dec 2024
https://github.com/mmz33/practice-cuda
c cpp cuda cuda-programming gpu-programming parallel-programming
Last synced: 22 Jan 2025
https://github.com/jamesnulliu/learning-programming-massively-parallel-processors
Learning notes on Programming Massively Parallel Processors, 4th edition.
Last synced: 02 Feb 2025
https://github.com/branebb/nn-framework
Framework for creating neural networks using C++ and CUDA platform. This project is part of my final university assignment for bachelor's degree.
cmake cpp cuda cuda-programming
Last synced: 19 Nov 2024
https://github.com/daelsepara/hipnewton
GPU Implementation of Newton Fractal Generator with Benchmarking
amd cuda fractal gpu gpu-compute gpu-computing hip newton parallel-computing rocm sdk
Last synced: 05 Feb 2025
https://github.com/dreamjet31/licence_plate_detection
Automated License Plate recognition system
cuda opencv python pytorch ultralytics yolov8
Last synced: 10 Feb 2025
https://github.com/adesoji1/youtubesummaryai
Python script for YouTube summaries. The service should summarize a YouTube video given its URL, and should work for long videos and for different languages.
cuda googleapi python3 speech-recognition transformers youtube-api-v3 youtube-dl
Last synced: 10 Feb 2025
https://github.com/bd2720/accesspatterns
Comparing chunked vs. striped memory access patterns for CPU and GPU code using the CUDA toolkit in C.
c cache cuda cuda-toolkit performance-analysis performance-testing profiling
Last synced: 31 Jan 2025
https://github.com/marcorentap/kokkos-docker-cluster
Deploy Docker containers with Kokkos, OpenMP, OpenMPI and CUDA as a Docker swarm.
Last synced: 23 Oct 2024
https://github.com/mcobzarenco/bitonic.cu
CUDA bitonic sort in Rust
cuda parallel-computing rust sorting-algorithms
Last synced: 10 Feb 2025
https://github.com/jonastoth/cuda_raytracer
University project to implement a basic Raytracer in CUDA
Last synced: 02 Feb 2025
https://github.com/mxm-tr/docker-darknet-opencv
Accelerated object detection on streams and files, using a Docker darknet YOLO container
cuda docker docker-compose object-recognition opencv-python python3 yolo
Last synced: 17 Jan 2025
https://github.com/sebftw/interp2gpu
GPU-accelerated 2D spline interpolation, à la interp2(..., "spline"), in MATLAB.
cuda gpu gpu-acceleration matlab spline spline-interpolation
Last synced: 14 Dec 2024
https://github.com/edumucelli/build-tensorflow
Build Tensorflow from source using a Dockerfile
Last synced: 24 Dec 2024
https://github.com/amruthapatil/nyu-cudamatrixoperations
Optimizing CUDA programs for vector addition and matrix multiplication
cuda high-performance-computing
Last synced: 20 Jan 2025
https://github.com/raiszo/cs334
Journey through Intro to Parallel Programming
Last synced: 25 Jan 2025
https://github.com/dragonscypher/prompty
Tool for generating smart and secure prompts for language models!
autotokenizer bert-model cuda google-t5 llm python3 tensorflow threading
Last synced: 22 Jan 2025
https://github.com/shineiarakawa/cuda-cmake-minimal-template
A minimal CUDA C++ project template with CMake
cmake cuda dear-imgui opengl project-template stb-image
Last synced: 21 Jan 2025
https://github.com/hdelan/msc-hpc-final-project
In this project I implement a CUDA Lanczos method to approximate the matrix exponential. The matrix exponential is an important centrality measure for large, sparse graphs.
cuda graph-algorithms krylov-methods
Last synced: 24 Dec 2024
https://github.com/bdwhst/fluora
A CUDA PBR path tracer
cpp cuda pathtracing pbr rendering
Last synced: 12 Feb 2025
https://github.com/pankajarm/ethereum-mining-cuda
cuda ethereum ethereum-mining ethminer ubuntu1604
Last synced: 13 Jan 2025
https://github.com/notkartikye/cuda-image-box-filters
🖼️ CUDA-powered tool for applying box filters to a large amount of images
cuda cuda-library cuda-programming npp
Last synced: 25 Dec 2024
https://github.com/fmigneault/dockers
Collection of docker setup with common libraries for image processing and machine learning.
boost cuda docker image-processing opencv python
Last synced: 25 Dec 2024
https://github.com/dhruvsrikanth/monte-carlo-ray-tracing
In this repository, you will find a serial and distributed GPU-based implementation of the ray tracing simulation.
c cpp cuda gpu-computing gpu-programming high-performance-computing parallel-programming raytracing unified-memory-parallelism
Last synced: 25 Dec 2024
https://github.com/cs550-epfl/report
EPFL CS-550 project report
cuda formal-verification gpu memory-consistency ptx simt
Last synced: 10 Jan 2025
https://github.com/matteopolak/stock-predict
Stock prediction with LSTM using TensorFlow and TypeScript.
ai artificial-intelligence cuda lstm machine-learning stock tensorflow typescript
Last synced: 25 Dec 2024
https://github.com/llm-db/understanding-gpu-architecture-implications-on-llm-serving-workloads
Understanding GPU Architecture Implications on LLM Serving Workloads (Master Thesis, ETH Zürich, 2024)
cuda inference pytorch rocm transformer
Last synced: 14 Dec 2024
https://github.com/prateekshukla1108/thunderkittens-docs
Documentation for ThunderKittens framework
Last synced: 24 Jan 2025
https://github.com/neel-dandiwala/cuda-programs
Miscellaneous programs that grasp the concept of Parallel Computing
cuda gpu-programming parallel-programming
Last synced: 26 Dec 2024
https://github.com/jpodivin/gputomata
Cellular automata running on CUDA capable GPUs
cellular-automata cellular-automaton cuda
Last synced: 27 Dec 2024
https://github.com/0xhilsa/vector-cuda
vector calculation with GPU acceleration using CUDA
c cpp11 cuda cuda-kernels cuda-programming nvcc
Last synced: 08 Feb 2025
https://github.com/dbklim/optimized_tensorflow_wheels
Optimized versions of TensorFlow and TensorFlow-GPU for specific CPUs and GPUs (both old and new).
cuda nvidia-cuda nvidia-gpu tensorflow tensorflow-community-wheels tensorflow-gpu tensorflow-packages tensorflow-whells wheels
Last synced: 10 Jan 2025
https://github.com/tommaso-dognini/polimi_gpu101_courseproject
Polimi Passion In Action GPU101 course project.
cpp cuda cuda-programming parallel-computing
Last synced: 26 Dec 2024
https://github.com/andreasholt/cuda-matmul-benchmarking
Implementing and benchmarking various matmul implementations in CUDA
Last synced: 26 Dec 2024
https://github.com/katpercent/raytracing
A foundation for ray tracing using CUDA and parallel computing techniques.
3d cuda engine game parrallel-computing ray raytracing
Last synced: 26 Dec 2024
https://github.com/sydney-informatics-hub/computer-vision-fine-tuning
Fine-tune a computer vision model to solve your task locally, on HPC, in a container, or in the cloud!
computer-vision cuda deep-learning python
Last synced: 22 Jan 2025
https://github.com/kis-balazs/cuda-research
CUDA Research & Code. Course-style structured. Inspiration from @Infatoshi.
Last synced: 26 Dec 2024
https://github.com/miferreiro/cdap-cuda
CUDA exercises for the subject "Computación Distribuída e de Altas Prestacións" (Distributed and High-Performance Computing) in the Master's Degree in Computer Engineering at the University of Vigo, 2020
Last synced: 27 Dec 2024
https://github.com/gama1903/cuda_programming
CUDA programming practice following Programming Massively Parallel Processors (4th edition), also drawing on the CUDA MODE series.
Last synced: 26 Dec 2024
https://github.com/ojeda-e/fokker-planck
Numerical solution of the Fokker-Planck equation in large times using CUDA/C.
Last synced: 26 Dec 2024
https://github.com/xueeinstein/udacity-cs344-cuda8
Code for Udacity CS344 (Intro to Parallel Programming) using CUDA 8.0
cuda cuda-8 parallel-computing
Last synced: 26 Dec 2024
https://github.com/brainlesslabs/jalebi
C++ String algorithms for maximum performance
c-plus-plus cplusplus cpp cpp-library cpu cuda library parallel performance simd sse string string-matching vectorization
Last synced: 26 Jan 2025
https://github.com/jonyandunh/stanforddogsresnet
A classifier for the 120 dog breeds in the Stanford Dogs Dataset, built with the PyTorch framework and a custom ResNet
cuda deep-learning python pytorch resnet resnet-18 standford-dog stanford
Last synced: 14 Jan 2025
https://github.com/rssr25/cuda
Following the CUDA by Example book.
cpp cuda cuda-programming hpc shaders
Last synced: 24 Dec 2024
https://github.com/baonguyen6742/uv-install-torch
Tutorial to install torch/pytorch with cuda using uv
cuda install installation package python pytorch resolver torch torchaudio torchvision tutorial uv
Last synced: 12 Feb 2025
https://github.com/grindelfp/cuda-n-body-simulation
Simulation of N-Body movement using CUDA.
Last synced: 12 Feb 2025
https://github.com/lttofu/cosmic
Fast, lightweight GUI-based C++ Ethereum ERC918 token miner for Win64 | CUDA GPUs | CPUs | Pool | Solo Mining
0xbitcoin 0xbtc cplusplus cplusplus-cli cpuminer cuda erc20 erc918 ethereum ethereum-token gpuminer gui pool-mining solo-mining windows windows-10 windows-7 windows-gui winforms
Last synced: 02 Jan 2025
https://github.com/marnovo/cuda-projects
cuda cuda-kernels gpu gpu-programming nvidia-cuda parallel-computing
Last synced: 26 Dec 2024
https://github.com/satyajitghana/gpu-programming
Contains the contents of GPU Architecture and Programming course done on NPTEL
c cpp cuda cuda-programming gpu-programming nptel nvidia
Last synced: 26 Dec 2024
https://github.com/brocbyte/cuball
CUDA-based implementation of "Real-Time Rigid Body Simulation on GPUs" [from GPU Gems 3]
Last synced: 05 Jan 2025
https://github.com/ran-2012/cuda-practice
CUDA practice code for the NVIDIA programming guide
Last synced: 10 Jan 2025
https://github.com/sustia-llc/gpu_logger_poc
GPU execution verification system with immutable Kafka logging. Monitors CUDA operations, validates GPU performance, and maintains auditable operation history. Built with Rust and Candle for reliable ML model execution tracking.
candle-core cuda docker gpu gpu-computing kafka logging machine-learning mlops monitoring nvidia performance-testing rust
Last synced: 12 Feb 2025
https://github.com/neel-dandiwala/npp_cudaatscale_project
For the enterprise course project, I have created a model that executes the histogram equalisation procedure on the given input image file.
Last synced: 26 Dec 2024
https://github.com/boostibot/bachelors
My bachelor's thesis at CTU in Prague, Faculty of Nuclear Sciences and Physical Engineering, supervised by Ing. Pavel Strachota, Ph.D.
crystal-growth cuda finite-volume-method parallel-programming phase-field-method
Last synced: 18 Jan 2025
https://github.com/flavienbwk/nvidia-cuda-mirror-docker
An all-in-one mirror for installing NVIDIA Docker.
cuda docker linux-mirror mirror nvidia nvidia-docker nvidia-docker2 offline offline-capable
Last synced: 28 Jan 2025
https://github.com/maxenceleguery/3d-render-engine
3D Render engine accelerated with CUDA
Last synced: 27 Dec 2024
https://github.com/flavienbwk/tensorflow2-cuda-10.2-docker
Tensorflow 2.3, CUDA 10.2, Docker compatible image
cuda docker python3 tensorflow ubuntu1804
Last synced: 28 Jan 2025
https://github.com/drilonaliu/parallel-s_aes-ccm-xts
aes cryptography cuda gpu parallel-programming saes
Last synced: 26 Jan 2025
https://github.com/roryclear/cuda-ml
Simple CUDA-optimized MNIST classifier
colab-notebook cuda mnist-classification pycuda
Last synced: 21 Jan 2025
https://github.com/snandasena/courseera_gpu_specilization
Example for CUDA streaming
Last synced: 14 Jan 2025
https://github.com/programmergnome/cuda-codes
Snippet repository for learning parallel GPU programming with CUDA.
c cpp-programming cuda cuda-kernel gpu-programming learning-materials parallel-programming parallelization
Last synced: 22 Jan 2025
https://github.com/nyxflower/mosaics-cuda-openmp
Simple image mosaic command line tool (CUDA-OpenMP-C Implementation)
c cuda gpu-programming mosaic mosaic-images openmp parallel-computing parallel-processing
Last synced: 03 Jan 2025