Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
CUDA
![](https://explore-feed.github.com/topics/cuda/cuda.png)
CUDA® is a parallel computing platform and programming model developed by NVIDIA for general computing on graphical processing units (GPUs). With CUDA, developers are able to dramatically speed up computing applications by harnessing the power of GPUs.
- GitHub: https://github.com/topics/cuda
- Wikipedia: https://en.wikipedia.org/wiki/CUDA
- Created by: Nvidia
- Released: June 23, 2007
- Related Topics: nvcc,
- Last updated: 2025-02-13 00:07:16 UTC
- JSON Representation
https://github.com/sbstndb/nbody_k
A simple 3D naïve NBody simulation using Kokkos enabling CUDA or OpenMP backend
cuda kokkos nbody openmp simulation
Last synced: 23 Jan 2025
https://github.com/snandasena/cuda-at-scale-for-the-enterprise
Gauss Filter with CUDA and NPP
Last synced: 01 Feb 2025
https://github.com/ncorgan/arrayfire-config-info
A small command-line utility that outputs all available ArrayFire devices
Last synced: 13 Feb 2025
https://github.com/nick8592/ubuntu-20.04-cuda-cudnn-pytorch
cuda cuda-toolkit cudnn python3 pytorch ubuntu2004
Last synced: 30 Jan 2025
https://github.com/adesoji1/visis_backend_assessment_submission-adesoji
Create a backend API to handle book information requests, and summary generation.
bart cache cuda data-extraction fastapi flask hugging-face hugging-face-hub llama postman-api python3 pytorch spacy sqlite3-database swagger-api tensorboard-visualizations transformer ubuntu2304
Last synced: 22 Dec 2024
https://github.com/ronaldsg20/compu-paralela
Códigos de ejemplo para computación paralela y distribuida
cuda opencv openmp posix-threads
Last synced: 05 Jan 2025
https://github.com/drilonaliu/parallel-permutation-cipher
cryptography cuda gpu parallel-programming permutation
Last synced: 26 Jan 2025
https://github.com/drilonaliu/parallel-image-edge-detection
cuda edge-detection gpu image-processing
Last synced: 26 Jan 2025
https://github.com/jackrekirby/raytracing-cuda
Raytracing using CUDA
cpp cuda raytracing raytracing-in-one-weekend
Last synced: 08 Feb 2025
https://github.com/rssr25/cuda
Following Cuda By Example book.
cpp cuda cuda-programming hpc shaders
Last synced: 24 Dec 2024
https://github.com/muhac/docker-jupyter-pytorch
JupyterLab for AI in Docker! Anaconda and PyTorch GPU supported.
conda-environment cuda docker jupyterlab pytorch
Last synced: 15 Dec 2024
https://github.com/andreeo/parallel-computing-cuda
Programs in terminal applying the parallel programming model with the CUDA arquitecture
c cpp cuda docker lineal-search parallel-computing parallel-reduction rank-sort-algorithm
Last synced: 02 Feb 2025
https://github.com/enapiuz/logic-circuit-simulator
Logic circuit (based on NAND gates) simulator using OpenCL
c circuit-simulator cuda digital-logic gpgpu logic-gates opencl simulator
Last synced: 06 Feb 2025
https://github.com/chrisdalvit/gpu-matrix-transpose
Implementation and benchmarking of different matrix transpose with CUDA
c cpp cuda cuda-kernels cuda-programming gpu-acceleration gpu-computing gpu-programming matrix-transpose nvidia-gpu
Last synced: 13 Feb 2025
https://github.com/rog0d/gpuss_watchers
"The GPU Watchers swore upon their shared memory hierarchy, from L1 to global memory, which also served as their mandate as lords of parallel computation."
cuda gpu-acceleration gpu-monitoring gpu-profiling
Last synced: 13 Feb 2025
https://github.com/m-torhan/advent-of-code
🎄 Solutions for the Advent of Code
advent-of-code advent-of-code-2024 cuda
Last synced: 13 Feb 2025
https://github.com/larygwil/cuda-samples-old
nvidia cuda samples old (5.0 - 7.5)
Last synced: 02 Feb 2025
https://github.com/atelierarith/julia_gpu_playground
For those who want use Julia with GPU
cuda docker docker-compose julia
Last synced: 06 Feb 2025
https://github.com/sephiroth7712/k-nearest-neigbours
Implementation of K-Nearest Neighbors algorithm using multiple parallel computing approaches: CUDA (GPU), Hadoop, Spark, MPI, OpenMP, and PThreads. Demonstrates scalable machine learning across different parallel computing paradigms from GPU to distributed frameworks.
cuda cuda-programming hadoop-mapreduce java mpi multiprocessing multithreading openmp pthreads scala spark
Last synced: 06 Feb 2025
https://github.com/uwuwuwu363/tts-local
🎤 Natural TTS App: A Python-based text-to-speech GUI with multi-language support, playback controls, and audio export. Built with Tkinter, gTTS, and Pygame. 🚀
chatbot cuda deep-learning multilingual ollama pinokio raspberry-pi speech-recognition speech-to-text text-to-speech tts voice voices wav
Last synced: 10 Feb 2025
https://github.com/aeyage/intraday_prices
GPU-accelerated portfolio optimisation
Last synced: 10 Feb 2025
https://github.com/belrbez/ship-graphic-qt-qml-cuda-c
Client-Server application for Rocket driving in QML graphics
c client-server cpp cuda qml qt5 rocket
Last synced: 06 Feb 2025
https://github.com/daelsepara/hipnewton
GPU Implementation of Newton Fractal Generator with Benchmarking
amd cuda fractal gpu gpu-compute gpu-computing hip newton parallel-computing rocm sdk
Last synced: 05 Feb 2025
https://github.com/marcellodesales/nvidea-cuda-ubuntu-docker
NVidea CUDA base image on Ubuntu Linux, used to run Machine Learning
ai cuda docker docker-compose machine-learning ml nvidia-docker ubuntu
Last synced: 03 Feb 2025
https://github.com/juntyr/necsim-rust-docs
Documentation of the spatially explicit biodiversity simulation necsim-rust
biodiversity cuda docs mpi necsim rust simulation
Last synced: 03 Feb 2025
https://github.com/ivanbgd/cuda_quad_c
Calculates a definite integral by using three different rules. Compares sequential to parallel implementations.
cuda integrals parallel-implementations
Last synced: 03 Feb 2025
https://github.com/chibby0ne/cuda_by_example
Old notes (and new ones) of the Cuda by Example book
cuda cuda-programming gpgpu gpu-computing gpu-programming
Last synced: 31 Dec 2024
https://github.com/sir-photch/wicuvanity
Generate wireguard vanity keys on your Nvidia GPU
cuda gpu vanity-address vanity-addresses vanitygen wireguard
Last synced: 23 Oct 2024
https://github.com/zelosleone/audiobook-generator
A GPU-accelerated Python application that converts PDF and TXT documents into high-quality MP4 audio files using WhisperSpeech technology.
ai-audio audiobook cuda gpu-acceleration machine-learning pdf-converter python pytorch speech-synthesis text-processing text-to-speech
Last synced: 03 Feb 2025
https://github.com/awikramanayake/optimized-matrix-mult
Optimizing matrix multiplication using parallelism and SIMD (AVX2, CUDA)
avx2 cuda matrix-multiplication
Last synced: 21 Jan 2025
https://github.com/sedflix/cuda_pattern_matching
Getting words frequency using the concepts of pattern matching in CUDA
Last synced: 31 Dec 2024
https://github.com/dhruvsrikanth/fastconv
Distributed and serial implementations of the 2D Convolution operation in c++ and CUDA.
convolution-filters cpp cuda gpu-programming high-performance-computing hpc image-editor image-processing nvidia parallel-programming
Last synced: 25 Dec 2024
https://github.com/amitkumarj441/deep-learning-on-your-finger
A rich collection of dockerfiles for installing deep learning dependecies on your way :rocket:
Last synced: 26 Jan 2025
https://github.com/k-hengzhou/hphoto
一个基于AI的智能照片管理工具,支持人脸识别、相似人脸自动聚类和nsfw检测
cuda insightface nsfw nsfw-detection nudenet photos
Last synced: 09 Jan 2025
https://github.com/mattjesc/federated-learning-simulation-1gpu-mi-is
Federated Learning Simulation on a Single GPU with Model Interpretability and Interactive Visualization
ai cuda deep-learning distributed-systems federated-learning gpu hpc keras machine-learning ml model-interpretability python pytorch simulation streamlit tensorflow
Last synced: 12 Oct 2024
https://github.com/ionmich/cs149-local-dev
Provides `conda` installation instructions for Stanford's CS149 (Parallel Computing) programming assignments
conda cs149 cuda ispc parallel-computing
Last synced: 06 Feb 2025
https://github.com/hrolive/fundamentals-of-accelerated-computing-with-cuda-python
Explore how to use Numba—the just-in-time, type-specializing Python function compiler—to create and launch CUDA kernels to accelerate Python programs on massively parallel NVIDIA GPUs.
accelerated-computing cuda cuda-programming jit numba nvidia python
Last synced: 06 Feb 2025
https://github.com/airvzxf/c-plus-plus-understanding-cuda
Understanding CUDA with C++
cuda hacktoberfest hacktoberfest-accepted
Last synced: 27 Jan 2025
https://github.com/vwkyc/detectron2-api
Detectron2 server API
api cpu-inference-api cuda detectron2 flask gunicorn self-hosted
Last synced: 05 Feb 2025
https://github.com/nvaranki/cmmx
CUDA matrix multiplication (official guide, modified)
Last synced: 10 Dec 2024
https://github.com/demetriantitus/machine-vision---yolov8
This project provides a comprehensive guide to object detection in cluttered environments using YOLOv8. It demonstrates how to identify and classify objects in both still images and video streams
computer-vision cuda dataset image-classification machine-learning nvidia-gpu object-detection surveillance traffic-monitoring video-analysis yolov8
Last synced: 05 Feb 2025
https://github.com/rkarahul/person-detector-faceverifier
Person-Detector-FaceVerifier is a sophisticated system for detecting and verifying faces in images. Ideal for applications like passport control and security, it combines advanced face detection with precise verification techniques.
bootstrap5 css3 cuda django html5 javascipt opencv-python os python pytorch yolov8
Last synced: 05 Feb 2025
https://github.com/dasbd72/nthu-ipc-2022
National Tsing Hua University - Introduction to Parallel Computing - 2022
cuda cuda-programming hpc mpi openmp pthreads
Last synced: 05 Feb 2025
https://github.com/thanduriel/cuda_hip_comparison
performance study of atomics on GPUs
Last synced: 05 Feb 2025
https://github.com/apostolis1/parallel-processing-systems
Project of the undergrad course "Parallel Processing Systems" - NTUA
benchmark c cuda mpi openmp parallel-computing
Last synced: 05 Feb 2025
https://github.com/anne-andresen/autoencoder_3d_c_cuda
3D Autoencoder training in raw C/CUDA
Last synced: 05 Feb 2025
https://github.com/moshidev/acap
Prácticas de la asignatura Arquitectura y Computación de Altas Prestaciones
cuda homework-assignments mpi pthreads
Last synced: 05 Feb 2025
https://github.com/senli1073/docker-gpu-monitor
A lightweight GPU monitor designed for real-time web-based viewing of GPU server status.
container cuda docker flask gpu gpu-monitoring linux memory-usage nvidia-smi web
Last synced: 06 Feb 2025
https://github.com/alan-cooney/python-cuda-starter-template
Python CUDA Starter Template
Last synced: 06 Feb 2025
https://github.com/bonevbs/cuknn
Cuda implementation of k-nearest neighbor search
Last synced: 20 Jan 2025
https://github.com/malolm/football-player-detection-with-yolov8
Football player detection YOLOv8 fine-tuning
cuda jupyterlab python3 yolov8-detection
Last synced: 06 Feb 2025
https://github.com/daviddavo/19gpu
Short exercises for GPU at Complutense University of Madrid. Mirror from GitLab
accelerator cuda gpu-programming
Last synced: 23 Jan 2025
https://github.com/blazekill/hello-cuda
Cpp + Vcpkg + CUDA + VsCode starter project.
Last synced: 01 Jan 2025
https://github.com/juntyr/necsim-rust-analysis
Analysis of the spatially explicit biodiversity simulation `necsim-rust`
analysis biodiversity cuda mpi necsim rust simulation
Last synced: 25 Jan 2025
https://github.com/muhamadajiw/parallel-matrix-inversion
A parallel program for matrix inversion using MPI, OpenMP, and CUDA
Last synced: 17 Jan 2025
https://github.com/rugleb/cuda
A simple example of a program that uses parallel GPU computing on an NVIDIA graphics card using CUDA technology
Last synced: 23 Dec 2024
https://github.com/saadarazzaq/cuda-device-info
Check if Cuda is correctly configured in your windows 🖥️
Last synced: 23 Jan 2025
https://github.com/tomaszrewak/csgpathtracer
A constructive solid geometry path tracer.
computer-graphics cuda path-tracing rendering
Last synced: 05 Jan 2025
https://github.com/denyskryvytskyi/capgemini-cuda
CUDA implementation of vector additon, matrix multiplication, reduction and sorting
bitonic-sort cpp cuda cuda-kernels gpgpu matrix matrix-multiplication matrix-multiplication-parallel matrix-transpose nvidia nvidia-cuda nvidia-gpu reduction-dimension sort sorting-algorithms-implemented vector vector-addition vectorization
Last synced: 10 Feb 2025
https://github.com/cerit-sc/scipion-docker
Scipion (Cryo em image processing framework (https://scipion.i2pc.es/)) adapted to run in Kubernetes.
cryo-em cryoem cuda desktop kubernetes scipion vnc
Last synced: 06 Dec 2024
https://github.com/macaycz/nn
A lightweight, GPU-accelerated machine learning library built with CUDA.
cuda deep-learning gpu machine-learning neural-network
Last synced: 20 Dec 2024
https://github.com/mxm-tr/docker-darknet-opencv
Accelerated objects detection on streams and files, using a Docker darknet YOLO container
cuda docker docker-compose object-recognition opencv-python python3 yolo
Last synced: 17 Jan 2025
https://github.com/cuda8/brainwords2
GPU brainflayer for sale $250
brain brainflayer brainwords cuda gpu key pass passphrase private
Last synced: 23 Oct 2024
https://github.com/edumucelli/build-tensorflow
Build Tensorflow from source using a Dockerfile
Last synced: 24 Dec 2024
https://github.com/jeremywildsmith/shadowhash
Elixir distributed Shadow File password cracker with GPU accelerated cracking for md5crypt hashing algorithm.
cracking-hashes cuda distributed-systems elixir hashing nx security
Last synced: 20 Dec 2024
https://github.com/hdelan/msc-hpc-final-project
In this project I implement a CUDA Lanczos method to approximate the matrix exponential. The matrix exponential is an important centrality measure for large, sparse graphs.
cuda graph-algorithms krylov-methods
Last synced: 24 Dec 2024
https://github.com/bdwhst/fluora
A CUDA PBR path tracer
cpp cuda pathtracing pbr rendering
Last synced: 12 Feb 2025
https://github.com/notkartikye/cuda-image-box-filters
🖼️ CUDA-powered tool for applying box filters to a large amount of images
cuda cuda-library cuda-programming npp
Last synced: 25 Dec 2024
https://github.com/fmigneault/dockers
Collection of docker setup with common libraries for image processing and machine learning.
boost cuda docker image-processing opencv python
Last synced: 25 Dec 2024
https://github.com/dhruvsrikanth/monte-carlo-ray-tracing
In this repository, you will find a serial and distributed GPU-based implementation of the ray tracing simulation.
c cpp cuda gpu-computing gpu-programming high-performance-computing parallel-programming raytracing unified-memory-parallelism
Last synced: 25 Dec 2024
https://github.com/cs550-epfl/report
EPFL CS-550 project report
cuda formal-verification gpu memory-consistency ptx simt
Last synced: 10 Jan 2025
https://github.com/matteopolak/stock-predict
Stock prediction with LSTM using TensorFlow and TypeScript.
ai artificial-intelligence cuda lstm machine-learning stock tensorflow typescript
Last synced: 25 Dec 2024
https://github.com/tfogal/gemm-db
For creating a cacheable GEMM cost model.
Last synced: 21 Jan 2025
https://github.com/dbklim/optimized_tensorflow_wheels
Optimized versions TensorFlow and TensorFlow-GPU for specific CPUs and GPUs (for both old and new).
cuda nvidia-cuda nvidia-gpu tensorflow tensorflow-community-wheels tensorflow-gpu tensorflow-packages tensorflow-whells wheels
Last synced: 10 Jan 2025
https://github.com/tommaso-dognini/polimi_gpu101_courseproject
Polimi Passion In Action GPU101 course project.
cpp cuda cuda-programming parallel-computing
Last synced: 26 Dec 2024
https://github.com/andreasholt/cuda-matmul-benchmarking
Implementing and benchmarking various matmul implementations in CUDA
Last synced: 26 Dec 2024
https://github.com/katpercent/raytracing
A foundation for ray tracing using CUDA and parallel computing techniques.
3d cuda engine game parrallel-computing ray raytracing
Last synced: 26 Dec 2024
https://github.com/kis-balazs/cuda-research
CUDA Research & Code. Course-style structured. Inspiration from @Infatoshi.
Last synced: 26 Dec 2024
https://github.com/gama1903/cuda_programming
Practice of cuda programming according to <<programming massively parallel processors 4th>>, also refer to CUDA MODE series.
Last synced: 26 Dec 2024
https://github.com/ojeda-e/fokker-planck
Numerical solution of the Fokker-Planck equation in large times using CUDA/C.
Last synced: 26 Dec 2024
https://github.com/xueeinstein/udacity-cs344-cuda8
Code for Udacity CS344 (Intro to Parallel Programming) using CUDA 8.0
cuda cuda-8 parallel-computing
Last synced: 26 Dec 2024
https://github.com/psteinb/gtc2017
Slides for my presentation at GTC 2017 from May 8-11 in Silicon Valley
compression cuda ffmpeg gpu gpu-computing h264 h265 microscopes spim
Last synced: 06 Jan 2025
https://github.com/lttofu/cosmic
Fast, lightweight GUI-based C++ Ethereum ERC918 token miner for Win64 | CUDA GPUs | CPUs | Pool | Solo Mining
0xbitcoin 0xbtc cplusplus cplusplus-cli cpuminer cuda erc20 erc918 ethereum ethereum-token gpuminer gui pool-mining solo-mining windows windows-10 windows-7 windows-gui winforms
Last synced: 02 Jan 2025
https://github.com/marnovo/cuda-projects
cuda cuda-kernels gpu gpu-programming nvidia-cuda parallel-computing
Last synced: 26 Dec 2024
https://github.com/satyajitghana/gpu-programming
Contains the contents of GPU Architecture and Programming course done on NPTEL
c cpp cuda cuda-programming gpu-programming nptel nvidia
Last synced: 26 Dec 2024
https://github.com/ran-2012/cuda-practice
cuda practice code for nvidia programming guide
Last synced: 10 Jan 2025
https://github.com/neel-dandiwala/npp_cudaatscale_project
For the enterprise course project, I have created a model that executes the histogram equalisation procedure on the given input image file.
Last synced: 26 Dec 2024
https://github.com/maxenceleguery/3d-render-engine
3D Render engine accelerated with CUDA
Last synced: 27 Dec 2024
https://github.com/pvgupta24/parallel-programming
Basic algorithms for parallel programming in CUDA C++, Java and OpenMP
cuda openmp parallel-programming
Last synced: 06 Jan 2025