An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with cuda-toolkit

A curated list of projects in awesome lists tagged with cuda-toolkit .

https://github.com/deftruth/cuda-learn-notes

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA/Tensor Cores Kernels, HGEMM, FA-2 MMA etc.🔥

cuda cuda-12 cuda-cpp cuda-demo cuda-kernel cuda-kernels cuda-library cuda-toolkit flash-attention hgemm learn-cuda leet-cuda

Last synced: 14 May 2025

https://github.com/xlite-dev/cuda-learn-notes

📚Modern CUDA Learn Notes: 200+ Tensor/CUDA Cores Kernels🎉, HGEMM, FA2 via MMA and CuTe, 98~100% TFLOPS of cuBLAS/FA2.

cuda cuda-kernels cuda-programming cuda-toolkit cudnn cutlass flash-attention flash-mla gemm gemv hgemm

Last synced: 15 Apr 2025

https://github.com/xlite-dev/CUDA-Learn-Notes

📚200+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).

cuda cuda-kernels cuda-programming cuda-toolkit cudnn cutlass flash-attention flash-mla gemm gemv hgemm

Last synced: 26 Mar 2025

https://github.com/DefTruth/CUDA-Learn-Notes

📚200+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).

cuda cuda-kernels cuda-programming cuda-toolkit cudnn cutlass flash-attention flash-mla gemm gemv hgemm

Last synced: 20 Mar 2025

https://github.com/phohenecker/switch-cuda

A simple bash script for switching between installed versions of CUDA.

bash-script cuda-toolkit

Last synced: 05 Apr 2025

https://github.com/JuliaGPU/CUDAnative.jl

Julia support for native CUDA programming

cuda cuda-toolkit julia julia-library

Last synced: 22 Jul 2025

https://github.com/jimver/cuda-toolkit

GitHub Action to install CUDA

action cuda cuda-toolkit github-actions nvidia nvidia-cuda

Last synced: 14 Apr 2025

https://github.com/easternjournalist/diffnum

A light-weighted and flexible C++ differentiable programming library. Just replace float and double with it, and it does Auto-Grad for you...

autograd cpp cuda-toolkit

Last synced: 07 May 2025

https://github.com/sbl-sdsc/df-parallel

Comparison of Dataframe libraries for parallel processing of large tabular files on CPU and GPU.

cuda-toolkit dask dask-cudf dask-dataframes dataframes gpu-computing parallel-processing pyspark-dataframes rapidsai

Last synced: 12 Apr 2025

https://github.com/zeuscoderbe/legal-chat-bot

This project is designed as a legal advisory chatbot system to assist users by answering questions related to Vietnamese law.The main data source includes legal documents collected from official Vietnamese legal websites.

cuda-toolkit docker fine-tuning flask llm mlops python

Last synced: 13 Apr 2025

https://github.com/iacolippo/gpu-dnn-install

Scripts and instructions to install CUDA, cuDNN and the most common deep learning frameworks.

cuda-toolkit cudnn install-script theano torch

Last synced: 23 Aug 2025

https://github.com/wissem01chiha/cuastar

Parallel implementation of the A* trajectory planner algorithm on NVIDIA GPUs for dense point cloud environments

c cpp cpp17 cuda-programming cuda-toolkit motion-planning motion-planning-algorithm navigation nvidia-gpu openmp openmp-parallelization point-cloud point-cloud-processing simd-intrinsics vtk

Last synced: 11 Apr 2025

https://github.com/manfreddiaz/deep-docker

Dockerfile for creating a docker image with default configurations for Deep Learning tasks

cuda-toolkit deep-learning docker docker-image keras opencv3 pytorch tensorflow

Last synced: 10 Apr 2025

https://github.com/cedrickchee/dockerfile-fastai

Dockerfile for building NVIDIA CUDA image for PyTorch 1.0 and fastai 1.0 deep learning

cuda-toolkit deep-learning docker-container fastai nvidia-docker pytorch

Last synced: 25 Dec 2025

https://github.com/david-palma/cuda-programming

Educational CUDA C/C++ programming repository with commented examples on GPU parallel computing, matrix operations, and performance profiling. Requires a CUDA-enabled NVIDIA GPU.

c-cpp cpp cuda cuda-toolkit education gpu gpu-programming kernel matrix-operations nvcc nvidia parallel-computing parallel-programming practice profiling threads

Last synced: 26 Mar 2025

https://github.com/hariprashad-ravikumar/accelerated-computing-in-cuda-c

This repo contains my codes for problem sets in NVIDIA Getting Started with Accelerated Computing in CUDA C/C++

c cuda cuda-kernels cuda-toolkit

Last synced: 01 Jul 2025

https://github.com/zeuscoderbe/vietnam-political-consulting-chatbot

The information system investigates data, guides public opinion and enhances political activities for Party officials

cuda-toolkit fine-tuning flask llm lora rag

Last synced: 11 Oct 2025

https://github.com/kartavyaantani/cuda_image_processing

A CUDA-accelerated image processing project featuring multiple GPU-based filters and enhancement techniques. Implements convolution, edge detection, Non-Local Means (NLM) denoising, K-Nearest Neighbors (KNN), and pixelization. Each operation is optimized using CUDA kernels for real-time performance on large images. The project supports command-line

cuda cuda-kernels cuda-programming cuda-toolkit gpu-programming high-performance-computing image-manipulation image-processing nvidia-cuda nvidia-gpu

Last synced: 19 Apr 2025

https://github.com/alekseyscorpi/vacancies_server

This is a server for vacancies generation using LLM (Saiga3)

code cuda cuda-toolkit docker dockerfile flask llama3 llamacpp llm ngrok pydantic saiga

Last synced: 20 Jul 2025

https://github.com/codingrule/cuda-mbrot

Just another mandlebrot with cuda

cuda cuda-toolkit cupy fractal mandelbrot mathematics nvidia

Last synced: 30 Mar 2025

https://github.com/qtle3/gpu-checker

This script checks the availability of CUDA-enabled GPUs and prints detailed GPU information for both PyTorch and TensorFlow frameworks. It's a handy utility for ensuring that your deep learning environment is correctly configured to utilize GPU acceleration.

cross-framework cuda-toolkit gpu-acceleration gpu-computing gpu-information gpu-monitoring pytorch tensorflow

Last synced: 13 May 2025

https://github.com/jakubfr4czek/concurrent-gauss-elimination

Concurrent gaussian elimination algorithm implemented using traces theory. Parallelism has been achieved employing CUDA cores.

agh agh-ust agh-wi conda cuda cuda-kernels cuda-toolkit diekert-graph graphviz java python python3 traces-theory

Last synced: 20 Feb 2025

https://github.com/ranitmanik/dermadetectai

DermaDetectAI is a Flask app for detecting skin diseases using deep learning. Developed by Ranit Manik and team, it includes models for identifying 5, 10, and 23 skin conditions, and supports NVIDIA GPU acceleration. The models are trained with PyTorch.

cuda-toolkit flask gpu-acceleration machine-learning ml ml-engineering nvdia pytorch skin-detection

Last synced: 23 Mar 2025

https://github.com/anis196/bitesense

This project is a deep learning-based classification model using ResNet50 and TensorFlow to classify snake bites as Poisonous or Non-Poisonous based on wound patterns. The model is trained on an image dataset and fine-tuned for better accuracy using GPU.

cuda-toolkit cudnn deep-neural-networks python resnet-50 tensorflow-gpu

Last synced: 26 Jun 2025

https://github.com/assem-elqersh/parallel-and-distributed-computing-lab

Series of laboratory assignments focused on parallel and distributed computing concepts, from basic speedup analysis to advanced GPU programming.

cuda-programming cuda-toolkit openmp parallel-computing

Last synced: 23 Jun 2025

https://github.com/sufremoak/moudo

Python-based Library and Toolkit for Programming Computer Mice, inspired by CUDA

cuda cuda-toolkit cython-library mouse mouse-movement programming

Last synced: 20 Mar 2025

https://github.com/andrewtwin/ansible-role-install_nvidia_cuda

Ansible role for installing Nvidia drivers and CUDA toolkit

ansible-role cuda-toolkit drivers nvidia

Last synced: 19 Jun 2025

https://github.com/sahil-rajwar-2004/vector.py-cuda

a python lib for vector with GPU support

cuda-support cuda-toolkit python311 vector

Last synced: 24 Apr 2025

https://github.com/nuhan711/nvidia-driver-installer

Automate NVIDIA driver installation on various Linux distributions with this simple script. Get started quickly and easily! 🚀💻

archlinux build compile cuda cuda-toolkit cudnn deep-learning docker driver drivers grub install mok-management nvidia systemd-boot tensorflow tutorial ubuntu-drivers

Last synced: 30 Dec 2025

https://github.com/bd2720/accesspatterns

Comparing chunked vs. striped memory access patterns for CPU and GPU code using the CUDA toolkit in C.

c cache cuda cuda-toolkit performance-analysis performance-testing profiling

Last synced: 03 Oct 2025

https://github.com/tomtolleson/cuda-kernel-benchmarking-tool

A benchmarking tool in C++ that creates Cuda kernels and tests the overall system performance between CPU and GPU

cuda cuda-kernels cuda-support cuda-toolkit nvidia nvidia-cuda nvidia-gpu

Last synced: 30 Mar 2025

https://github.com/isquicha/cuda-parallel-studies

Learning CUDA programming here =D

cuda cuda-programming cuda-toolkit

Last synced: 03 Jul 2025

https://github.com/0xhilsa/variable

variable + CUDA

cuda-kernels cuda-toolkit python3

Last synced: 09 Apr 2025

https://github.com/mrgkanev/tensorflow-gpu-docker-setup

A Docker environment for TensorFlow GPU development with optimized configurations for WSL2, troubleshooting guides, and common error fixes

cuda cuda-toolkit deep-learning dev-environment development-tools docker gpu-acceleration machine-learning nvidia-docker nvidia-docker-support python tensorflow

Last synced: 07 Jul 2025

https://github.com/0xhilsa/vector.py-cuda

a python lib for vector with GPU support

cuda-support cuda-toolkit python311 vector

Last synced: 14 Jul 2025

https://github.com/efecaliskannn/generative-adverserial-networks--gan-

DCGAN was used for synthetic data generation, ACGAN for classification, and SRGAN for image enhancement.

artificial-intelligence auxiliary-classifier-gan cuda-toolkit cudnn deep-learning generative-adversarial-network python super-resolution-image

Last synced: 02 Mar 2025