Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
CUDA
![](https://explore-feed.github.com/topics/cuda/cuda.png)
CUDA® is a parallel computing platform and programming model developed by NVIDIA for general computing on graphical processing units (GPUs). With CUDA, developers are able to dramatically speed up computing applications by harnessing the power of GPUs.
- GitHub: https://github.com/topics/cuda
- Wikipedia: https://en.wikipedia.org/wiki/CUDA
- Created by: Nvidia
- Released: June 23, 2007
- Related Topics: nvcc,
- Last updated: 2025-02-13 00:07:16 UTC
- JSON Representation
https://github.com/drilonaliu/parallel-mandelbrot-set
GPU-accelerated Mandelbrot Set generation with CUDA and OpenGL interoperability.
cuda fractals gpu mandelbrot-fractal parallel-programming
Last synced: 26 Jan 2025
https://github.com/bhavinpatel4199/image-processing-with-opencv-and-cuda-on-google-colab
This repository demonstrates image processing using OpenCV with CUDA for GPU acceleration on Google Colab. It includes basics like displaying and manipulating images, alongside advanced techniques using CUDA to enhance performance. Ideal for learning GPU-accelerated image processing in Python.
computer-vision cuda google-colab gpu-acceleration high-performance-computing image-processing opencv pixel-manupulation
Last synced: 12 Feb 2025
https://github.com/drilonaliu/parallel-image-scaling
cuda gpu image-processing scaling-algorithms
Last synced: 26 Jan 2025
https://github.com/drilonaliu/parallel-caesar-cipher
caesar-cipher cryptography cuda gpu parallel-programming
Last synced: 26 Jan 2025
https://github.com/drilonaliu/parallel-sierpinski-triangle
GPU-accelerated Sierpinski Triangle generation with CUDA and OpenGL interoperability.
cuda fractals gpu parallel-programming sierpinski-triangle
Last synced: 26 Jan 2025
https://github.com/airvzxf/c-plus-plus-understanding-cuda
Understanding CUDA with C++
cuda hacktoberfest hacktoberfest-accepted
Last synced: 27 Jan 2025
https://github.com/awikramanayake/optimized-matrix-mult
Optimizing matrix multiplication using parallelism and SIMD (AVX2, CUDA)
avx2 cuda matrix-multiplication
Last synced: 21 Jan 2025
https://github.com/shibatch/tlfloat
Template library for floating point operations
arbitrary-precision constexpr cplusplus cpp20 cuda float128 floating-point half-precision ieee754 math octuple-precision quadruple-precision templates
Last synced: 10 Jan 2025
https://github.com/ivanbgd/cuda_quad_c
Calculates a definite integral by using three different rules. Compares sequential to parallel implementations.
cuda integrals parallel-implementations
Last synced: 03 Feb 2025
https://github.com/lablup/backend.ai-accelerator-cuda
The Backend.AI CUDA Accelerator Plugin
Last synced: 03 Jan 2025
https://github.com/juntyr/necsim-rust-docs
Documentation of the spatially explicit biodiversity simulation necsim-rust
biodiversity cuda docs mpi necsim rust simulation
Last synced: 03 Feb 2025
https://github.com/aaaastark/nvidia-cuda-google-colab
Deployment of NVIDIA-CUDA on Google Colab. With in examples codes (Vector Addition and Matrix Multiplication).
c cpp cuda googlecolab googlecolaboratory matrix-multiplication nvidia python vector-addition
Last synced: 15 Jan 2025
https://github.com/matiasvlevi/cuno
Provides cuda bindings, kernel maps and device memory managment for Dannjs computations. [Experimental and not complete]
addon cuda dann dannjs machine-learning nodejs
Last synced: 11 Jan 2025
https://github.com/marcellodesales/nvidea-cuda-ubuntu-docker
NVidea CUDA base image on Ubuntu Linux, used to run Machine Learning
ai cuda docker docker-compose machine-learning ml nvidia-docker ubuntu
Last synced: 03 Feb 2025
https://github.com/aeyage/intraday_prices
GPU-accelerated portfolio optimisation
Last synced: 10 Feb 2025
https://github.com/uwuwuwu363/tts-local
🎤 Natural TTS App: A Python-based text-to-speech GUI with multi-language support, playback controls, and audio export. Built with Tkinter, gTTS, and Pygame. 🚀
chatbot cuda deep-learning multilingual ollama pinokio raspberry-pi speech-recognition speech-to-text text-to-speech tts voice voices wav
Last synced: 10 Feb 2025
https://github.com/shreya888/learning-cuda-with-cpp-and-pytorch
My notes, code, & insights will be recorded here while learning CUDA with C++ and PyTorch
Last synced: 30 Dec 2024
https://github.com/hrolive/data-analytics-in-the-era-of-large-scale-machine-learning
Slides and other material for the Cyprus NCC training event about "Data analytics in the era of large-scale machine learning".
cuda deep-learning gpu-acceleration gradient-boosting large-language-models machine-learning preprocessing python pytorch
Last synced: 04 Jan 2025
https://github.com/pankajarm/ethereum-mining-cuda
cuda ethereum ethereum-mining ethminer ubuntu1604
Last synced: 13 Jan 2025
https://github.com/tianzonglin/cloud-control-gui
A tool to compute, visualize, analyse and drag points (high-dimensional data)
cuda interaction-design visualization
Last synced: 30 Dec 2024
https://github.com/hit07/ml-dl-torch
This repository contains comprehensive understanding of Machine Leaning, DeepLeaning using Pytorch
computer-vision convolutional-neural-networks cuda neural-networks pytorch
Last synced: 11 Jan 2025
https://github.com/jmuwrobotics/libbicos
GPU-Accelerated Binary Correspondence Search for Multishot Stereo Vision
computer-vision cuda depth-map stereo-camera stereo-matching stereo-vision
Last synced: 30 Dec 2024
https://github.com/abhiram-kandiyana/cuda-blast-2024
Reimplementation of NCBI BLAST with CUDA backend for faster retrieval
blast cuda gpu-acceleration parallel-processing
Last synced: 21 Jan 2025
https://github.com/d-krylov/cuda_to_opengl
Simple examples for CUDA OpenGL interoperability
Last synced: 11 Jan 2025
https://github.com/hnthap/vietnamese-word-segment
Vietnamese word segmentation package.
cuda torch transformers vietnamese vietnamese-nlp vietnamese-tokenizer word-segmentation
Last synced: 21 Jan 2025
https://github.com/ray-chew/modified_ch
Density functional theory (DFT) and self-consistent field theory (SCFT) simulation of diblock copolymers
cuda density-functional-theory diblock-copolymer numerical-analysis numerical-methods self-consistent-field-theory
Last synced: 11 Jan 2025
https://github.com/ivanbuccella/sf2bio
Deep reinforcement learning for de novo drug design: a ReLeaSe method execution on a Docker Environment
cuda deep-learning deep-reinforcement-learning docker docker-compose machine-learning nvidia-cuda nvidia-docker reinforcement-learning release release-method
Last synced: 11 Jan 2025
https://github.com/timdev-r/cv-ground-truth-extraction
(Dump) Helper for ground truth extraction, movement analytics and silhouette visual demonstration
computer-vision cuda ground-truth intel-realsense pandas python
Last synced: 21 Jan 2025
https://github.com/sandialabs/tenzing
Core library for optimizing CUDA+MPI programs as sequential decision problems.
cuda mpi scr-2759 sequential-decision-problem
Last synced: 11 Jan 2025
https://github.com/proafxin/cuda-docker
High performance computing Images with pycuda and tensorrt preinstalled
cuda docker dockerfile libcudnn nvidia-tensorrt pycuda python tensorrt
Last synced: 12 Jan 2025
https://github.com/mvishiu11/kmeans-clustering
K-Means Clustering with both GPU (CUDA) and CPU implementations
Last synced: 21 Jan 2025
https://github.com/kabir5296/deep-learning-setup-for-ubuntu-guide
CUDA, CuDNN, NVIDIA Driver, and PyTorch Installation for Ubuntu
cuda cudnn deeplearning nlp python pytorch
Last synced: 21 Jan 2025
https://github.com/kichappa/spy-sim
Simulate a spying strategy on a topography
combat-modeling cuda differential-equations julia modeling-and-simulation topography-simulation
Last synced: 12 Jan 2025
https://github.com/simonschoelly/poisson-solver
A solver for a modified poisson equation using cuda.
cpp cuda finite-difference gpgpu pgc poisson-equation preconditioned-conjugate-gradient thomas-algorithm
Last synced: 12 Jan 2025
https://github.com/occisor2/fluidsimulation
Second project of my parallel algorithms course
cuda high-performance-computing
Last synced: 11 Jan 2025
https://github.com/hr-fahim/transformer-model-optimization
Sample GPT Transformer Model from Scratch.
cuda few-shot-learning transfomers
Last synced: 24 Jan 2025
https://github.com/michaelfranzl/image_fah-client
Dockerfile for Folding@home client with AMD and Nvidia GPGPU support
container cuda debian docker foldingathome gpu-computing opencl
Last synced: 21 Jan 2025
https://github.com/yangfengzzz/tardis
Travel space and time by using autodiff and codegen
Last synced: 09 Feb 2025
https://github.com/islamshahil/live-video-analysis
Live Video Analysis using PyTorch
cuda deeplearning neural-network opencv-python python pytorch video-processing webcam
Last synced: 26 Jan 2025
https://github.com/scar17off/ai-2048
A Python implementation of 2048 with a self-learning AI agent powered by TensorFlow. Features reinforcement learning, GPU acceleration, and real-time gameplay visualization.
2048 2048-ai 2048-game artificial-intelligence cuda deep-learning game-ai gpu-computing machine-learning neural-networks pygame python reinforcement-learning self-learning tensorflow
Last synced: 30 Dec 2024
https://github.com/saiccoumar/cuda-programming-exercises
Brief collection of GPU exercises (my reimplementation). Comes with relevant resources.
cuda cuda-programming nvcc nvidia
Last synced: 18 Jan 2025
https://github.com/lord-turmoil/cudacmakedemo
A demo for building CUDA program with CMake
Last synced: 23 Jan 2025
https://github.com/danieljvickers/fluid_simulation
An educational example for learning the Navier-Stoke equations. Also included is a C++ and CUDA shared object library, buildable with CMake, for use in your personal projects.
cpp cuda differential-equations navier-stokes numpy physics python simulation
Last synced: 30 Dec 2024
https://github.com/deep-1704/coa-lab-repo
Computer Organization and Architecture lab assignments.
Last synced: 18 Jan 2025
https://github.com/adesoji1/visis_backend_assessment_submission-adesoji
Create a backend API to handle book information requests, and summary generation.
bart cache cuda data-extraction fastapi flask hugging-face hugging-face-hub llama postman-api python3 pytorch spacy sqlite3-database swagger-api tensorboard-visualizations transformer ubuntu2304
Last synced: 22 Dec 2024
https://github.com/f-koehler/itesol
WIP: Iterative eigensolvers for C++20, Python and CUDA
cpp20 cuda eigenvalues linear-algebra python
Last synced: 28 Dec 2024
https://github.com/ronaldsg20/compu-paralela
Códigos de ejemplo para computación paralela y distribuida
cuda opencv openmp posix-threads
Last synced: 05 Jan 2025
https://github.com/jackrekirby/raytracing-cuda
Raytracing using CUDA
cpp cuda raytracing raytracing-in-one-weekend
Last synced: 08 Feb 2025
https://github.com/betarixm/csed490c
POSTECH: Heterogeneous Parallel Computing (Fall 2023)
cuda gpu parallel-computing postech
Last synced: 19 Jan 2025
https://github.com/emanuelemessina/cuda-benchmark
Evaluate matrix calculations time between CPU and GPU (CUDA)
benchmark cuda matrix-calculations
Last synced: 10 Feb 2025
https://github.com/wojcikmikolaj/particles-in-a-jar
Collisions between particles simulated on GPU.
algorithms-and-data-structures collision-detection collisions cuda gpu-programming
Last synced: 19 Jan 2025
https://github.com/vectorworksreal/ooba-text-docker
ooba text gen webui docker image.
artificial-intelligence containerization cuda docker docker-image large-language-model llm machine-learning python python3 text-generation text-generation-webui ubuntu webui
Last synced: 10 Feb 2025
https://github.com/illagrenan/cuda-80-cudnn6-runtime-1604-py36
Ubuntu 16.04 with Python 3.6 and CUDA Dockerfile
Last synced: 19 Jan 2025
https://github.com/illagrenan/cuda-90-cudnn7-runtime-1604-py36
Ubuntu 16.04 with Python 3.6 and CUDA9 Dockerfile
Last synced: 19 Jan 2025
https://github.com/kenwuqianghao/c4ai-cuda-birds
Homework assignments for C4AI Beginners in Research-Driven Studies
Last synced: 27 Dec 2024
https://github.com/aaditya29/parallel-computing-and-cuda
Learning about Parallel Computing and GPU programming using CUDA.
c cpp cuda cuda-kernels cuda-programming nvidia-cuda openmp openmpi parallel-computing parallel-programming
Last synced: 07 Feb 2025
https://github.com/neuro-mechatronics-interfaces/python_intan
Tools and demos for working with EMG data from intan using python
circuitpython cuda emg pico python realtime tensorflow
Last synced: 30 Dec 2024
https://github.com/dwain-barnes/llm-gguf-auto-converter
Automated Jupyter notebook solution for batch converting Large Language Models to GGUF format with multiple quantization options. Built on llama.cpp with HuggingFace integration.
auto-converter batch-processing cuda gguf huggingface jupyter-notebook llama-cpp llm model-quantization
Last synced: 31 Jan 2025
https://github.com/dlr-amr/t8gpu
Header-only finite volume library targetting GPUs using t8code as meshing backend.
adaptive-mesh-refinement cuda finite-volume gpgpu-computing hpc mesh mpi parallel-computing simulation
Last synced: 06 Feb 2025
https://github.com/phrutis/minikeys_for_sale
GPU program for brute MiniKeys Casascius Serie1 (22 characters)
bitcoin brute-force btc casascius cuda gpu minikeys program uncompressed
Last synced: 24 Jan 2025
https://github.com/sarodyatawatta/flagpol
Energy and polarization based interference mitigation
cuda energy mixed-precision polarization radio-frequency-interference radio-interferometry reinforcement-learning statistical-inference
Last synced: 07 Jan 2025
https://github.com/mathiasotnes/gemm
General Matrix Multiplication (GEMM) optimization in Cuda.
Last synced: 31 Jan 2025
https://github.com/jaidevd/ipec-fdp
cuda hpc keras mapreduce numba spark tensorflow
Last synced: 01 Feb 2025
https://github.com/ahmed5827/image_generation
This application provides a graphical user interface (GUI) for generating images using the Stable Diffusion model. The GUI allows users to input a text prompt, and the application generates an image based on the prompt.
ai cuda generative-ai image-generation
Last synced: 07 Jan 2025
https://github.com/brendanm12345/simple_renderer_cs149
Simple CUDA renderer implementation. 19th most efficient out of 150+ submissions
Last synced: 07 Jan 2025
https://github.com/abdelrahman-amen/active_learning_with_different_query_strategies
This project explores the implementation of active learning techniques, focusing on various query strategies to optimize the selection of informative data points for model training. It aims to reduce the amount of labeled data required while improving model performance, especially in scenarios with limited labeled data.
activelearning cuda entropy kldivergence margin numpy python pyto uncertainty
Last synced: 24 Jan 2025
https://github.com/daelsepara/hipnewton
GPU Implementation of Newton Fractal Generator with Benchmarking
amd cuda fractal gpu gpu-compute gpu-computing hip newton parallel-computing rocm sdk
Last synced: 05 Feb 2025
https://github.com/akhuntsaria/image-filters
Image filters implemented in CUDA C/C++
Last synced: 07 Jan 2025
https://github.com/sugarcane-mk/finetuning_wav2vec2
This repo provides step by step process from sctatch to fine tune facebook's wav2vec2-large model using transformers
asr asr-model cuda facebook fairseq fine-tuning finetuning huggingface librosa python torch transformers wav2vec2 wav2vec2-large-960h
Last synced: 24 Jan 2025
https://github.com/sarah627/horus_eye_fcih_graduation_project
An AI-powered tourism website using YOLOv7 for real-time landmark detection in images. Built with Flask, PyTorch, and Roboflow for seamless tourist interaction.
computer-vision cuda flask jupyter-notebook kaggle matplotlib object-detection opencv python pytorch roboflow
Last synced: 21 Jan 2025
https://github.com/marius311/cudadistributedtools.jl
A set of utility tools for multi-GPU + multi-process workflows
Last synced: 07 Feb 2025
https://github.com/roryclear/cuda-ml
simple cuda optimized mnist classifier
colab-notebook cuda mnist-classification pycuda
Last synced: 21 Jan 2025
https://github.com/curiousci/wind
Multicore systems programming course project
Last synced: 29 Dec 2024
https://github.com/sir-photch/wicuvanity
Generate wireguard vanity keys on your Nvidia GPU
cuda gpu vanity-address vanity-addresses vanitygen wireguard
Last synced: 23 Oct 2024
https://github.com/bjornmelin/ml-production-engineering
⚙️ End-to-end ML deployment solutions. Focused on model serving, multi-GPU optimization, and production-grade system implementation. 🎯
cuda deployment docker fastapi gpu-computing kubernetes mlops production
Last synced: 25 Jan 2025
https://github.com/starlitdreams/pacman-convolutional-q-learning
This project implements a Deep Q-Network (DQN) using PyTorch to train an agent to play Atari's Ms. Pac-Man. It utilizes reinforcement learning with a convolutional neural network (CNN) for image processing. Features include experience replay, frame preprocessing, and CUDA support, with trained model saving and video rendering of gameplay.
artificial-intelligence artificial-neural-networks atari cuda deep-learning deep-learning-algorithms deep-q-learning deeplearning gymnasium gymnasium-environment python pytorch
Last synced: 07 Feb 2025
https://github.com/bjornmelin/ml-vision-lab
👁️ Production-grade computer vision implementations. Real-world applications in image processing, object detection, and video analytics with GPU acceleration. 📸
computer-vision cuda deep-learning image-processing object-detection opencv pytorch video-analytics
Last synced: 25 Jan 2025
https://github.com/rushirg/cuda-matrix-multiplication
Matrix Multiplication on GPGPU in CUDA
cpu cuda gpu parallel-processing
Last synced: 21 Jan 2025
https://github.com/h1me01/cuda_neural_network
Cuda version of my previous AVX-512 based neural network.
chess cuda cuda-programming neural-network
Last synced: 07 Jan 2025
https://github.com/bjornmelin/nlp-engineering-hub
📚 Enterprise NLP systems and LLM applications. Features custom language model implementations, distributed training pipelines, and efficient inference systems. 🔤
cuda gpu-optimization huggingface huggingface-transformers langchain language-models large-language-models nlp openai python transformers
Last synced: 25 Jan 2025
https://github.com/karusb/2dca-cuda
2 Dimensional Cellular Automata Visualisation (Game of Life)
algorithm-flowchart cellular-automata cuda game game-of-life glut visual-studio
Last synced: 08 Jan 2025