Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

CUDA

CUDA® is a parallel computing platform and programming model developed by NVIDIA for general computing on graphics processing units (GPUs). With CUDA, developers can dramatically speed up computing applications by harnessing the power of GPUs.

https://github.com/drilonaliu/parallel-mandelbrot-set

GPU-accelerated Mandelbrot Set generation with CUDA and OpenGL interoperability.

cuda fractals gpu mandelbrot-fractal parallel-programming

Last synced: 26 Jan 2025

https://github.com/tyler-hilbert/cuda-kmeans

K-Means written from scratch in CUDA

cuda kmeans-clustering machine-learning nsight

Last synced: 05 Feb 2025

https://github.com/noorkhokhar99/how-to-setup-nvidia-gpu-for-object-detection-installing-cuda-toolkit-and-cudnn

How to set up an NVIDIA GPU for object detection | Installing the CUDA Toolkit and cuDNN

computer cuda nividia opencv python roboflow vision

Last synced: 05 Feb 2025

https://github.com/codename-detective/cuda_gpgpus_shared_memory_systems_pdp

CUDA GPGPUs Shared Memory Systems Parallel & Distributed Programming

cuda cuda-programming numa parallel-programming

Last synced: 05 Feb 2025

https://github.com/fedesky25/hpc-project-2024

Project for the 2024 HPC course: a streamplot generator for complex-valued functions

complex-numbers cuda openmp

Last synced: 05 Feb 2025

https://github.com/efecaliskannn/pneumonia-detection-with-cnn--vgg16--and-resnet50-deep-learning-models

This project aims to detect pneumonia using deep learning, a subset of artificial intelligence. It examines the performance of deep learning models, including CNN, VGG16, and ResNet50, in detecting pneumonia.

artificial-intelligence convolutional-neural-networks cuda deep-learning keras-tensorflow nvidia-cuda pyhton transfer-learning

Last synced: 05 Feb 2025

https://github.com/fatlipp/toyslam

SLAM implementation from scratch w/o external graph optimization libs

cuda gpu lidar-slam mapping odometry robotics slam

Last synced: 05 Feb 2025

https://github.com/h1me01/cuda-neural-network

CUDA version of my previous AVX-512 based Neural Network. (Still in development)

chess cuda cuda-programming neural-network neural-networks-from-scratch

Last synced: 05 Feb 2025

https://github.com/r00tens/text-classifier

Naive Bayes classifier for text classification with CPU and GPU (CUDA)

classification classifier cpp cuda machine-learning naive-bayes

Last synced: 05 Feb 2025

https://github.com/mrtejas/cv-sandbox

A collection of computer vision mini-projects tuned for a number of tasks, including face detection, object detection, image segmentation, and CLIP. The models are trained on popular datasets, and the collection includes a comparative study of the methods. Done as part of the S24 course Computer Vision at IIIT Hyd.

computer-vision cuda ml opencv pytorch yolo

Last synced: 05 Feb 2025

https://github.com/neugence/acehub

AI Champions for Excellence: Fresh, informative courses and content designed to help developers, researchers, and leaders advance in the field of AI.

ai cuda cv ml mlops nlp pytorch rl rlhf tensorflow

Last synced: 17 Feb 2025

https://github.com/eastonman/tensorrt-pytorch-wrapper

A wrapper that lets a TensorRT engine accept PyTorch CUDA tensors.

cuda pytorch tensorrt

Last synced: 18 Jan 2025

https://github.com/qanastek/concurency-tetravex

This software is a fast and reliable Tetravex solver based on C++ and CUDA.

c-plus-plus cuda parrallel-computing tetravex

Last synced: 18 Jan 2025

https://github.com/seieric/gst-dsobjectsmask

📀 NVIDIA DeepStream-integrated GStreamer plugin. Masks objects with CUDA cores on Jetson boards. Fast and smooth since everything is done on NVMM. 🏎

cuda cuda-programming deepstream gpu gstreamer gstreamer-plugins instance-segmentation jetson-agx-orin jetson-agx-xavier jetson-tx1 jetson-tx2 jetson-xavier maskrcnn nvidia-jetson nvidia-jetson-nano opencv opencv4 resnet resnet50

Last synced: 29 Jan 2025

https://github.com/llm-db/understanding-gpu-architecture-implications-on-llm-serving-workloads

Understanding GPU Architecture Implications on LLM Serving Workloads (Master Thesis, ETH Zürich, 2024)

cuda inference pytorch rocm transformer

Last synced: 14 Dec 2024

https://github.com/fattorib/thunderkittens-simple-gemm

Simple Tensorcore GEMM in ThunderKittens

cuda gemm gpu thunderkittens

Last synced: 16 Feb 2025

https://github.com/hshindo/libcuda.jl

CUDA GPU array for Julia

cuda gpu julia

Last synced: 09 Jan 2025

https://github.com/rssr25/cuda

Following the book CUDA by Example.

cpp cuda cuda-programming hpc shaders

Last synced: 15 Feb 2025

https://github.com/sebftw/interp2gpu

GPU-accelerated 2D spline interpolation, à la interp2(..., "spline"), in MATLAB.

cuda gpu gpu-acceleration matlab spline spline-interpolation

Last synced: 14 Dec 2024

https://github.com/storterald/neural-network

Simple neural network implementation in C++ and CUDA

asm asmx86 c-plus-plus cmake cpp cuda machine-learning neural-network

Last synced: 02 Feb 2025

https://github.com/githubfoam/cuda-travisci

cuda miniconda pytorch

cuda miniconda pytroch

Last synced: 05 Feb 2025

https://github.com/jonastoth/cuda_raytracer

University project to implement a basic Raytracer in CUDA

cpp14 cuda raytracer

Last synced: 02 Feb 2025

https://github.com/bniladridas/text-to-image

This project enables you to generate images from text descriptions using the powerful Stable Diffusion model.

ai computer-vision cuda diffusion-models generative-ai python tkinter transformers

Last synced: 20 Nov 2024

https://github.com/drilonaliu/parallel-sierpinski-triangle

GPU-accelerated Sierpinski Triangle generation with CUDA and OpenGL interoperability.

cuda fractals gpu parallel-programming sierpinski-triangle

Last synced: 26 Jan 2025

https://github.com/naetherm/derelictcurand

Dynamic bindings to the CuRAND library for the D Programming Language.

cuda curand d derelict dlang

Last synced: 01 Feb 2025

https://github.com/adesoji1/youtubesummaryai

Python script for YouTube summaries. The service should summarize a YouTube video given its URL. It should work for long videos and for different languages.

cuda googleapi python3 speech-recognition transformers youtube-api-v3 youtube-dl

Last synced: 10 Feb 2025

https://github.com/naetherm/derelictcublas

Dynamic bindings to the CuBLAS library for the D Programming Language.

cublas cuda d derelict dlang

Last synced: 01 Feb 2025

https://github.com/tier4/autoware_nova_carter

Integration of NVIDIA Nova Carter with Autoware

amr autonomous-driving autoware cuda nvidia nvidia-jetson ros2

Last synced: 17 Feb 2025

https://github.com/thesoenke/deeplearning-docker

Setup for Deep Learning experiments in Docker with Cuda

cuda docker fastai jupyter

Last synced: 05 Feb 2025

https://github.com/zyn10/cuda_code

CUDA practice

cuda cuda-programming

Last synced: 19 Jan 2025

https://github.com/f-koehler/itesol

WIP: Iterative eigensolvers for C++20, Python and CUDA

cpp20 cuda eigenvalues linear-algebra python

Last synced: 18 Feb 2025

https://github.com/aaaastark/nvidia-cuda-google-colab

Deployment of NVIDIA CUDA on Google Colab, with example code (vector addition and matrix multiplication).

c cpp cuda googlecolab googlecolaboratory matrix-multiplication nvidia python vector-addition

Last synced: 15 Jan 2025

https://github.com/dreamjet31/licence_plate_detection

Automated License Plate recognition system

cuda opencv python pytorch ultralytics yolov8

Last synced: 10 Feb 2025

https://github.com/dongskie43/nlp-engineering-hub

📚 Enterprise NLP systems and LLM applications. Features custom language model implementations, distributed training pipelines, and efficient inference systems. 🔤

cuda gpu-optimization huggingface huggingface-transformers langchain language-models large-language-models nlp openai python transformers

Last synced: 03 Feb 2025

https://github.com/tomtolleson/cuda-kernel-benchmarking-tool

A benchmarking tool in C++ that creates CUDA kernels and compares overall system performance between CPU and GPU

cuda cuda-kernels cuda-support cuda-toolkit nvidia nvidia-cuda nvidia-gpu

Last synced: 05 Feb 2025

https://github.com/grindelfp/cuda-texture-memory

Exercise on using texture memory in CUDA.

cuda texture-memory

Last synced: 05 Feb 2025

https://github.com/codingrule/cuda-mbrot

Just another Mandelbrot with CUDA

cuda cuda-toolkit cupy fractal mandelbrot mathematics nvidia

Last synced: 05 Feb 2025

https://github.com/mohammadshabazuddin/text_to_speech_generation_with_llm_with_hugging_face

Build a text-to-speech generation system using LLMs and Hugging Face to convert text into natural audio speech.

cuda huggingface-transformers llms nlp

Last synced: 05 Feb 2025

https://github.com/thesupercd/cuda_sort

A simple project that implements massively parallel algorithms (radix sort) on an NVIDIA GPU and measures their runtime performance.

benchmarking c cpp cuda cuda-programming gpu-acceleration gpu-programming multithreading parallel-processing radix-sort sorting-algorithms

Last synced: 05 Feb 2025

https://github.com/kichappa/videosift

CUDA based 3D Computer Vision for Exoskins

computer-vision convolution-filter cuda hpc julia sift-algorithm

Last synced: 29 Jan 2025

https://github.com/m-torhan/cuda-fractals

CUDA C++ implementation of Fractals visualization

cuda

Last synced: 31 Dec 2024

https://github.com/notkartikye/cuda-image-box-filters

🖼️ CUDA-powered tool for applying box filters to a large amount of images

cuda cuda-library cuda-programming npp

Last synced: 16 Feb 2025

https://github.com/drtey/cuda-zero

CUDA Programming

c cpp cuda makefile

Last synced: 13 Jan 2025

https://github.com/university-project-repos/reinforcementlearninggym

Small university project for experimentally training and evaluating reinforcement learning agents in OpenAI Gym environments using DQN.

acrobot ai anaconda atari breakout cartpole cnn cuda deep-learning dqn gym machine-learning mountain-car openai pong python pytorch q-learning reinforcement-learning university-project

Last synced: 29 Jan 2025

https://github.com/sid911/neuralnetworkcpp

A small experiment to learn about neural networks and their runtimes in cpp

cpp cuda machine-learning neural-network

Last synced: 14 Jan 2025

https://github.com/aespinosadev/basicrenderer

Basic rendering engine with advanced graphics techniques that I'm currently working on.

cplusplus cuda glsl graphics graphics-engine graphics-programming open opengl renderer rendering rendering-3d-graphics rendering-engine shaders

Last synced: 23 Jan 2025

https://github.com/matrixji/annb

Approximate Nearest Neighbor Benchmark

anns benchmarks cuda gpu

Last synced: 23 Jan 2025

https://github.com/avicted/hip_fm_synthesis

This project demonstrates FM (frequency modulation) synthesis using HIP (Heterogeneous-computing Interface for Portability), enabling high-performance sound generation on both AMD and NVIDIA GPUs.

amd audio-processing cuda fm-synthesis hip nvidia rocm

Last synced: 23 Jan 2025

https://github.com/not-ml/ml-3

A PyTorch-based Convolutional Neural Network (CNN) for image classification using the CIFAR-10 dataset, featuring advanced architecture, data augmentation, GPU support, and dynamic learning rate scheduling.

ai cifar10 cnn cuda gpu image-classification machine-learning modeltraining python pytorch torchvision

Last synced: 23 Jan 2025

https://github.com/nel-s/vein-cracker

Recovers which internal generator states could have generated a provided set of Minecraft Java b1.6-1.12.2 veins. Those can then be used to recover 3/4ths of any worldseeds that could have generated them.

cuda minecraft seedcracking veins

Last synced: 23 Jan 2025

https://github.com/i-m-iron-man/abmax

Abmax is an agent-based modelling framework in Jax, focused on dynamic population size

abm agent agent-based agent-based-modeling agent-based-simulation agents cuda jax python

Last synced: 13 Jan 2025

https://github.com/xueeinstein/udacity-cs344-cuda8

Code for Udacity CS344 (Intro to Parallel Programming) using CUDA 8.0

cuda cuda-8 parallel-computing

Last synced: 17 Feb 2025

https://github.com/sergeipapina/color2graycuda

Color-to-gray image conversion implemented as an NVIDIA CUDA kernel, using make or CMake to compile and link

cmake cuda cuda-kernels cuda-programming link makefile nvidia

Last synced: 12 Feb 2025

https://github.com/gama1903/cuda_programming

CUDA programming practice

cuda parallel-computing

Last synced: 17 Feb 2025

https://github.com/kar-dim/fidelityfx-cas-cuda

Implementation of the AMD FidelityFX CAS (Contrast Adaptive Sharpening) algorithm on CUDA, for sharpening static images.

cpp cuda dll fidelityfx gpu image-processing parallel-computing sharpen

Last synced: 17 Feb 2025

https://github.com/hit07/ml-dl-torch

This repository covers machine learning and deep learning using PyTorch

computer-vision convolutional-neural-networks cuda neural-networks pytorch

Last synced: 11 Jan 2025

https://github.com/kis-balazs/cuda-research

CUDA Research & Code. Course-style structured. Inspiration from @Infatoshi.

cuda

Last synced: 17 Feb 2025

https://github.com/theodoreai/montecuda

CUDA application for Monte Carlo simulation, which determines the range of outcomes for a series of parameters, each of which has a probability distribution showing how likely each option is to happen.

cuda gpu-computing monte-carlo-simulation parallel-computing

Last synced: 25 Jan 2025

https://github.com/andreasholt/cuda-matmul-benchmarking

Implementing and benchmarking various matmul implementations in CUDA

cuda matrix-multiplication

Last synced: 17 Feb 2025

https://github.com/programmergnome/kutyai

A graphical dog breed recognizer application written in Python, with 420 breeds and 42,000 images.

cuda deep-learning image-classification python3 qt5-gui tensorflow transfer-learning

Last synced: 22 Jan 2025

https://github.com/lanceberge/cuda-newton-fractals

Parallelize and visualize the Newton Iteration

cpp cuda mathematical-modelling visualization

Last synced: 29 Jan 2025

https://github.com/cmazakas/cuda-stuff

A CUDA-based playground

cmake cuda delaunay-triangulation vscode

Last synced: 29 Jan 2025

https://github.com/abhiram-kandiyana/cuda-blast-2024

Reimplementation of NCBI BLAST with CUDA backend for faster retrieval

blast cuda gpu-acceleration parallel-processing

Last synced: 21 Jan 2025

https://github.com/yangfengzzz/tardis

Travel space and time by using autodiff and codegen

autodiff codegen cuda

Last synced: 09 Feb 2025

https://github.com/elprofesoriqo/ml-optimizer

Python library designed to revolutionize machine learning workflows by automating data preprocessing, tensor optimization, and model selection.

api-rest cuda imagesearch machine-learning machine-learning-algorithms numpy-arrays python pytorch tensor

Last synced: 31 Dec 2024

https://github.com/jakubfr4czek/concurrent-gauss-elimination

Concurrent Gaussian elimination algorithm implemented using trace theory. Parallelism is achieved using CUDA cores.

agh agh-ust agh-wi conda cuda cuda-kernels cuda-toolkit diekert-graph graphviz java python python3 traces-theory

Last synced: 31 Dec 2024

https://github.com/timdev-r/cv-ground-truth-extraction

(Dump) Helper for ground truth extraction, movement analytics and silhouette visual demonstration

computer-vision cuda ground-truth intel-realsense pandas python

Last synced: 21 Jan 2025

https://github.com/mvishiu11/kmeans-clustering

K-Means Clustering with both GPU (CUDA) and CPU implementations

cuda kmeans-clustering

Last synced: 21 Jan 2025

https://github.com/sonhm3029/setup-experience

This project stores my setup experience and the errors I met and solved while developing end-to-end AI and software projects

ai computer-vision cuda deep-learning software

Last synced: 17 Jan 2025

https://github.com/kabir5296/deep-learning-setup-for-ubuntu-guide

CUDA, cuDNN, NVIDIA driver, and PyTorch installation for Ubuntu

cuda cudnn deeplearning nlp python pytorch

Last synced: 21 Jan 2025

https://github.com/sbstndb/nbody_k

A simple 3D naïve NBody simulation using Kokkos enabling CUDA or OpenMP backend

cuda kokkos nbody openmp simulation

Last synced: 23 Jan 2025

https://github.com/snandasena/cuda-at-scale-for-the-enterprise

Gauss Filter with CUDA and NPP

cpp cuda gpu nvidia

Last synced: 01 Feb 2025

https://github.com/sid911/scions_old

A small, fast and easy to use Machine Learning framework for edge

cpp cuda library machine-learning

Last synced: 14 Jan 2025

https://github.com/ncorgan/arrayfire-config-info

A small command-line utility that outputs all available ArrayFire devices

arrayfire cuda gpu opencl

Last synced: 13 Feb 2025

https://github.com/amypad/miutil

Basic functionality needed for AMYPAD

cuda matlab medical-imaging python

Last synced: 31 Oct 2024

https://github.com/mattjesc/federated-learning-simulation-1gpu-mi-is

Federated Learning Simulation on a Single GPU with Model Interpretability and Interactive Visualization

ai cuda deep-learning distributed-systems federated-learning gpu hpc keras machine-learning ml model-interpretability python pytorch simulation streamlit tensorflow

Last synced: 17 Feb 2025

https://github.com/shineiarakawa/particle-stabilizer

A C++ and CUDA-based program for simulating the motion of particles.

cpp cuda n-body particles

Last synced: 13 Jan 2025

https://github.com/occisor2/fluidsimulation

Second project of my parallel algorithms course

cuda high-performance-computing

Last synced: 11 Jan 2025

https://github.com/hdelan/msc-hpc-final-project

In this project I implement a CUDA Lanczos method to approximate the matrix exponential. The matrix exponential is an important centrality measure for large, sparse graphs.

cuda graph-algorithms krylov-methods

Last synced: 16 Feb 2025

https://github.com/michaelfranzl/image_fah-client

Dockerfile for Folding@home client with AMD and Nvidia GPGPU support

container cuda debian docker foldingathome gpu-computing opencl

Last synced: 21 Jan 2025

https://github.com/voltr0x/raytracing-cuda

Raytracing in a weekend using CUDA

cpp11 cuda raytracing sdl2

Last synced: 20 Jan 2025

https://github.com/edumucelli/build-tensorflow

Build Tensorflow from source using a Dockerfile

cuda cudnn docker tensorflow

Last synced: 16 Feb 2025