Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

https://github.com/awrsha/cuda-gpus-and-triton-adcanced-review

This repository is a comprehensive guide to optimizing GPU kernel performance, with a focus on NVIDIA GPUs. It covers CUDA, PyTorch, and Triton, with the aim of improving computational efficiency in deep learning and scientific computing.

cuda-programming gpu-programming jit kernels matmul mojo-language multiprocessing multithreading torchquantum triton

Last synced: about 24 hours ago

Awesome Lists containing this project