Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/awrsha/cuda-gpus-and-triton-adcanced-review
This repository provides a comprehensive guide to optimizing GPU kernels for performance, with a focus on NVIDIA GPUs. It covers key tools and techniques such as CUDA, PyTorch, and Triton, aimed at improving computational efficiency for deep learning and scientific computing tasks.
https://github.com/awrsha/cuda-gpus-and-triton-adcanced-review
cuda-programming gpu-programming jit kernels matmul mojo-language multiprocessing multithreading torchquantum triton
Last synced: about 24 hours ago
JSON representation
This repository provides a comprehensive guide to optimizing GPU kernels for performance, with a focus on NVIDIA GPUs. It covers key tools and techniques such as CUDA, PyTorch, and Triton, aimed at improving computational efficiency for deep learning and scientific computing tasks.
- Host: GitHub
- URL: https://github.com/awrsha/cuda-gpus-and-triton-adcanced-review
- Owner: Awrsha
- Created: 2024-11-11T20:47:14.000Z (2 months ago)
- Default Branch: master
- Last Pushed: 2024-11-13T15:38:57.000Z (2 months ago)
- Last Synced: 2024-11-13T16:18:43.756Z (2 months ago)
- Topics: cuda-programming, gpu-programming, jit, kernels, matmul, mojo-language, multiprocessing, multithreading, torchquantum, triton
- Language: Cuda
- Homepage:
- Size: 25.1 MB
- Stars: 2
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md