https://github.com/interestingLSY/CUDA-From-Correctness-To-Performance-Code
Codes & examples for "CUDA - From Correctness to Performance"
- Host: GitHub
- URL: https://github.com/interestingLSY/CUDA-From-Correctness-To-Performance-Code
- Owner: interestingLSY
- License: apache-2.0
- Created: 2023-07-14T03:37:46.000Z (over 1 year ago)
- Default Branch: master
- Last Pushed: 2024-10-24T06:52:08.000Z (4 months ago)
- Last Synced: 2024-10-25T02:08:46.488Z (4 months ago)
- Language: C++
- Size: 44.9 KB
- Stars: 60
- Watchers: 2
- Forks: 15
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- awesome-cuda-triton-hpc - interestingLSY/CUDA-From-Correctness-To-Performance-Code: Codes & examples for "CUDA - From Correctness to Performance". The lecture can be found at [https://wiki.lcpu.dev/zh/hpc/from-scratch/cuda](https://wiki.lcpu.dev/zh/hpc/from-scratch/cuda). (Learning Resources)
README
# CUDA - From Correctness to Performance
This repo contains the code and examples for "CUDA - From Correctness to Performance".
The lecture is available at https://wiki.lcpu.dev/zh/hpc/from-scratch/cuda or in [lecture.md](lecture.md).
## How to Build
Make sure the CUDA Toolkit is installed and a CUDA-capable GPU is available.
Run `make all` to build this repo.
## How to Use
Usage:
```bash
./gemm_test [implementation]
```
If `implementation` is not specified, all implementations are benchmarked.
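
For readers new to GEMM, the operation that `gemm_test` benchmarks is a matrix product `C = A * B`. The sketch below is not taken from this repo; it is a minimal CPU reference, assuming row-major `float` matrices, of the kind of baseline a GPU kernel's output could be checked against. The names `gemm_reference` and `max_abs_diff` are hypothetical.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical CPU reference (not from the repo): C = A * B, where
// A is M x K, B is K x N, C is M x N, all stored row-major.
std::vector<float> gemm_reference(const std::vector<float>& A,
                                  const std::vector<float>& B,
                                  std::size_t M, std::size_t N, std::size_t K) {
    assert(A.size() == M * K && B.size() == K * N);
    std::vector<float> C(M * N, 0.0f);
    for (std::size_t i = 0; i < M; ++i) {
        for (std::size_t k = 0; k < K; ++k) {
            const float a = A[i * K + k];
            // The innermost loop walks B and C contiguously.
            for (std::size_t j = 0; j < N; ++j) {
                C[i * N + j] += a * B[k * N + j];
            }
        }
    }
    return C;
}

// Largest element-wise absolute difference between two equally sized
// buffers, e.g. a kernel's output versus the reference result.
float max_abs_diff(const std::vector<float>& X, const std::vector<float>& Y) {
    assert(X.size() == Y.size());
    float worst = 0.0f;
    for (std::size_t i = 0; i < X.size(); ++i) {
        worst = std::max(worst, std::fabs(X[i] - Y[i]));
    }
    return worst;
}
```

A result copied back from the GPU could then be compared with `max_abs_diff(gpu_result, gemm_reference(A, B, M, N, K))` against a small tolerance, since floating-point summation order differs between implementations.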