Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/zalo/matmul_cuda

A simple learning example for CUDA
https://github.com/zalo/matmul_cuda

cuda

Last synced: 29 days ago
JSON representation

A simple learning example for CUDA

Host: GitHub
URL: https://github.com/zalo/matmul_cuda
Owner: zalo
License: mit
Created: 2022-08-04T03:31:38.000Z (over 2 years ago)
Default Branch: main
Last Pushed: 2022-08-19T19:00:44.000Z (over 2 years ago)
Last Synced: 2024-11-14T22:31:44.373Z (3 months ago)
Topics: cuda
Language: Cuda
Homepage:
Size: 55.7 KB
Stars: 0
Watchers: 2
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

# matmul_cuda
A simple learning example for CUDA.

Take in a `.bin` file specifying a series of `int32 rows, int32 cols, int64_t* matrixContents` matrices and multiply them together as quickly as possible. Save the resultant matrix as `output.bin` in the same format.

On Windows, this application compiles with:
```
nvcc -o matmul_cuda_v2 matmul_cuda.cu -ccbin "C:\Program Files (x86)\Microsoft Visual Studio\2019\Community\VC\Tools\MSVC\14.29.30133\bin\Hostx64\x64" -O 3 -arch=all --extra-device-vectorization
```

On Linux, this application compiles with:
```
nvcc -o matmul_cuda_v2 matmul_cuda.cu -O 3 -arch=all --extra-device-vectorization
```