https://github.com/jundaf2/cutlass-kernel-volta-gemm

volta fp16 gemm kernel

# cutlass-kernel-volta-gemm
A CUTLASS-based, kernel-level FP16 GEMM for the NVIDIA Volta architecture.

## Dependencies
- pytorch
- pytest

## Notes
Tiling is done in two levels: first the thread-block tiles are laid out, then the warp tiles within each block.
- Block
  - BM = 64
  - BN = 64
- Warp
  - WM = 16
  - WN = 16

NUM_WARPS = (BM × BN) / (WM × WN) = (64 × 64) / (16 × 16) = 16

NUM_THREADS_PER_CTA = WARP_SIZE × NUM_WARPS = 32 × 16 = 512
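
The arithmetic above can be checked with a short Python sketch. The tile sizes and `WARP_SIZE` come from these notes. The helpers `grid_dims` and `warp_tile_origin` are hypothetical names introduced for illustration, and the row-major warp layout is an assumption, since the notes do not say how warps are ordered inside a block.

```python
# Tile sizes taken from the notes above.
BM, BN = 64, 64      # thread-block (CTA) tile
WM, WN = 16, 16      # warp tile
WARP_SIZE = 32

# One warp per warp tile inside the block tile.
WARPS_M = BM // WM   # warp tiles along M
WARPS_N = BN // WN   # warp tiles along N
NUM_WARPS = WARPS_M * WARPS_N
NUM_THREADS_PER_CTA = WARP_SIZE * NUM_WARPS


def grid_dims(m: int, n: int):
    """CTAs needed along M and N for an m x n output (ceiling division).

    Hypothetical helper; the actual launch code may differ.
    """
    return ((m + BM - 1) // BM, (n + BN - 1) // BN)


def warp_tile_origin(warp_id: int):
    """(row, col) offset of a warp's tile inside its block tile.

    Assumes a row-major warp layout, which the notes do not specify.
    """
    return ((warp_id // WARPS_N) * WM, (warp_id % WARPS_N) * WN)


if __name__ == "__main__":
    print(NUM_WARPS, NUM_THREADS_PER_CTA)
    print(grid_dims(128, 256))
    print(warp_tile_origin(5))
```

With these tile sizes a block uses 512 threads, which is within the CUDA limit of 1024 threads per block.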