Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/deftruth/cuda-learn-notes
📚200+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).
https://github.com/deftruth/cuda-learn-notes
cuda gemm gemv hgemm
Last synced: 5 days ago
JSON representation
📚200+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).
- Host: GitHub
- URL: https://github.com/deftruth/cuda-learn-notes
- Owner: DefTruth
- License: gpl-3.0
- Created: 2022-12-17T08:19:52.000Z (about 2 years ago)
- Default Branch: main
- Last Pushed: 2025-01-27T09:30:41.000Z (8 days ago)
- Last Synced: 2025-01-30T18:49:08.844Z (5 days ago)
- Topics: cuda, gemm, gemv, hgemm
- Language: Cuda
- Homepage:
- Size: 221 MB
- Stars: 2,172
- Watchers: 17
- Forks: 229
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE