An open API service indexing awesome lists of open source software.

https://github.com/fortunato777a/cutlass

CUTLASS 4.1.0 provides high-performance matrix-matrix multiplication (GEMM) in CUDA, with flexible abstractions for building custom linear algebra kernels. 🚀💻

blas cmake convolution cpp cublas cutlass deep-learning-library deepseek fcp final-cut-pro mlir nvim-plugin parallel-programming ptx python tensorrt tensorrt-llm vlm

Last synced: 8 months ago

