https://github.com/fortunato777a/cutlass
CUTLASS 4.1.0 offers high-performance matrix-matrix multiplication in CUDA, with flexible abstractions for custom kernels. Perfect for efficient linear algebra. 🚀💻
https://github.com/fortunato777a/cutlass
blas cmake convolution cpp cublas cutlass deep-learning-library deepseek fcp final-cut-pro mlir nvim-plugin parallel-programming ptx python tensorrt tensorrt-llm vlm
Last synced: 8 months ago
JSON representation
CUTLASS 4.1.0 offers high-performance matrix-matrix multiplication in CUDA, with flexible abstractions for custom kernels. Perfect for efficient linear algebra. 🚀💻
- Host: GitHub
- URL: https://github.com/fortunato777a/cutlass
- Owner: Fortunato777a
- License: other
- Created: 2025-07-14T21:34:33.000Z (9 months ago)
- Default Branch: main
- Last Pushed: 2025-07-15T02:39:43.000Z (9 months ago)
- Last Synced: 2025-07-15T04:44:10.838Z (9 months ago)
- Topics: blas, cmake, convolution, cpp, cublas, cutlass, deep-learning-library, deepseek, fcp, final-cut-pro, mlir, nvim-plugin, parallel-programming, ptx, python, tensorrt, tensorrt-llm, vlm
- Language: C++
- Size: 41.9 MB
- Stars: 0
- Watchers: 0
- Forks: 0
- Open Issues: 0