Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/NVIDIA/cutlass
CUDA Templates for Linear Algebra Subroutines
https://github.com/NVIDIA/cutlass
cpp cuda deep-learning deep-learning-library gpu nvidia
Last synced: about 2 months ago
JSON representation
CUDA Templates for Linear Algebra Subroutines
- Host: GitHub
- URL: https://github.com/NVIDIA/cutlass
- Owner: NVIDIA
- License: other
- Created: 2017-11-30T00:11:24.000Z (about 7 years ago)
- Default Branch: main
- Last Pushed: 2024-10-23T18:24:10.000Z (about 2 months ago)
- Last Synced: 2024-10-24T09:25:41.665Z (about 2 months ago)
- Topics: cpp, cuda, deep-learning, deep-learning-library, gpu, nvidia
- Language: C++
- Homepage:
- Size: 43.6 MB
- Stars: 5,548
- Watchers: 107
- Forks: 947
- Open Issues: 188
-
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- License: LICENSE.txt
- Citation: CITATION.cff
Awesome Lists containing this project
- awesome-gemm - NVIDIA CUTLASS - 3-Clause`](https://github.com/NVIDIA/cutlass/blob/main/LICENSE.txt) (Libraries / GPU Libraries)
- CUDA-Guide - CUTLASS - performance matrix-multiplication (GEMM) at all levels and scales within CUDA. It incorporates strategies for hierarchical decomposition and data movement similar to those used to implement cuBLAS. (CUDA Tools)
- NLP-Guide - CUTLASS - performance matrix-multiplication (GEMM) at all levels and scales within CUDA. It incorporates strategies for hierarchical decomposition and data movement similar to those used to implement cuBLAS. (CUDA Tools Libraries, and Frameworks)
- LiDAR-Guide - CUTLASS - performance matrix-multiplication (GEMM) at all levels and scales within CUDA. It incorporates strategies for hierarchical decomposition and data movement similar to those used to implement cuBLAS. (CUDA Tools)
- Vulkan-Guide - CUTLASS - performance matrix-multiplication (GEMM) at all levels and scales within CUDA. It incorporates strategies for hierarchical decomposition and data movement similar to those used to implement cuBLAS. (CUDA Tools Libraries, and Frameworks)
- Deep-Learning-Guide - CUTLASS - performance matrix-multiplication (GEMM) at all levels and scales within CUDA. It incorporates strategies for hierarchical decomposition and data movement similar to those used to implement cuBLAS. (CUDA Tools Libraries, and Frameworks)
- Autonomous-Systems-Guide - CUTLASS - performance matrix-multiplication (GEMM) at all levels and scales within CUDA. It incorporates strategies for hierarchical decomposition and data movement similar to those used to implement cuBLAS. (CUDA Tools Libraries, and Frameworks)
- MATLAB-Guide - CUTLASS - performance matrix-multiplication (GEMM) at all levels and scales within CUDA. It incorporates strategies for hierarchical decomposition and data movement similar to those used to implement cuBLAS. (CUDA Tools Libraries, and Frameworks)
- CNT-Guide - CUTLASS - performance matrix-multiplication (GEMM) at all levels and scales within CUDA. It incorporates strategies for hierarchical decomposition and data movement similar to those used to implement cuBLAS. (CUDA Tools Libraries, and Frameworks)
- awesome-cuda-and-hpc - CUTLASS
- awesome-cuda-and-hpc - CUTLASS