https://github.com/oneflow-inc/trt_flash_attention
https://github.com/oneflow-inc/trt_flash_attention
Last synced: 9 months ago
JSON representation
- Host: GitHub
- URL: https://github.com/oneflow-inc/trt_flash_attention
- Owner: Oneflow-Inc
- Created: 2022-12-29T11:11:19.000Z (over 3 years ago)
- Default Branch: main
- Last Pushed: 2023-01-13T06:52:35.000Z (over 3 years ago)
- Last Synced: 2024-11-08T18:12:02.189Z (over 1 year ago)
- Language: C++
- Size: 5.14 MB
- Stars: 4
- Watchers: 4
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# trt_flash_attention
From https://github.com/NVIDIA/TensorRT/tree/main/plugin/multiHeadFlashAttentionPlugin