https://github.com/Qcompiler/MixQ_Tensorrt_LLM
Mixed precision inference by Tensorrt-LLM
https://github.com/Qcompiler/MixQ_Tensorrt_LLM
Last synced: 12 months ago
JSON representation
Mixed precision inference by Tensorrt-LLM
- Host: GitHub
- URL: https://github.com/Qcompiler/MixQ_Tensorrt_LLM
- Owner: Qcompiler
- Created: 2024-08-16T06:13:35.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2024-10-21T15:36:33.000Z (over 1 year ago)
- Last Synced: 2024-10-22T02:07:35.468Z (over 1 year ago)
- Language: C++
- Size: 58.5 MB
- Stars: 85
- Watchers: 13
- Forks: 18
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
- StarryDivineSky - Qcompiler/MixQ_Tensorrt_LLM