https://github.com/vadimkantorov/fastmlp

[WIP] PyTorch bindings for cublasLt with an example of quantized i8f16 MLP
https://github.com/vadimkantorov/fastmlp

cublaslt mlp pytorch quantized-neural-networks

Last synced: 3 months ago
JSON representation

[WIP] PyTorch bindings for cublasLt with an example of quantized i8f16 MLP

Host: GitHub
URL: https://github.com/vadimkantorov/fastmlp
Owner: vadimkantorov
Created: 2023-08-23T15:52:57.000Z (over 1 year ago)
Default Branch: master
Last Pushed: 2023-08-23T15:59:40.000Z (over 1 year ago)
Last Synced: 2025-01-12T19:22:04.729Z (4 months ago)
Topics: cublaslt, mlp, pytorch, quantized-neural-networks
Homepage:
Size: 1000 Bytes
Stars: 0
Watchers: 2
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# fastmlp
[WIP] PyTorch bindings for cublasLt with an example of quantized i8f16 MLP

# References
- https://github.com/OpenBMB/cpm_kernels/blob/master/cpm_kernels/library/cublaslt.py