https://github.com/zhzihao/QPruningKV

More Tokens, Lower Precision: Towards the Optimal Token-Precision Trade-off in KV Cache Compression
Last synced: about 1 year ago
