An open API service indexing awesome lists of open source software.

https://github.com/llmsresearch/specstream

Fast LLM inference with 2.8x speedup using speculative decoding
https://github.com/llmsresearch/specstream

inference largelanguagemodel llms speculative-decoding

Last synced: 27 days ago
JSON representation

Fast LLM inference with 2.8x speedup using speculative decoding

Awesome Lists containing this project