Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists tagged with paged-attention
A curated list of projects in awesome lists tagged with paged-attention.
https://github.com/nickpotafiy/illama
A fast, lightweight, parallel inference server for Llama LLMs.
exllama exllamav2 flash-attention-2 inference llama llama2 llama3 llm-inference paged-attention server
Last synced: 10 Oct 2024
https://github.com/nptt9/illama
A fast, lightweight, parallel inference server for Llama LLMs.
exllama exllamav2 flash-attention-2 inference llama llama2 llama3 llm-inference paged-attention server
Last synced: 27 Oct 2024