Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.


https://github.com/nptt9/illama

A fast, lightweight, parallel inference server for Llama LLMs.

Topics: exllama, exllamav2, flash-attention-2, inference, llama, llama2, llama3, llm-inference, paged-attention, server

Last synced: 11 days ago


Awesome Lists containing this project