An open API service indexing awesome lists of open source software.

awesome-llm-inference

📚A curated list of Awesome LLM/VLM Inference Papers with codes: WINT8/4, FlashAttention, PagedAttention, MLA, Parallelism etc.
https://github.com/deftruth/awesome-llm-inference

Last synced: about 3 hours ago
JSON representation

Programming Languages
Sub Categories