Projects in Awesome Lists by SqueezeAILab
A curated list of projects in awesome lists by SqueezeAILab.
https://github.com/SqueezeAILab/LLMCompiler
[ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling
efficient-inference function-calling large-language-models llama llama2 llm llm-agent llm-agents llm-framework llms natural-language-processing nlp parallel-function-call transformer
Last synced: 16 Apr 2025
https://github.com/squeezeailab/squeezellm
[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization
efficient-inference large-language-models llama llm localllm model-compression natural-language-processing post-training-quantization quantization small-models text-generation transformer
Last synced: 13 Apr 2025
https://github.com/squeezeailab/kvquant
[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization
compression efficient-inference efficient-model large-language-models llama llm localllama localllm mistral model-compression natural-language-processing quantization small-models text-generation transformer
Last synced: 07 Apr 2025
https://github.com/squeezeailab/llm2llm
[ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement
data-augmentation llama llama2 llm llms natural-language-processing nlp synthetic-dataset-generation transformer
Last synced: 05 Dec 2024
https://github.com/squeezeailab/open_source_projects
Open Source Projects from Pallas Lab
Last synced: 27 Mar 2025
https://github.com/squeezeailab/squeezedattention
SQUEEZED ATTENTION: Accelerating Long Prompt LLM Inference
Last synced: 05 Dec 2024
https://github.com/squeezeailab/ets
ETS: Efficient Tree Search for Inference-Time Scaling
Last synced: 11 Mar 2025