https://github.com/sgl-project/sgl-learning-materials
Materials for learning SGLang
- Host: GitHub
- URL: https://github.com/sgl-project/sgl-learning-materials
- Owner: sgl-project
- License: MIT
- Created: 2024-09-10T16:04:05.000Z (2 months ago)
- Default Branch: main
- Last Pushed: 2024-11-09T17:07:49.000Z (9 days ago)
- Last Synced: 2024-11-09T18:19:04.795Z (9 days ago)
- Size: 58 MB
- Stars: 77
- Watchers: 5
- Forks: 5
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
README
# Materials for learning SGLang
## Blog
[2024-09-04] [SGLang v0.3 Release: 7x Faster DeepSeek MLA, 1.5x Faster torch.compile, Multi-Image/Video LLaVA-OneVision](https://lmsys.org/blog/2024-09-04-sglang-v0-3/)
[2024-07-25] [Achieving Faster Open-Source Llama3 Serving with SGLang Runtime (vs. TensorRT-LLM, vLLM)](https://lmsys.org/blog/2024-07-25-sglang-llama3/)
[2024-02-05] [Fast JSON Decoding for Local LLMs with Compressed Finite State Machine](https://lmsys.org/blog/2024-02-05-compressed-fsm/)
[2024-01-17] [Fast and Expressive LLM Inference with RadixAttention and SGLang](https://lmsys.org/blog/2024-01-17-sglang/)
## Slides
### GPU MODE
[2024-11-10] [SGLang Performance Optimization](slides/SGLang-Performance-Optimization-YinengZhang.pdf)
### The first LMSYS online meetup: Efficient LLM Deployment and Serving
[2024-10-16] [SGLang Overview & CPU Overhead Hiding](slides/lmsys_1st_meetup_sglang.pdf)
[2024-10-16] [Faster Constrained Decoding](slides/lmsys_1st_meetup_constrained_decoding.pdf)
[2024-10-16] [SGLang DeepSeek MLA](slides/lmsys_1st_meetup_deepseek_mla.pdf)
[2024-10-16] [Universal LLM deployment and low-latency serving in MLC LLM](slides/lmsys_1st_meetup_mlcengine.pdf)
[2024-10-16] [XGrammar: Flexible And Efficient Structured Generation Engine for Large Language Models](slides/lmsys_1st_meetup_xgrammar.pdf)
[2024-10-16] [Review of the first LMSYS online meetup: Efficient LLM Deployment and Serving](blogs/Efficient%20LLM%20Deployment%20and%20Serving.md)
### AMD Advancing AI 2024
[2024-10-10] [Efficient LLM Inference with SGLang](slides/amd_dev_day_v2.pdf)
### SGLang Biweekly Meeting
[2024-11-02] [Quantization on AMD](slides/sglang-fp8-mxfp-quantizations.pdf)
[2024-10-05] [SGLang Double Sparsity](https://docs.google.com/presentation/d/1ibXDY7hjsk12QYcLeDpsFDYGlNvbkEF1XBgQJ-CqRDE)
[2024-09-21] [SGLang DeepSeek MLA](https://docs.google.com/presentation/d/1wB_Ul0LZwIDL47qFl64b8hVhH1_ya-1YPAPSSv0cKMs)
### Other
[SGLang v0.2: Faster Interface and Runtime for LLM Inference](slides/sglang_v0_2.pdf)
## Videos
Follow our YouTube [channel](https://www.youtube.com/@lmsys-org) for new videos.
### GPU MODE
[2024-11-10] [SGLang Performance Optimization](https://www.youtube.com/watch?v=XQylGyG7yp8)
### The first LMSYS online meetup
[2024-10-16] [The First SGLang Online Meetup](https://www.youtube.com/watch?v=_mzKptPj0hE)
### SGLang Biweekly Meeting
[2024-11-03] [SGLang Developer Sync 20241103](https://www.youtube.com/watch?v=k8Jo6-XcStM)
[2024-10-19] [SGLang Developer Sync 20241019](https://www.youtube.com/watch?v=wEUxk2WrMg8)
[2024-10-05] [SGLang Developer Sync 20241005](https://www.youtube.com/watch?v=GdLE5nd2nJo)
[2024-09-21] [SGLang Developer Sync 20240921](https://www.youtube.com/watch?v=bPtM-LLlCCU)
## Paper
[NeurIPS 24] [SGLang: Efficient Execution of Structured Language Model Programs](https://arxiv.org/abs/2312.07104)
## Documentation
[SGLang Documentation](https://sgl-project.github.io/)