https://github.com/harleyszhang/llm_note
LLM notes, including model inference, transformer model structure, and llm framework code analysis notes.
https://github.com/harleyszhang/llm_note
cuda-programming kv-cache llm llm-inference transformer-models triton-kernels vllm
Last synced: about 2 months ago
JSON representation
LLM notes, including model inference, transformer model structure, and llm framework code analysis notes.
- Host: GitHub
- URL: https://github.com/harleyszhang/llm_note
- Owner: harleyszhang
- Created: 2024-09-18T10:06:58.000Z (about 1 year ago)
- Default Branch: main
- Last Pushed: 2025-08-08T09:36:11.000Z (2 months ago)
- Last Synced: 2025-08-08T11:29:37.457Z (2 months ago)
- Topics: cuda-programming, kv-cache, llm, llm-inference, transformer-models, triton-kernels, vllm
- Language: Python
- Homepage:
- Size: 185 MB
- Stars: 807
- Watchers: 9
- Forks: 89
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
- awesome-llm-and-aigc - harleyszhang/llm_note