https://github.com/harleyszhang/llm_note
Notes on LLMs, covering model inference, transformer model structure, and LLM framework code analysis.
cuda-programming kv-cache llm llm-inference transformer-models triton-kernels vllm
- Host: GitHub
- URL: https://github.com/harleyszhang/llm_note
- Owner: harleyszhang
- Created: 2024-09-18T10:06:58.000Z (8 months ago)
- Default Branch: main
- Last Pushed: 2024-12-14T14:21:57.000Z (5 months ago)
- Last Synced: 2024-12-14T15:17:50.612Z (5 months ago)
- Topics: cuda-programming, kv-cache, llm, llm-inference, transformer-models, triton-kernels, vllm
- Language: Python
- Homepage:
- Size: 144 MB
- Stars: 266
- Watchers: 5
- Forks: 23
- Open Issues: 0
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
- awesome-llm-and-aigc - harleyszhang/llm_note