https://github.com/MegEngine/InferLLM
a lightweight LLM model inference framework
- Host: GitHub
- URL: https://github.com/MegEngine/InferLLM
- Owner: MegEngine
- License: apache-2.0
- Created: 2023-05-04T05:40:17.000Z (about 2 years ago)
- Default Branch: main
- Last Pushed: 2024-04-07T01:57:37.000Z (about 1 year ago)
- Last Synced: 2025-03-12T12:13:02.643Z (2 months ago)
- Topics: deeplearning, inference, llm, mobile
- Language: C++
- Homepage:
- Size: 4.91 MB
- Stars: 717
- Watchers: 10
- Forks: 91
- Open Issues: 42
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- awesome-llm-and-aigc - MegEngine/InferLLM
- awesome-cuda-and-hpc - MegEngine/InferLLM