https://github.com/lyogavin/Anima
Moved to here: https://github.com/lyogavin/airllm
https://github.com/lyogavin/Anima
chinese-llm chinese-nlp finetune generative-ai instruct-gpt instruction-set llama llm lora open-models open-source open-source-models qlora
Last synced: about 2 months ago
JSON representation
Moved to here: https://github.com/lyogavin/airllm
- Host: GitHub
- URL: https://github.com/lyogavin/Anima
- Owner: lyogavin
- Created: 2024-07-29T20:11:25.000Z (11 months ago)
- Default Branch: main
- Last Pushed: 2024-08-01T14:11:10.000Z (11 months ago)
- Last Synced: 2025-04-12T11:56:53.643Z (2 months ago)
- Size: 2.93 KB
- Stars: 14
- Watchers: 2
- Forks: 3
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
- StarryDivineSky - lyogavin/Anima
- awesome-llm-if - LLMBar - nlp/LLMBar.svg) (Uncategorized / Uncategorized)
- awesome-llm-if - LLMBar - nlp/LLMBar.svg) (Uncategorized / Uncategorized)
README

**AirLLM** optimizes inference memory usage, allowing 70B large language models to run inference on a single 4GB GPU card without quantization, distillation and pruning. And you can run **405B Llama3.1** on **8GB vram** now.

Moved to here: https://github.com/lyogavin/airllm