https://github.com/brontoguana/krasis
Krasis is a hybrid LLM runtime focused on running larger models efficiently on consumer-grade, VRAM-limited hardware.
Last synced: 3 days ago
- Host: GitHub
- URL: https://github.com/brontoguana/krasis
- Owner: brontoguana
- License: other
- Created: 2026-02-09T17:54:07.000Z (4 months ago)
- Default Branch: main
- Last Pushed: 2026-05-28T00:18:52.000Z (5 days ago)
- Last Synced: 2026-05-28T02:23:52.829Z (5 days ago)
- Topics: cpu-inference, gguf-model-support, gpu-inference, high-performance-inference, hybrid-inference, inference-engine, inference-optimization, large-language-models, llama-cpp-alternative, llm-inference, mixture-of-experts, transformer
- Language: C++
- Homepage:
- Size: 376 MB
- Stars: 459
- Watchers: 9
- Forks: 26
- Open Issues: 15
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- License: LICENSE
Awesome Lists containing this project
- awesome-local-llm - krasis - a hybrid LLM runtime focused on running larger models efficiently on consumer-grade, VRAM-limited hardware (Inference engines)