An open API service indexing awesome lists of open source software.

https://github.com/pogzyb/llmabda

Run llama.cpp server on aws lambda for cheap
https://github.com/pogzyb/llmabda

agentic-ai inference inference-api inference-engine lambda-functions llamacpp

Last synced: 4 months ago
JSON representation

Run llama.cpp server on aws lambda for cheap

Awesome Lists containing this project

README

          

## LLMABDA

[![release](https://github.com/pogzyb/llmabda/actions/workflows/release.yml/badge.svg)](https://github.com/pogzyb/llmabda/actions/workflows/release.yml)

🐑 Run Quantized Agents on AWS Lambda for Cheap 🐥

### What is this?

LLMabda is a simple proxy for [llama.cpp](https://github.com/ggml-org/llama.cpp) server that runs on AWS Lambda.

### Deployment

Information about deployment is in [deploy/](/deploy/DEPLOY.md)

### Demo

1. Agent (Using ggml-org/Qwen3-1.7B-GGUF)
- [source code](https://github.com/pogzyb/llmabda/blob/main/examples/react_agent_simple.py)
- "What is 5 + (5 * 6)? Please shout the final answer."

![task_summarize](docs/react_agent_simple.png "Agent")

2. Summarize text (Using ggml-org/Qwen3-1.7B-GGUF)
- [source code](https://github.com/pogzyb/llmabda/blob/main/examples/task_summarize_osrs.py)
- "What happened to the Crystal Extractor?"

![task_summarize](docs/task_summarize_osrs.png "Summarization")