https://github.com/parthapray/llm_evaluation_metrics_localized

This repo contains code for localized LLM evaluation metrics via a framework using Ollama, edge resources, and novel derived metrics.

evaluation evaluation-framework evaluation-metrics evaluations flask large-language-models metrics ollama-api restful-api

# LLM Evaluation Metrics at Edge

This project provides a Flask-based API for evaluating and monitoring the performance of Large Language Models (LLMs), with detailed metrics covering resource usage (CPU, memory, power) and operational efficiency.

---
## Modify `base_power` and `max_power` for your device

* `base_power` — your device's power draw when idle (e.g., 240)
* `max_power` — your device's power draw under full load (e.g., 600)
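
For orientation, here is a minimal sketch of how these two constants can drive a power estimate, assuming a simple linear interpolation on CPU utilization (a common approximation; the exact formula in `llm_metrics_11.py` may differ):

```python
import psutil

# Example values; measure or look up the figures for your own hardware.
BASE_POWER = 240  # idle-mode power draw
MAX_POWER = 600   # full-load power draw

def estimate_power() -> float:
    """Estimate current power draw by linear interpolation on CPU load."""
    cpu_fraction = psutil.cpu_percent(interval=0.1) / 100.0
    return BASE_POWER + (MAX_POWER - BASE_POWER) * cpu_fraction
```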
---

## Installation

### Step 1: Install Ollama
Ensure that you have **Ollama** installed on your system. Refer to [Ollama's official installation guide](https://ollama.com) for instructions.
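
For example, on Linux you can use Ollama's documented one-line installer, then pull a model to evaluate (`llama3` below is just a placeholder; use any model you like):

```bash
curl -fsSL https://ollama.com/install.sh | sh
ollama pull llama3   # placeholder model name; pull whichever model you plan to evaluate
```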

### Step 2: Set Up Python Environment
1. Clone this repository:
```bash
git clone https://github.com/parthapray/llm_evaluation_metrics_localized.git
cd llm_evaluation_metrics_localized
```

2. Install required dependencies:
```bash
pip install -r requirements.txt
```

### Requirements
Ensure the following Python libraries are installed:
- `flask`
- `psutil`
- `requests`
- `uvicorn`
- `asgiref`

These are already listed in the `requirements.txt` file.
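
For reference, an unpinned `requirements.txt` matching the list above would look like this (the repository's file may pin specific versions):

```
flask
psutil
requests
uvicorn
asgiref
```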

## Running the Application

The main script, `llm_metrics_11.py`, logs all collected metrics to both CSV and JSON files.

1. Start the Flask server:
```bash
python llm_metrics_11.py
```

2. The server will start and listen on `http://0.0.0.0:5000`.
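
For orientation, below is a minimal sketch of what such an endpoint might look like internally; it assumes Ollama's default local endpoint `http://localhost:11434/api/generate` and a placeholder model name. The real implementation in `llm_metrics_11.py` collects considerably more metrics:

```python
import time

import psutil
import requests
from flask import Flask, jsonify, request

app = Flask(__name__)
OLLAMA_URL = "http://localhost:11434/api/generate"  # Ollama's default local API

@app.route("/process_prompt", methods=["POST"])
def process_prompt():
    prompt = request.get_json()["prompt"]

    start = time.time()
    resp = requests.post(OLLAMA_URL, json={
        "model": "llama3",  # placeholder; use the model you pulled
        "prompt": prompt,
        "stream": False,    # return one JSON object instead of a stream
    })
    elapsed = time.time() - start

    data = resp.json()
    return jsonify({
        "response": data.get("response"),
        "total_seconds": elapsed,
        "cpu_percent": psutil.cpu_percent(),            # instantaneous CPU load
        "memory_percent": psutil.virtual_memory().percent,
    })

if __name__ == "__main__":
    app.run(host="0.0.0.0", port=5000)
```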

## API Usage

### POST `/process_prompt`
Send a prompt to the model and receive performance metrics.

#### Example `curl` Command
```bash
curl -X POST http://localhost:5000/process_prompt \
-H "Content-Type: application/json" \
-d '{
"prompt": "What is the capital of France?"
}'
```
#### Batch Requests
To send multiple prompts in one run, use the included test script:
```bash
python test.py
```

#### Response
The API will return a JSON object containing:
- **Ollama-based metrics**: Total duration, load duration, evaluation duration, etc.
- **Local resource usage**: CPU usage, memory usage, power consumption.
- **Derived metrics**: Tokens per second, energy per token, and more (see the sketch below).
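
For illustration, tokens per second and energy per token can be derived from Ollama's raw counters (Ollama reports durations in nanoseconds). The formulas below are a sketch and may differ from those actually used in `llm_metrics_11.py`:

```python
def derived_metrics(ollama_stats: dict, avg_power_watts: float) -> dict:
    """Sketch of derived metrics from Ollama's raw response fields.

    Per Ollama's API, `eval_count` is the number of generated tokens
    and `eval_duration` is in nanoseconds.
    """
    eval_seconds = ollama_stats["eval_duration"] / 1e9
    tokens = ollama_stats["eval_count"]
    tokens_per_second = tokens / eval_seconds if eval_seconds else 0.0
    energy_joules = avg_power_watts * eval_seconds  # E = P * t
    energy_per_token = energy_joules / tokens if tokens else 0.0
    return {
        "tokens_per_second": tokens_per_second,
        "energy_per_token_joules": energy_per_token,
    }
```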
