https://github.com/teloryfrozy/llm-benchmark

🐍 Script to compare llm performances with statistics
https://github.com/teloryfrozy/llm-benchmark

anthropic gemini llm-wrapper mistral openai test-script

Last synced: 5 months ago
JSON representation

🐍 Script to compare llm performances with statistics

Host: GitHub
URL: https://github.com/teloryfrozy/llm-benchmark
Owner: teloryfrozy
License: apache-2.0
Created: 2024-11-19T09:25:13.000Z (about 1 year ago)
Default Branch: main
Last Pushed: 2024-11-20T17:16:25.000Z (about 1 year ago)
Last Synced: 2025-06-25T19:02:59.094Z (5 months ago)
Topics: anthropic, gemini, llm-wrapper, mistral, openai, test-script
Language: Python
Homepage:
Size: 15.6 KB
Stars: 1
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

# LLM Benchmark

# Get started

```bash
git clone -b main https://github.com/teloryfrozy/llm-benchmark
python3 -m venv venv
source venv/bin/activate
pip install -r requirements.txt
```

## Create a .env file in the root directory and add the api keys you want to use

```bash
MISTRAL_API_KEY="Your Mistral API Key"
OPENAI_API_KEY="Your OpenAI API Key"
ANTHROPIC_API_KEY="Your Anthropic API Key"
GEMINI_API_KEY="Your Gemini API Key"
```

# Start the benchmark

## Define the llm models and roles to compare in the main.py file
```python
LLM_MODELS = {
"openai": ["gpt-4o-mini"],
"anthropic": ["claude-3-5-haiku-latest"],
"gemini": ["gemini-1.5-flash"],
"mistral": ["mistral-small-latest"],
}
ROLES = ["user", "assistant", "system"]
```

## Define your prompt in utils/constants.py
```python
PROMPT = "How to advertise a SaaS with a budget of $1000 in 3 key sentences?"
```

# Run the benchmark
```bash
python3 main.py
```

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/teloryfrozy/llm-benchmark

Awesome Lists containing this project

README