https://github.com/devlooped/ai.benchmarks

Benchmark different AI services that can be accessed using Microsoft.Extensions.AI
https://github.com/devlooped/ai.benchmarks

Last synced: 8 months ago
JSON representation

Benchmark different AI services that can be accessed using Microsoft.Extensions.AI

Host: GitHub
URL: https://github.com/devlooped/ai.benchmarks
Owner: devlooped
License: mit
Created: 2024-11-05T21:34:02.000Z (over 1 year ago)
Default Branch: main
Last Pushed: 2025-02-10T22:40:09.000Z (over 1 year ago)
Last Synced: 2025-02-13T17:57:21.964Z (over 1 year ago)
Language: C#
Size: 58.6 KB
Stars: 2
Watchers: 0
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: readme.md
- License: license.txt

Awesome Lists containing this project

README

          # Microsoft.Extensions.AI Benchmarks

Uses the new [Microsoft.Extensions.AI](https://devblogs.microsoft.com/dotnet/introducing-microsoft-extensions-ai-preview/) preview 

to compare the relative performance of OpenAI, AzureAI and xAI, using the same client code.

Currently, comparison includes:

* OpenAI: gpt-4o, gpt-4o-mini

* AzureAI: gpt-4o, gpt-4o-mini

* xAI: grok-beta, grok-2-latest

![xAI grok2](https://img.shields.io/badge/dynamic/json?url=https%3A%2F%2Fgithub.com%2Fdevlooped%2FAI.Benchmarks%2Fraw%2Frefs%2Fheads%2Fmain%2Fartifacts%2Fsummary.json&query=%24.xai.mean&logo=x&label=grok2)

![Azure gpt-4o](https://img.shields.io/badge/dynamic/json?url=https%3A%2F%2Fgithub.com%2Fdevlooped%2FAI.Benchmarks%2Fraw%2Frefs%2Fheads%2Fmain%2Fartifacts%2Fsummary.json&query=%24..aai.mean&label=gpt-4o&logo=data:image/svg+xml;base64,PHN2ZyBpZD0idXVpZC1hZGJkYWU4ZS01YTQxLTQ2ZDEtOGMxOC1hYTczY2RiZmVlMzIiIHhtbG5zPSJodHRwOi8vd3d3LnczLm9yZy8yMDAwL3N2ZyIgd2lkdGg9IjE4IiBoZWlnaHQ9IjE4IiB2aWV3Qm94PSIwIDAgMTggMTgiPjxkZWZzPjxyYWRpYWxHcmFkaWVudCBpZD0idXVpZC0yYTc0MDdhYS1iNzg3LTQ4ZGQtYTk2YS0wZDgxYWI2ZTkzYmIiIGN4PSItNjcuOTgxIiBjeT0iNzkzLjE5OSIgcj0iLjQ1IiBncmFkaWVudFRyYW5zZm9ybT0idHJhbnNsYXRlKC0xNzkzOS4wMyAyMDM2OC4wMjkpIHJvdGF0ZSg0NSkgc2NhbGUoMjUuMDkxIC0zNC4xNDkpIiBncmFkaWVudFVuaXRzPSJ1c2VyU3BhY2VPblVzZSI+PHN0b3Agb2Zmc2V0PSIwIiBzdG9wLWNvbG9yPSIjODNiOWY5IiAvPjxzdG9wIG9mZnNldD0iMSIgc3RvcC1jb2xvcj0iIzAwNzhkNCIgLz48L3JhZGlhbEdyYWRpZW50PjwvZGVmcz48cGF0aCBkPSJtMCwyLjd2MTIuNmMwLDEuNDkxLDEuMjA5LDIuNywyLjcsMi43aDEyLjZjMS40OTEsMCwyLjctMS4yMDksMi43LTIuN1YyLjdjMC0xLjQ5MS0xLjIwOS0yLjctMi43LTIuN0gyLjdDMS4yMDksMCwwLDEuMjA5LDAsMi43Wk0xMC44LDB2My42YzAsMy45NzYsMy4yMjQsNy4yLDcuMiw3LjJoLTMuNmMtMy45NzYsMC03LjE5OSwzLjIyMi03LjIsNy4xOTh2LTMuNTk4YzAtMy45NzYtMy4yMjQtNy4yLTcuMi03LjJoMy42YzMuOTc2LDAsNy4yLTMuMjI0LDcuMi03LjJaIiBmaWxsPSJ1cmwoI3V1aWQtMmE3NDA3YWEtYjc4Ny00OGRkLWE5NmEtMGQ4MWFiNmU5M2JiKSIgc3Ryb2tlLXdpZHRoPSIwIiAvPjwvc3ZnPg==)

![OpenAI gpt-4o](https://img.shields.io/badge/dynamic/json?url=https%3A%2F%2Fgithub.com%2Fdevlooped%2FAI.Benchmarks%2Fraw%2Frefs%2Fheads%2Fmain%2Fartifacts%2Fsummary.json&query=%24.oai.mean&logo=openai&label=gpt-4o)

Prompt:

```csharp

IList prompt =

[

    new ChatMessage(ChatRole.System, "You are a chatbot inspired by the Hitchhiker's Guide to the Galaxy."),

    new ChatMessage(ChatRole.User, "What is the meaning of life, the universe, and everything?"),

];

```

Results:

```

BenchmarkDotNet v0.14.0, Windows 11 (10.0.22631.4890/23H2/2023Update/SunValley3)

Intel Core i9-10900T CPU 1.90GHz, 1 CPU, 20 logical and 10 physical cores

.NET SDK 9.0.200-preview.0.25057.12

  [Host]     : .NET 8.0.13 (8.0.1325.6609), X64 RyuJIT AVX2

  DefaultJob : .NET 8.0.13 (8.0.1325.6609), X64 RyuJIT AVX2

```

| Method | Client            | Provider | Model         | Mean    | Error    | StdDev   | Median  |

|------- |------------------ |--------- |-------------- |--------:|---------:|---------:|--------:|

| **Chat**   | **aai-gpt-4o**        | **Azure AI** | **gpt-4o**        | **1.724 s** | **0.2150 s** | **0.6029 s** | **1.493 s** |

| **Chat**   | **aai-gpt-4o-mini**   | **Azure AI** | **gpt-4o-mini**   | **1.897 s** | **0.2355 s** | **0.6407 s** | **1.730 s** |

| **Chat**   | **oai-gpt-4o**        | **OpenAI**   | **gpt-4o**        | **1.678 s** | **0.1188 s** | **0.3408 s** | **1.660 s** |

| **Chat**   | **oai-gpt-4o-mini**   | **OpenAI**   | **gpt-4o-mini**   | **2.835 s** | **0.2741 s** | **0.7995 s** | **2.629 s** |

| **Chat**   | **xai-grok-2-latest** | **xAI**      | **grok-2-latest** | **1.629 s** | **0.1254 s** | **0.3676 s** | **1.588 s** |

| **Chat**   | **xai-grok-beta**     | **xAI**      | **grok-beta**     | **1.678 s** | **0.1107 s** | **0.3194 s** | **1.655 s** |

## Run locally

You can trivially update (and optionally send a PR) the results by running the benchmarks locally.

Simply fork the repository, clone it locally and run the `.\update.ps1` script from a `pwsh` prompt.

The only requirement is to have `jq` installed and available in your `PATH`.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/devlooped/ai.benchmarks

Awesome Lists containing this project

README