Projects in Awesome Lists tagged with evaluations
A curated list of projects in awesome lists tagged with evaluations.
https://github.com/scale3-labs/langtrace
Langtrace 🔍 is an open-source, OpenTelemetry-based end-to-end observability tool for LLM applications, providing real-time tracing, evaluations and metrics for popular LLMs, LLM frameworks, vector DBs and more. Integrate using TypeScript or Python. 🚀💻📊
ai datasets evaluations gpt langchain llm llm-framework llmops observability open-source open-telemetry openai prompt-engineering tracing
Last synced: 15 May 2025
https://github.com/Scale3-Labs/langtrace
Langtrace 🔍 is an open-source, OpenTelemetry-based end-to-end observability tool for LLM applications, providing real-time tracing, evaluations and metrics for popular LLMs, LLM frameworks, vector DBs and more. Integrate using TypeScript or Python. 🚀💻📊
ai datasets evaluations gpt langchain llm llm-framework llmops observability open-source open-telemetry openai prompt-engineering tracing
Last synced: 15 Feb 2025
https://github.com/log10-io/log10
Python client library for improving your LLM app accuracy
agents ai anthropic artificial-intelligence autonomous-agents debugging evaluations feedback fine-tuning llmops llms logging monitoring openai python rlhf
Last synced: 11 Apr 2025
https://github.com/boxbeam/crunch
The fastest Java expression compiler/evaluator
evaluating-mathematical-expressions evaluations
Last synced: 06 Apr 2025
https://github.com/llm-evaluation-s-always-fatiguing/leaf-playground
A framework for building scenario simulation projects in which human and LLM-based agents can participate, with a user-friendly web UI to visualize simulations and support for automatic evaluation at the agent-action level.
agent agent-based-simulation agents automation chatgpt evaluations llm-evaluation
Last synced: 02 Mar 2025
https://github.com/yisaienkov/evaluations
This library implements various metrics (including Kaggle competition and medical metrics) for evaluating ML, DL, and AI models and algorithms. 📐📊📈📉📏
evaluations kaggle kaggle-competition metrics metrics-library pypi python python-library python3
Last synced: 13 Apr 2025
https://github.com/bhadresh-laiya/program-evaluation.com
Do a program evaluation that really counts! It will help other students and will really make universities and colleges take students' experiences to heart!
blade-template built colleges counts evaluation evaluation-data evaluations laravel-framework laravel6 program students students-experiences universities using
Last synced: 04 Apr 2025
https://github.com/jtmuller5/vibe-checker
The TypeScript LLM Evaluation File
ai devtools evals evaluation-metrics evaluations gemini gemini-api gemini-flash javascript llm nodejs testing typescript vitest
Last synced: 25 Mar 2025
https://github.com/parthapray/llm_evaluation_metrics_localized
This repo contains code for localized LLM evaluation metrics via a framework using Ollama and edge resources, along with novel derived metrics
evaluation evaluation-framework evaluation-metrics evaluations flask large-language-models metrics ollama-api restful-api
Last synced: 27 Feb 2025