Projects in Awesome Lists tagged with model-assessment
A curated list of projects in awesome lists tagged with model-assessment .
https://github.com/MLGroupJLU/LLM-eval-survey
The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".
benchmark evaluation large-language-models llm llms model-assessment
Last synced: 04 Apr 2025
https://github.com/mlgroupjlu/llm-eval-survey
The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".
benchmark evaluation large-language-models llm llms model-assessment
Last synced: 26 Mar 2025
https://github.com/johnsnowlabs/langtest
Deliver safe & effective language models
ai-safety ai-testing artificial-intelligence benchmark-framework benchmarks ethics-in-ai large-language-models llm llm-as-evaluator llm-evaluation-toolkit llm-test llm-testing ml-safety ml-testing mlops model-assessment nlp responsible-ai trustworthy-ai
Last synced: 14 May 2025
https://github.com/JohnSnowLabs/langtest
Deliver safe & effective language models
ai-safety ai-testing artificial-intelligence benchmark-framework benchmarks ethics-in-ai large-language-models llm llm-as-evaluator llm-evaluation-toolkit llm-test llm-testing ml-safety ml-testing mlops model-assessment nlp responsible-ai trustworthy-ai
Last synced: 01 Feb 2025
https://github.com/slipguru/palladio
ParALleL frAmework for moDel selectIOn
analysis assessment evaluation machine-learning model model-assessment palladio pipeline python sklearn sklearn-compatible
Last synced: 13 Apr 2025
https://github.com/yassinelahdiy/page-language-model
Open-source framework for defining Page Language Models (PLMs) for intelligent app understanding and AI-assisted testing.
agent benchmark evaluation gui image-recognition instruction-tuning model-assessment nlp playwright pre-training prompt-engineering rest-api set-of-mark webpage
Last synced: 19 Apr 2025