Projects in Awesome Lists tagged with evaluating-models
A curated list of projects in awesome lists tagged with evaluating-models .
https://github.com/eth-sri/matharena
Evaluation of LLMs on latest math competitions
Last synced: 23 Jul 2025
https://github.com/thiagopanini/mlcomposer
Applying Machine Learning has never been easier than with ml composer. This package has excellent tools already built to carry out complex ml processes like training and evaluating multiple models.
classification data-prep-pipelines evaluating-models machine-learning python
Last synced: 29 Jun 2026
https://github.com/shaheennabi/rlvr_grpo-experiment-with-math500
A small experiment repository comparing a base reasoning model against RLVR-GRPO checkpoints on the Math500 dataset. It includes evaluation results, short-form observations, and a local temp_clone of the full open-posttraining-system codebase for reference.
evaluating-models grpo-checkpoint math500 open-posttraining-system policy-optimization post-training reasoning-models reinforcement-learning rlvr-grpo sparse-rewards
Last synced: 18 Jun 2026