Projects in Awesome Lists tagged with gsm8k
A curated list of projects in awesome lists tagged with gsm8k .
https://github.com/thu-keg/dice
DICE: Detecting In-distribution Data Contamination with LLM's Internal State
benchmark data-contamination fine-tuning-llm gsm8k llm sft
Last synced: 13 May 2025
https://github.com/declare-lab/llm-reasoningtest
Evaluating LLMs' Mathematical and Coding Competency through Ontology-guided Interventions
Last synced: 30 Dec 2024
https://github.com/superbrucejia/gsm8k-consistency
GSM8K-Consistency is a benchmark database for analyzing the consistency of Arithmetic Reasoning on GSM8K.
arithmetic-consistency arithmetic-reasoning factual-consistency foundation-models grade grade-school-math gsm8k large-language-models logical-consistency mathematical-reasoning prompt prompt-engineering prompt-perturbation prompt-toolkit reasoning self-consistency self-consistency-benchmark semantics-consistency semantics-preserving-transformations semantics-similar
Last synced: 23 Feb 2025