"benchmark" Awesome Lists
awesome-semantic-segmentation
🤘 awesome-semantic-segmentation
benchmark deeplearning evaluation semantic-segmentation
10,813 stars
2,482 forks
53 projects
Last updated: 27 Feb 2026
Awesome-LLM-Long-Context-Modeling
📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥
agent awsome-list benchmark blogs compress evaluation large-language-models length-extrapolation llm long-context-modeling
1,908 stars
79 forks
1,465 projects
Last updated: 08 Feb 2026
Awesome-System2-Reasoning-LLM
Latest Advances on System-2 Reasoning
benchmark macro-action mcts o1 o3 prm r1 reasoning rl self-improve
1,324 stars
77 forks
410 projects
Last updated: 13 Feb 2026
openstl
OpenSTL: A Comprehensive Benchmark of Spatio-Temporal Predictive Learning
artificial-intelligence attention-mechanism awesome-list awesome-lists benchmark computer-vision deep-learning mlp predictive-learning pytorch
1,052 stars
178 forks
43 projects
Last updated: 04 Feb 2026
awesome-llm-eval
Awesome-LLM-Eval: a curated list of tools, datasets/benchmarks, demos, leaderboards, papers, docs, and models, mainly for the evaluation of foundation LLMs, aiming to explore the technical boundaries of generative AI.
awsome-list awsome-lists benchmark bert chatglm chatgpt dataset evaluation gpt3 large-language-model
608 stars
51 forks
479 projects
Last updated: 08 Feb 2026
awesome-state-of-depth-completion
Current state of supervised and unsupervised depth completion methods
3d-reconstruction 3d-vision awesome benchmark computer-vision depth-completion kitti machine-learning self-supervised-learning supervised-learning
502 stars
24 forks
71 projects
Last updated: 14 Feb 2026
Awesome-Evaluation-of-Visual-Generation
A list of works on evaluation of visual generation models, including evaluation metrics, models, and systems
awesome benchmark evaluation evaluation-metrics evaluation-system generative-models image-generation video-generation
413 stars
20 forks
834 projects
Last updated: 06 Feb 2026
awesome_imputation
Awesome Deep Learning for Time-Series Imputation, including an unmissable paper and tool list about applying neural networks to impute incomplete time series containing NaN missing values/data
benchmark data-mining deep-learning imputation incomplete-time-series interpolation irregular-time-series machine-learning missing-data missing-values
408 stars
45 forks
123 projects
Last updated: 18 Feb 2026
Awesome_Satellite_Benchmark_Datasets
Supplementary material for our paper "THERE IS NO DATA LIKE MORE DATA" is provided.
awesome-list benchmark benchmark-datasets dataset deep-learning earth-observation machine-learning remote-sensing satellite-imagery
362 stars
29 forks
31 projects
Last updated: 06 Feb 2026
awesome-foundation-model-leaderboards
A curated list of awesome leaderboard-oriented resources for AI domain
ai-agent artificial-intelligence awesome-list benchmark deep-learning evaluation foundation-model large-ai-model leaderboard machine-learning
307 stars
35 forks
532 projects
Last updated: 03 Feb 2026
awesome-mojo
Awesome Mojo🔥
artificial-intelligence awesome awesome-mojo benchmark compilers cpp future-ai future-technologies hacktoberfest hardware-acceleration
222 stars
11 forks
152 projects
Last updated: 06 Feb 2026
marbert
UBC ARBERT and MARBERT Deep Bidirectional Transformers for Arabic
arabic arabic-dialects arabic-language arabic-models arabic-nlp arbert awesome benchmark bert bert-model
114 stars
17 forks
47 projects
Last updated: 14 Feb 2026
awesome-ai-agent-testing
🤖 A curated list of resources for testing AI agents - frameworks, methodologies, benchmarks, tools, and best practices for ensuring reliable, safe, and effective autonomous AI systems
agent-evaluation agentic-ai ai-agents ai-benchmark ai-safety artificial-intelligence awesome-list benchmark chaos chaos-engineering
28 stars
5 forks
168 projects
Last updated: 27 Feb 2026
awesome-semantic-segmentation
awesome-semantic-segmentation - list of awesome things around semantic segmentation 🎉
awesome awesome-list benchmark deep-learning fcn fully-convolutional-networks rcnn segmentation semantic-segmentation
21 stars
1 fork
59 projects
Last updated: 09 Dec 2025
awesome-idr-prediction
Curated Labs, Tools for Protein Intrinsic Disorder (IDR) prediction.
awesome-list benchmark bioinformatics curated-list intrinsically-disordered-regions machine-learning prediction protein-disorder structural-biology
4 stars
1 fork
118 projects
Last updated: 12 Dec 2025