0 "benchmark" Awesome Lists
awesome-semantic-segmentation
:metal: awesome-semantic-segmentation
benchmark deeplearning evaluation semantic-segmentation
10,835 stars
2,475 forks
53 projects
Last updated: 29 May 2026
Awesome-LLM-Long-Context-Modeling
📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥
agent awsome-list benchmark blogs compress evaluation large-language-models length-extrapolation llm long-context-modeling
2,121 stars
96 forks
1,648 projects
Last updated: 13 Jun 2026
Awesome-System2-Reasoning-LLM
Latest Advances on System-2 Reasoning
benchmark macro-action mcts o1 o3 prm r1 reasoning rl self-improve
1,352 stars
80 forks
410 projects
Last updated: 27 May 2026
openstl
OpenSTL: A Comprehensive Benchmark of Spatio-Temporal Predictive Learning
artificial-intelligence attention-mechanism awesome-list awesome-lists benchmark computer-vision deep-learning mlp predictive-learning pytorch
1,108 stars
188 forks
43 projects
Last updated: 29 May 2026
awesome-llm-eval
Awesome-LLM-Eval: a curated list of tools, datasets/benchmark, demos, leaderboard, papers, docs and models, mainly for Evaluation on LLMs. 一个由工具、基准/数据、演示、排行榜和大模型等组成的精选列表,主要面向基础大模型评测,旨在探求生成式AI的技术边界.
awsome-list awsome-lists benchmark bert chatglm chatgpt dataset evaluation gpt3 large-language-model
642 stars
74 forks
479 projects
Last updated: 08 Jun 2026
awesome-state-of-depth-completion
Current state of supervised and unsupervised depth completion methods
3d-reconstruction 3d-vision awesome benchmark computer-vision depth-completion kitti machine-learning self-supervised-learning supervised-learning
510 stars
24 forks
71 projects
Last updated: 24 Apr 2026
Awesome-Evaluation-of-Visual-Generation
A list of works on evaluation of visual generation models, including evaluation metrics, models, and systems
awesome benchmark evaluation evaluation-metrics evaluation-system generative-models image-generation video-generation
448 stars
24 forks
837 projects
Last updated: 01 Jun 2026
awesome_imputation
Awesome Deep Learning for Time-Series Imputation, including an unmissable paper and tool list about applying neural networks to impute incomplete time series containing NaN missing values/data
benchmark data-mining deep-learning imputation incomplete-time-series interpolation irregular-time-series machine-learning missing-data missing-values
419 stars
45 forks
123 projects
Last updated: 22 May 2026
awesome-ai-leaderboard
A curated list of awesome leaderboard-oriented resources for AI domain
ai-agent artificial-intelligence awesome-list benchmark deep-learning evaluation foundation-model large-ai-model leaderboard machine-learning
363 stars
52 forks
553 projects
Last updated: 21 Jun 2026
Awesome_Satellite_Benchmark_Datasets
Supplementary material for our paper "THERE IS NO DATA LIKE MORE DATA" is provided.
awesome-list benchmark benchmark-datasets dataset deep-learning earth-observation machine-learning remote-sensing satellite-imagery
361 stars
29 forks
31 projects
Last updated: 07 Jun 2026
awesome-mojo
Awesome Mojo🔥
artificial-intelligence awesome awesome-mojo benchmark compilers cpp future-ai future-technologies hacktoberfest hardware-acceleration
224 stars
10 forks
152 projects
Last updated: 29 May 2026
awesome-human-activity-recognition
Always up-to-date, most comprehensive HAR resource — continuously scanned and auto-updated from Papers with Code. 53 datasets integrated across all modalities.
action-recognition awesome awesome-list benchmark computer-vision datasets deep-learning human-activity-recognition machine-learning motion-detection
126 stars
3 forks
123 projects
Last updated: 01 Jun 2026
marbert
UBC ARBERT and MARBERT Deep Bidirectional Transformers for Arabic
arabic arabic-dialects arabic-language arabic-models arabic-nlp arbert awesome benchmark bert bert-model
117 stars
17 forks
47 projects
Last updated: 23 Apr 2026
awesome-ai-agent-testing
🤖 A curated list of resources for testing AI agents - frameworks, methodologies, benchmarks, tools, and best practices for ensuring reliable, safe, and effective autonomous AI systems
agent-evaluation agentic-ai ai-agents ai-benchmark ai-safety artificial-intelligence awesome-list benchmark chaos chaos-engineering
37 stars
10 forks
168 projects
Last updated: 25 May 2026
awesome-semantic-segmentation
awesome-semantic-segmentation - list of awesome things around semantic segmentation :tada:
awesome awesome-list benchmark deep-learning fcn fully-convolutional-networks rcnn segmentation semantic-segmentation
21 stars
1 forks
162 projects
Last updated: 10 Apr 2026
awesome-idr-prediction
Curated Labs, Tools for Protein Intrinsic Disorder (IDR) prediction.
awesome-list benchmark bioinformatics curated-list intrinsically-disordered-regions machine-learning prediction protein-disorder structural-biology
6 stars
2 forks
118 projects
Last updated: 29 Mar 2026
awesome-ts-anomaly-detection-datasets
Collection of Time-Series Anomaly Detection Datasets
anomaly anomaly-detection awesome-list benchmark dataset machine-learning outlier-detection time-series
4 stars
1 forks
41 projects
Last updated: 07 Jun 2026
awesome-agent-rl-environments
A curated list of training & evaluation environments for LLM/VLM agents (SWE-Gym, GEM, RAGEN, AgentGym, WebArena, OSWorld, ToolBench…). Updated weekly.
agent agent-rl agentic-ai awesome awesome-list benchmark browser-agent computer-use-agent evaluation grpo
3 stars
0 forks
69 projects
Last updated: 25 May 2026
awesome-llm-agent-privacy
A curated list of papers on privacy, security, and compliance in LLM-based agent systems — attacks, defenses, benchmarks, and regulatory frameworks.
ai-safety awesome-list benchmark compliance llm llm-agents machine-learning multi-agent nlp paper-list
2 stars
0 forks
43 projects
Last updated: 11 Mar 2026