0 "benchmark" Awesome Lists
awesome-semantic-segmentation
:metal: awesome-semantic-segmentation
benchmark deeplearning evaluation semantic-segmentation
10,829 stars
2,478 forks
53 projects
Last updated: 09 Apr 2026
Awesome-LLM-Long-Context-Modeling
📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥
agent awsome-list benchmark blogs compress evaluation large-language-models length-extrapolation llm long-context-modeling
2,054 stars
88 forks
1,536 projects
Last updated: 29 Apr 2026
Awesome-System2-Reasoning-LLM
Latest Advances on System-2 Reasoning
benchmark macro-action mcts o1 o3 prm r1 reasoning rl self-improve
1,341 stars
76 forks
410 projects
Last updated: 24 Mar 2026
openstl
OpenSTL: A Comprehensive Benchmark of Spatio-Temporal Predictive Learning
artificial-intelligence attention-mechanism awesome-list awesome-lists benchmark computer-vision deep-learning mlp predictive-learning pytorch
1,085 stars
184 forks
43 projects
Last updated: 10 Apr 2026
awesome-llm-eval
Awesome-LLM-Eval: a curated list of tools, datasets/benchmark, demos, leaderboard, papers, docs and models, mainly for Evaluation on LLMs. 一个由工具、基准/数据、演示、排行榜和大模型等组成的精选列表,主要面向基础大模型评测,旨在探求生成式AI的技术边界.
awsome-list awsome-lists benchmark bert chatglm chatgpt dataset evaluation gpt3 large-language-model
633 stars
59 forks
479 projects
Last updated: 20 Apr 2026
awesome-state-of-depth-completion
Current state of supervised and unsupervised depth completion methods
3d-reconstruction 3d-vision awesome benchmark computer-vision depth-completion kitti machine-learning self-supervised-learning supervised-learning
510 stars
24 forks
71 projects
Last updated: 10 Apr 2026
Awesome-Evaluation-of-Visual-Generation
A list of works on evaluation of visual generation models, including evaluation metrics, models, and systems
awesome benchmark evaluation evaluation-metrics evaluation-system generative-models image-generation video-generation
429 stars
22 forks
834 projects
Last updated: 07 Apr 2026
awesome_imputation
Awesome Deep Learning for Time-Series Imputation, including an unmissable paper and tool list about applying neural networks to impute incomplete time series containing NaN missing values/data
benchmark data-mining deep-learning imputation incomplete-time-series interpolation irregular-time-series machine-learning missing-data missing-values
415 stars
46 forks
123 projects
Last updated: 31 Mar 2026
Awesome_Satellite_Benchmark_Datasets
Supplementary material for our paper "THERE IS NO DATA LIKE MORE DATA" is provided.
awesome-list benchmark benchmark-datasets dataset deep-learning earth-observation machine-learning remote-sensing satellite-imagery
363 stars
29 forks
31 projects
Last updated: 11 Mar 2026
awesome-foundation-model-leaderboards
A curated list of awesome leaderboard-oriented resources for AI domain
ai-agent artificial-intelligence awesome-list benchmark deep-learning evaluation foundation-model large-ai-model leaderboard machine-learning
329 stars
39 forks
542 projects
Last updated: 28 Mar 2026
awesome-mojo
Awesome Mojo🔥
artificial-intelligence awesome awesome-mojo benchmark compilers cpp future-ai future-technologies hacktoberfest hardware-acceleration
224 stars
10 forks
152 projects
Last updated: 04 Mar 2026
marbert
UBC ARBERT and MARBERT Deep Bidirectional Transformers for Arabic
arabic arabic-dialects arabic-language arabic-models arabic-nlp arbert awesome benchmark bert bert-model
116 stars
17 forks
47 projects
Last updated: 05 Apr 2026
awesome-human-activity-recognition
Always up-to-date, most comprehensive HAR resource — continuously scanned and auto-updated from Papers with Code. 53 datasets integrated across all modalities.
action-recognition awesome awesome-list benchmark computer-vision datasets deep-learning human-activity-recognition machine-learning motion-detection
100 stars
1 forks
123 projects
Last updated: 15 Apr 2026
awesome-ai-agent-testing
🤖 A curated list of resources for testing AI agents - frameworks, methodologies, benchmarks, tools, and best practices for ensuring reliable, safe, and effective autonomous AI systems
agent-evaluation agentic-ai ai-agents ai-benchmark ai-safety artificial-intelligence awesome-list benchmark chaos chaos-engineering
31 stars
6 forks
168 projects
Last updated: 26 Mar 2026
awesome-semantic-segmentation
awesome-semantic-segmentation - list of awesome things around semantic segmentation :tada:
awesome awesome-list benchmark deep-learning fcn fully-convolutional-networks rcnn segmentation semantic-segmentation
21 stars
1 forks
162 projects
Last updated: 09 Dec 2025
awesome-idr-prediction
Curated Labs, Tools for Protein Intrinsic Disorder (IDR) prediction.
awesome-list benchmark bioinformatics curated-list intrinsically-disordered-regions machine-learning prediction protein-disorder structural-biology
6 stars
1 forks
118 projects
Last updated: 29 Mar 2026
awesome-llm-agent-privacy
A curated list of papers on privacy, security, and compliance in LLM-based agent systems — attacks, defenses, benchmarks, and regulatory frameworks.
ai-safety awesome-list benchmark compliance llm llm-agents machine-learning multi-agent nlp paper-list
2 stars
0 forks
43 projects
Last updated: 11 Mar 2026