An open API service indexing awesome lists of open source software.

0 "evaluation" Awesome Lists

awesome-semantic-segmentation

:metal: awesome-semantic-segmentation

benchmark deeplearning evaluation semantic-segmentation

10,813 stars
2,482 forks
53 projects

Last updated: 27 Feb 2026

Awesome-LLM-Long-Context-Modeling

📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥

agent awsome-list benchmark blogs compress evaluation large-language-models length-extrapolation llm long-context-modeling

1,908 stars
79 forks
1,465 projects

Last updated: 08 Feb 2026

awesome-llm-eval

Awesome-LLM-Eval: a curated list of tools, datasets/benchmark, demos, leaderboard, papers, docs and models, mainly for Evaluation on LLMs. 一个由工具、基准/数据、演示、排行榜和大模型等组成的精选列表,主要面向基础大模型评测,旨在探求生成式AI的技术边界.

awsome-list awsome-lists benchmark bert chatglm chatgpt dataset evaluation gpt3 large-language-model

608 stars
51 forks
479 projects

Last updated: 08 Feb 2026

Awesome-Evaluation-of-Visual-Generation

A list of works on evaluation of visual generation models, including evaluation metrics, models, and systems

awesome benchmark evaluation evaluation-metrics evaluation-system generative-models image-generation video-generation

413 stars
20 forks
834 projects

Last updated: 06 Feb 2026

awesome-foundation-model-leaderboards

A curated list of awesome leaderboard-oriented resources for AI domain

ai-agent artificial-intelligence awesome-list benchmark deep-learning evaluation foundation-model large-ai-model leaderboard machine-learning

307 stars
35 forks
532 projects

Last updated: 03 Feb 2026

awesome-data-contamination

The Paper List on Data Contamination for Large Language Models Evaluation.

awesome-list data-contamination evaluation foundation-models large-language-models llm paper-list pre-trained-language-models pre-trained-model

110 stars
5 forks
260 projects

Last updated: 07 Feb 2026

awesome-ai-eval

☑️ A curated list of tools, methods & platforms for evaluating AI reliability in real applications.

ai-evaluation ai-evaluation-framework ai-evaluation-metrics ai-evaluation-tools awesome awesome-list awesome-lists chatgpt claude evaluation

61 stars
5 forks
183 projects

Last updated: 17 Feb 2026

awesome-datacentric-llm

Trending projects & awesome papers about data-centric llm studies.

data-centric-ai evaluation llm pre-training

39 stars
2 forks
45 projects

Last updated: 22 Jan 2026

awesome-ai-agent-testing

🤖 A curated list of resources for testing AI agents - frameworks, methodologies, benchmarks, tools, and best practices for ensuring reliable, safe, and effective autonomous AI systems

agent-evaluation agentic-ai ai-agents ai-benchmark ai-safety artificial-intelligence awesome-list benchmark chaos chaos-engineering

28 stars
5 forks
168 projects

Last updated: 27 Feb 2026

Search
Keywords
awesome-list 3,998 awesome 3,628 awesome-lists 566 machine-learning 451 list 421 deep-learning 376 resources 329 ai 270 hacktoberfest 237 python 222 llm 211 lists 205 javascript 194 security 172 programming 166 artificial-intelligence 156 blockchain 140 computer-vision 138 open-source 125 nlp 123 tools 118 large-language-models 118 data-science 110 react 108 chatgpt 101 android 95 curated-list 91 natural-language-processing 90 linux 85 learning 85 devops 82 awesome-readme 81 css 78 ios 78 awesome-resources 78 papers 77 cybersecurity 74 ethereum 73 reinforcement-learning 73 computer-science 65 nodejs 65 rust 64 kubernetes 64 robotics 62 collection 61 playground 60 game-development 59 design 59 free-resources 59 macos 58 openai 57 golang 57 getvm 57 free 57 java 56 survey 56 tutorials 55 frontend 54 privacy 53 developer-tools 53 bitcoin 52 go 51 hacking 51 tutorial 51 web 50 php 50 datasets 50 web3 50 llms 49 paper 48 education 48 cloud 47 data 47 swift 46 books 45 labex 45 development 45 github 45 hands-on 45 generative-ai 45 gpt 45 mcp 44 html 44 deep-neural-networks 44 dataset 44 cryptocurrency 43 automation 43 opensource 42 security-tools 42 ai-agents 42 exercises 41 data-visualization 41 ml 40 database 40 vue 40 documentation 40 typescript 40 agent 39 docker 39 gamedev 39 research 39 web-development 39 collections 38 algorithms 37 software 37 productivity 36 iot 36 prompt-engineering 35 neural-network 35 paper-list 34 testing 34 reactjs 34 game 34 courses 34 diffusion-models 33 framework 33 aws 33 cpp 32 api 32 projects 31 flutter 31 games 31 dotnet 30 transformer 30 community 30 graph-neural-networks 30 software-engineering 30 solidity 30 multimodal 30 hardware 30 bioinformatics 29 ruby 29 libraries 29 video 29 architecture 29 agents 28 react-native 28 csharp 28 smart-contracts 28 defi 28 ai-tools 28 crypto 28 windows 27 tensorflow 27 links 27 penetration-testing 26 object-detection 26 science 26 cli 26 music 26 self-hosted 26 cryptography 26 serverless 26 mlops 26 sql 25 infosec 25 pentesting 25 data-analysis 25 computer-graphics 25 best-practices 25 library 25 pytorch 25 git 25 django 25 self-supervised-learning 25 angular 25 microsoft 25 3d 24 data-mining 24 mathematics 24 mobile 24 jobs 24 claude 24 engineering 24 slam 24 coding 24 nextjs 23 ui 23 c 23 learning-resources 23 embedded 23 game-engine 23 saas 23 programming-language 23 unity 23 osint 23 neural-networks 23 r 23 knowledge-graph 23 markdown 23 software-development 23 roadmap 23 awsome-list 23 segmentation 22 deeplearning 22 kotlin 22 interview 22 statistics 22 reverse-engineering 22 blog 22 agentic-ai 22 visualization 22 python3 22 awesomeness 22 startups 22 audio 21 chatbot 21 networking 21 blogs 21 federated-learning 21 hacktoberfest-accepted 21 cloud-computing 21 opendata 21 finance 21 gpt-3 21 foundation-models 21 apple 21 startup 21 website 21 gpt-4 21 gemini 21 autonomous-driving 21 free-tutorials 21 azure 21 rag 20 chinese 20 data-structures 20 machinelearning 20 reasoning 20 time-series 20 apps 20 awsome 20 llama 20 graph 20 marketing 20 js 20 claude-code 20 raspberry-pi 20 guidelines 20 front-end 19 big-data 19 youtube 19 curated 19 anthropic 19 transformers 19 bugbounty 19 monitoring 19 image-generation 19 bash 18 stable-diffusion 18 aigc 18 backend 18 code 18 decentralized 18 graphql 18 representation-learning 18 prompts 18 prompt 18 optimization 18 seo 17 analytics 17 cloud-native 17 data-engineering 17 knowledge 17 generative-model 17 containers 17 beginner-friendly 17 cheatsheet 17 open-data 17 generative-art 17 devsecops 16 bert 16 telegram 16 microservices 16 text-to-image 16 autonomous-vehicles 16 deepseek 16 vscode 16 model-context-protocol 16 dart 16 speech-recognition 16 nerf 16 language 16 articles 16 algorithm 16 webgl 16 mysql 15 technology 15 evm 15 readme 15 leetcode 15 vlm 15 command-line 15 ros 15 image-processing 15 plugins 15 embodied-ai 15 mcp-server 15 selfhosted 15 android-development 15 distributed-systems 15 terminal 15 benchmark 15 malware-analysis 15 pentest 15 detection 15 p2p 15 cms 15 animation 15 laravel 15 resource 15 vibe-coding 15 generative-adversarial-network 14 multimodal-deep-learning 14 remote-sensing 14 swiftui 14 oss 14 node 14 databases 14 cross-platform 14 art 14 foss 14 performance 14 android-library 14 npm 14 mac 14 quantization 14 code-generation 14 large-language-model 14 videos 14 anomaly-detection 14 icons 14 react-components 14 webassembly 14 wasm 14 pose-estimation 14 google 14 dotnet-core 14 langchain 14 graphics 14 diffusion 14 recommender-system 14 language-model 14 awesome-ai 14 vuejs 14 microservice 13 webdevelopment 13 application 13 search 13 interview-questions 13 deep-reinforcement-learning 13 wordpress 13 frameworks 13 unicorns 13 multimodal-learning 13 semantic-segmentation 13 transfer-learning 13 quantum-computing 13 video-generation 13 developer 13 vision-language-model 13 multimodal-large-language-models 13 postgresql 13 gis 13 discord 13 podcast 13 yolo 13 pwa 13 fuzzing 13 svelte 13 hosting 13 programming-languages 12 model-compression 12 datascience 12 static-analysis 12 time-series-analysis 12 system-design 12 authentication 12 article 12 edge-computing 12 objective-c 12 guide 12 web-design 12 search-engine 12 dapp 12 static-site-generator 12 medical-imaging 12 chain-of-thought 12 gaming 12 llmops 12 3d-graphics 12 solana 12 planning 12 vision-and-language 12 gan 12 question-answering 12 es6 12 sustainability 12 cuda 12 mllm 12 image 12 ai-agent 12 graphics-programming 12 elasticsearch 12 rails 12 automl 12 infrastructure-as-code 12 utilities 12 workflow 12 storage 12 webapp 12 cursor 11 mobile-development 11 in-context-learning 11 movies 11 cicd 11 podcasts 11 anime 11 embedded-systems 11 rl 11 math 11 hacktoberfest2020 11 machine-learning-algorithms 11 webpack 11 climate-change 11 font 11 reading-list 11 cvpr 11 front-end-development 11 minecraft 11 rest-api 11 threat-intelligence 11 continual-learning 11 arduino 11 golang-library 11 bookmarks 11 ecommerce 11 writing 11 creative-coding 11 unsupervised-learning 11 point-cloud 11 open-science 11 infrastructure 11 unity3d 11 flutter-apps 11 ui-design 11 ctf 11 software-architecture 11 ux 11 few-shot-learning 11 interpretability 11 3d-reconstruction 11 sysadmin 11 mcp-servers 11 book 11 healthcare 11 course 11 free-software 11 semantic-web 10 text-mining 10 flutter-examples 10 leadership 10 knowledge-base 10 explainable-ai 10 flask 10 cv 10 apis 10 uav 10 vr 10 drones 10 hacking-tools 10 docs 10 geospatial 10 templates 10 ai-art 10 bug-bounty 10 malware 10 fpga 10 privacy-tools 10 applications 10 classification 10 bot 10 tool 10 game-design 10 ide 10 email 10 xai 10 management 10 haskell 10 cyber-security 10 jupyter-notebook 10 webcomponents 10 blockchain-technology 10 design-systems 10 reddit 10 python-library 10 neuroscience 10 hacktoberfest2021 10 meta-learning 10 gpu 10