An open API service indexing awesome lists of open source software.

0 "ai-evaluation" Awesome Lists

awesome-ai-eval

☑️ A curated list of tools, methods & platforms for evaluating AI reliability in real applications

ai-evaluation ai-evaluation-framework ai-evaluation-metrics ai-evaluation-tools awesome awesome-list awesome-lists chatgpt claude evaluation

81 stars
16 forks
186 projects

Last updated: 01 Jun 2026

Awesome-AI-Evaluation-Guide

A comprehensive, implementation-focused guide to evaluating Large Language Models, RAG systems, and Agentic AI in production environments.

agentic-ai ai-evaluation ai-evaluation-framework ai-evaluation-metrics ai-evaluation-tools awesome awesome-lists claude evaluation-framework evaluation-metrics

14 stars
4 forks
98 projects

Last updated: 30 May 2026

awesome-ai-benchmarks-evaluation

A curated list of evaluation tools, benchmark datasets, leaderboards, frameworks, and resources for assessing model performance.

ai ai-benchmark ai-benchmarks ai-evaluation awesome awesome-list awesome-lists

8 stars
4 forks
52 projects

Last updated: 08 Jun 2026

awesome-ai-agent-evaluation

A curated list of benchmarks, eval harnesses, papers, datasets, and production checks for AI agents.

agent-benchmark agent-evaluation ai-agents ai-evaluation awesome benchmarks coding-agents evals llm-agents llm-evaluation

1 stars
1 forks
141 projects

Last updated: 30 May 2026

Search
Keywords
awesome-list 4,302 awesome 3,867 awesome-lists 620 machine-learning 470 list 426 deep-learning 390 resources 333 ai 313 llm 259 hacktoberfest 231 python 227 lists 204 javascript 198 security 180 artificial-intelligence 171 programming 165 open-source 157 computer-vision 146 blockchain 145 large-language-models 131 nlp 129 tools 123 curated-list 117 data-science 115 react 109 chatgpt 108 ai-agents 103 android 96 natural-language-processing 93 devops 89 linux 87 learning 87 awesome-readme 85 cybersecurity 85 mcp 83 ios 81 papers 81 awesome-resources 79 developer-tools 79 reinforcement-learning 79 css 77 ethereum 76 robotics 72 openai 70 rust 69 kubernetes 67 nodejs 66 computer-science 65 collection 62 design 61 survey 61 playground 61 macos 61 game-development 60 free-resources 59 free 59 golang 59 tutorials 58 generative-ai 58 getvm 57 java 57 web3 57 datasets 57 privacy 56 frontend 54 hacking 54 php 53 bitcoin 53 automation 52 llms 52 tutorial 52 ai-tools 52 education 51 go 50 web 49 paper 49 swift 49 gpt 48 github 47 data 47 books 46 security-tools 46 agent 45 prompt-engineering 45 labex 45 hands-on 45 development 45 cloud 45 dataset 45 typescript 45 opensource 45 cryptocurrency 44 deep-neural-networks 44 data-visualization 43 database 43 claude-code 43 html 43 productivity 43 research 43 documentation 42 api 42 gamedev 41 claude 41 vue 41 exercises 41 docker 40 ml 39 software 39 web-development 39 diffusion-models 39 agentic-ai 38 collections 38 paper-list 38 algorithms 37 community 37 iot 36 game 36 open-data 35 agents 35 neural-network 35 cli 34 reactjs 34 courses 34 aws 34 testing 34 multimodal 33 framework 33 cpp 33 hardware 33 rag 33 video 32 crypto 32 software-engineering 32 flutter 32 defi 31 solidity 31 games 31 projects 31 react-native 31 transformer 30 graph-neural-networks 30 libraries 30 mlops 30 self-hosted 30 bioinformatics 29 penetration-testing 29 architecture 29 ruby 29 gemini 29 marketing 28 mobile 28 pentesting 28 microsoft 28 smart-contracts 28 dotnet 28 slam 27 music 27 tensorflow 27 best-practices 27 object-detection 27 cryptography 27 windows 27 angular 27 links 27 anthropic 27 saas 27 science 27 csharp 27 serverless 26 reverse-engineering 26 self-supervised-learning 26 data-analysis 26 library 26 pytorch 26 prompts 26 data-mining 25 nextjs 25 coding 25 django 25 jobs 25 sql 25 osint 25 knowledge-graph 25 infosec 25 3d 25 markdown 24 git 24 learning-resources 24 kotlin 24 openclaw 24 mathematics 24 chatbot 24 time-series 24 computer-graphics 24 ai-agent 24 neural-networks 24 awsome-list 24 embedded 24 model-context-protocol 23 chinese 23 ui 23 c 23 autonomous-driving 23 roadmap 23 audio 23 visualization 23 apple 23 software-development 23 segmentation 23 unity 23 engineering 23 foundation-models 23 blog 23 game-engine 23 programming-language 23 skills 23 image-generation 23 interview 23 monitoring 22 startup 22 seo 22 reasoning 22 awesomeness 22 python3 22 startups 22 agent-skills 22 website 22 networking 22 statistics 22 deeplearning 22 devsecops 22 awsome 21 finance 21 blogs 21 federated-learning 21 gpt-3 21 r 21 hacktoberfest-accepted 21 cloud-computing 21 azure 21 free-tutorials 21 apps 20 curated 20 terminal 20 graph 20 plugins 20 youtube 20 graphql 20 llama 20 js 20 opendata 20 data-structures 20 decentralized 20 machinelearning 20 transformers 20 big-data 19 speech-recognition 19 mcp-server 19 backend 19 benchmark 19 command-line 19 raspberry-pi 19 langchain 19 gpt-4 19 front-end 19 representation-learning 18 bugbounty 18 cursor 18 stable-diffusion 18 prompt 18 telegram 18 analytics 18 cloud-native 18 guidelines 18 embodied-ai 18 generative-art 18 aigc 18 optimization 18 text-to-image 18 vibe-coding 18 data-engineering 17 bash 17 deepseek 17 knowledge 17 autonomous-vehicles 17 cheatsheet 17 pentest 17 generative-model 17 vlm 17 swiftui 17 containers 17 animation 17 laravel 17 beginner-friendly 17 malware-analysis 17 code 17 nerf 17 large-language-model 16 webgl 16 dart 16 articles 16 autonomous-agents 16 videos 16 ai-safety 16 quantization 16 mysql 16 bert 16 espana 16 spain 16 image-processing 16 algorithm 16 microservices 16 language 16 evm 16 video-generation 16 reddit 16 ros 16 readme 15 vuejs 15 anomaly-detection 15 resource 15 android-development 15 pose-estimation 15 mcp-servers 15 multimodal-deep-learning 15 codex 15 multimodal-large-language-models 15 technology 15 awesome-ai 15 performance 15 leetcode 15 remote-sensing 15 llm-agents 15 vscode 15 gis 15 graphics 15 art 15 p2p 15 foss 15 cms 15 databases 15 distributed-systems 15 drones 14 recommender-system 14 detection 14 google 14 llmops 14 discord 14 code-generation 14 npm 14 gaming 14 react-components 14 language-model 14 vision-language-model 14 system-design 14 icons 14 generative-adversarial-network 14 android-library 14 selfhosted 14 mac 14 webassembly 14 node 14 diffusion 14 yolo 14 medical-imaging 14 oss 14 deep-reinforcement-learning 14 multi-agent 14 wasm 14 dotnet-core 14 podcast 14 multimodal-learning 14 software-libre 14 programming-languages 13 edge-computing 13 minecraft 13 automl 13 uav 13 sustainability 13 solana 13 unicorns 13 model-compression 13 bug-bounty 13 mllm 13 frameworks 13 utilities 13 cicd 13 dapp 13 rest-api 13 transfer-learning 13 svelte 13 claude-skills 13 microservice 13 webdevelopment 13 wordpress 13 sre 13 pwa 13 cyber-security 13 interview-questions 13 continual-learning 13 rl 13 datos-abiertos 13 gan 13 quantum-computing 13 copilot 13 storage 13 objective-c 13 developer 13 workflow 13 semantic-segmentation 13 infrastructure 13 planning 13 search 13 postgresql 13 ecommerce 13 sysadmin 13 mobile-development 13 search-engine 12 ux 12 trading 12 observability 12 article 12 open-science 12 ai-art 12 cuda 12 web-design 12 graphics-programming 12 elasticsearch 12 neuroscience 12 threat-intelligence 12 speech-to-text 12 datascience 12 static-analysis 12 privacy-tools 12 rails 12 email 12 infrastructure-as-code 12 cross-platform 12 ctf 12 software-architecture 12 es6 12 ui-design 12 vector-database 12 authentication 12 creative-coding 12 vision-and-language 12 image 12 guide 12 chain-of-thought 12 3d-graphics 12 question-answering 12 static-site-generator 12 time-series-analysis 12 webapp 12 ai-coding 12 application 12 leadership 12 healthcare 12 fuzzing 12 test-automation 11 movies 11 xai 11 few-shot-learning 11 reading-list 11 font 11 3d-reconstruction 11 packages 11 math 11 webpack 11 arduino 11 book 11 knowledge-base 11 hacktoberfest2020 11 course 11 hosting 11 students 11 simulation 11 ros2 11 management 11 unity3d 11 swift-library 11 embedded-systems 11 in-context-learning 11 geospatial 11 point-cloud 11 anime 11 climate-change 11 incident-response 11 lua 11 evaluation 11 cvpr 11 fpga 11 drone 11 free-software 11 writing 11 compliance 11 vr 11 elixir 11 unsupervised-learning 11 gpu 11