"llm-evaluation" Awesome Lists
awesome-llm-eval
Awesome-LLM-Eval: a curated list of tools, datasets/benchmarks, demos, leaderboards, papers, docs, and models, mainly for evaluation of LLMs, aiming to explore the technical boundaries of generative AI.
awsome-list awsome-lists benchmark bert chatglm chatgpt dataset evaluation gpt3 large-language-model
608 stars
51 forks
479 projects
Last updated: 08 Feb 2026
awesome-ai-agent-testing
🤖 A curated list of resources for testing AI agents: frameworks, methodologies, benchmarks, tools, and best practices for ensuring reliable, safe, and effective autonomous AI systems
agent-evaluation agentic-ai ai-agents ai-benchmark ai-safety artificial-intelligence awesome-list benchmark chaos chaos-engineering
27 stars
5 forks
168 projects
Last updated: 15 Feb 2026
ai-llmops-index
Comprehensive LLMOps reference index: observability platforms, inference cost intelligence, failure mode taxonomy, stack compatibility matrices, and regulatory compliance mapping for LLMs in production.
ai-compliance ai-governance ai-infrastructure ai-observability ai-safety awesome-list llm-benchmarks llm-cost-comparison llm-evaluation llm-failure-modes
1 star
0 forks
95 projects
Last updated: 12 Mar 2026