An open API service indexing awesome lists of open source software.

"rl" Awesome Lists

Awesome-RL-for-LRMs

A Survey of Reinforcement Learning for Large Reasoning Models

awesome-list deepseek-r1 llm lrm open-source reasoning rl

2,458 stars
130 forks
564 projects

Last updated: 23 May 2026

Awesome-System2-Reasoning-LLM

Latest Advances on System-2 Reasoning

benchmark macro-action mcts o1 o3 prm r1 reasoning rl self-improve

1,352 stars
80 forks
410 projects

Last updated: 27 May 2026

awesome-monte-carlo-tree-search-papers

A curated list of Monte Carlo tree search papers with implementations.

atari deep-learning deep-q-learning learning machine-learning machine-learning-algorithms monte-carlo monte-carlo-tree-search policy-evaluation policy-gradient

703 stars
74 forks
113 projects

Last updated: 26 May 2026

Awesome-Long-Chain-of-Thought-Reasoning

Latest Advances on Long Chain-of-Thought Reasoning

agent chain-of-thought deepseek-r1 long long-chain-of-thought o1 o3 openai-o1 r1 reasoning

635 stars
31 forks
313 projects

Last updated: 27 May 2026

awesome-deep-reasoning

Collect every awesome work about r1!

collection deepseek grpo o1 qwen r1 reasoning rl

432 stars
15 forks
130 projects

Last updated: 06 May 2026

awesome-on-policy-distillation

A curated collection of papers, technical reports, frameworks, and tools for on-policy distillation (OPD) of large language models

awesome awesome-list distillation gkd knowledge-distillation llm llm-distillation llm-training minillm on-policy-distillation

265 stars
4 forks
349 projects

Last updated: 05 Jun 2026

awesome-RLAIF

A continually updated list of literature on Reinforcement Learning from AI Feedback (RLAIF)

alignment llms rl rlaif rlhf

204 stars
7 forks
44 projects

Last updated: 21 May 2026

Awesome-Multimodal-Reasoning

Latest Advances on (RL based) Multimodal Reasoning and Generation in Multimodal LLMs

chain-of-thought cot image-generation image-reasoning image-understanding large-language-models llm lvlm mllm multimodal-learning

67 stars
4 forks
585 projects

Last updated: 02 Jun 2026

Awesome-Agent-RL

A curated list of awesome resources about reward construction for AI agents. This repository covers cutting-edge research and practical guides on defining and collecting rewards to build more intelligent and aligned AI agents.

agent agent-training agentic-ai awesome awesome-list llm reinforcement-learning reward-model rl rlvr

59 stars
1 fork
80 projects

Last updated: 01 Apr 2026

awesome-agentic-search

🔍 Awesome Agentic Search is a curated list of papers, tools, and resources on agentic search—where AI agents plan, search, and reason to answer complex questions. Explore the latest research, benchmarks, and industry solutions for next-gen search-enhanced AI! 🤖✨

agentic reasoning rl search

56 stars
6 forks
37 projects

Last updated: 27 Mar 2026

awesome-rl-for-uavs

Awesome RL applications in UAV control.

rl uav

56 stars
1 fork
64 projects

Last updated: 23 May 2026

awesome-agent-rl-environments

A curated list of training & evaluation environments for LLM/VLM agents (SWE-Gym, GEM, RAGEN, AgentGym, WebArena, OSWorld, ToolBench…). Updated weekly.

agent agent-rl agentic-ai awesome awesome-list benchmark browser-agent computer-use-agent evaluation grpo

1 star
0 forks
69 projects

Last updated: 21 May 2026
