https://github.com/tokarev-i-v/awesome-llm-rl-agents
List of sources related to llms, transformers and reinforcement learning agents
https://github.com/tokarev-i-v/awesome-llm-rl-agents
List: awesome-llm-rl-agents
Last synced: about 1 month ago
JSON representation
List of sources related to llms, transformers and reinforcement learning agents
- Host: GitHub
- URL: https://github.com/tokarev-i-v/awesome-llm-rl-agents
- Owner: tokarev-i-v
- Created: 2023-05-30T01:21:57.000Z (almost 2 years ago)
- Default Branch: main
- Last Pushed: 2023-08-27T06:56:28.000Z (over 1 year ago)
- Last Synced: 2024-05-21T01:09:00.752Z (12 months ago)
- Size: 43.9 KB
- Stars: 1
- Watchers: 2
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
- awesome-rl-for-agents - [List - covering rl and agents before 2023 (🔗 Related Awesome Lists / MCP Agents)
- awesome-rl-for-agents - [List - covering rl and agents before 2023 (🔗 Related Awesome Lists / MCP Agents)
README
# awesome-ml-agents
List of sources related to machine learning, llms, transformers and reinforcement learning agents## Table of Contents
- [Code|Apps|Tools](#-code-apps-tools)
- [Papers](#-papers)
- [Blogposts](#-blogposts)## [↑](#table-of-contents) Code|Apps|Tools
[AgentGPT](https://github.com/reworkd/AgentGPT) - AgentGPT allows you to configure and deploy Autonomous AI agents. Name your own custom AI and have it embark on any goal imaginable. It will attempt to reach the goal by thinking of tasks to do, executing them, and learning from the results.
[Baby AGI](https://github.com/yoheinakajima/babyagi) - This Python script is an example of an AI-powered task management system. The system uses OpenAI and vector databases such as Chroma or Weaviate to create, prioritize, and execute tasks.
[BIG-AGI](https://github.com/enricoros/big-agi) - your personal AGI application powered by OpenAI GPT-4 and beyond. Designed for smart humans and super-heroes, this responsive web app comes with Personas, Drawing, Code Execution, PDF imports, Voice support, data Rendering, AGI functions, chats and much more.
[OpenAGI](https://github.com/agiresearch/OpenAGI) - This project presents OpenAGI, an open-source AGI research platform, specifically designed to offer complex, multi-step tasks and accompanied by task-specific datasets, evaluation metrics, and a diverse range of extensible models.
[Gorilla](https://github.com/ShishirPatil/gorilla) - Gorilla: An API store for LLMs.
[GPT Engineer](https://github.com/AntonOsika/gpt-engineer) - GPT Engineer is made to be easy to adapt, extend, and make your agent learn how you want your code to look. It generates an entire codebase based on a prompt.
[LangChain](https://github.com/hwchase17/langchain) - library for creating powerful apps and agents with LLMs
[LLM Agents](https://github.com/mpaepper/llm_agents) - Small library to build agents which are controlled by large language models (LLMs) which is heavily inspired by langchain.
[Voyager](https://github.com/MineDojo/Voyager) - code for Voyager paper## [↑](#table-of-contents) Papers
24.08.2023 [SayCanPay: Heuristic Planning with Large Language Models using Learnable Domain Knowledge](https://arxiv.org/abs/2308.12682) | ([code](https://github.com/RishiHazra/saycanpay)) | ([website](https://rishihazra.github.io/SayCanPay/))
30.06.2023 [Preference Ranking Optimization for Human Alignment](https://arxiv.org/abs/2306.17492)
10.06.2023 [Reflexion: Language Agents with Verbal Reinforcement Learning](https://arxiv.org/abs/2303.11366) | ([code](https://github.com/noahshinn024/reflexion))
xx.06.2023 [Language to Rewards for Robotic Skill Synthesis](https://language-to-reward.github.io/assets/l2r.pdf)
25.05.2023 [Voyager: An Open-Ended Embodied Agent with Large Language Models](https://arxiv.org/abs/2305.16291) | ([code](https://github.com/MineDojo/Voyager))
24.05.2023 [SPRING: GPT-4 Out-performs RL Algorithms by Studying Papers and Reasoning](https://arxiv.org/pdf/2305.15486.pdf)
10.03.2023 [ReAct: Synergizing Reasoning and Acting in Language Models](https://arxiv.org/abs/2210.03629)
10.01.2023 [Chain-of-Thought Prompting Elicits Reasoning in Large Language Models](https://arxiv.org/abs/2201.11903)
08.12.2021 [Improving language models by retrieving from trillions of tokens](https://arxiv.org/abs/2112.04426) | ([code](https://github.com/lucidrains/RETRO-pytorch))
30.11.2021 [Show Your Work: Scratchpads for Intermediate Computation with Language Models](https://arxiv.org/abs/2112.00114)
20.10.2020 [Local Knowledge Powered Conversational Agents](https://arxiv.org/abs/2010.10150)## [↑](#table-of-contents) Blogposts
Yoav Goldberg, April 2023 | [Reinforcement Learning for Language Models](https://gist.github.com/yoavg/6bff0fecd65950898eba1bb321cfbd81)
30.03.2023 | [Language Models can Solve Computer Tasks](https://posgnu.github.io/rci-web/) | ([paper](https://arxiv.org/abs/2303.17491)) | ([code](https://github.com/posgnu/rci-agent))
21.02.2023 | [Do Large Language Models (LLMs) reason?](https://www.shaped.ai/blog/do-large-language-models-llms-reason)
2023 | [Gorilla: Large Language Model Connected with Massive APIs](https://shishirpatil.github.io/gorilla/)
2022 | [LISA: Learning Interpretable Skill Abstractions from Language](https://div99.github.io/LISA/)