https://github.com/minaek/reward_design_with_llms


Host: GitHub
URL: https://github.com/minaek/reward_design_with_llms
Owner: minaek
Created: 2023-01-23T17:21:03.000Z (over 2 years ago)
Default Branch: main
Last Pushed: 2023-06-06T06:41:14.000Z (about 2 years ago)
Last Synced: 2024-11-03T06:33:08.526Z (8 months ago)
Language: Python
Size: 1.28 MB
Stars: 202
Watchers: 6
Forks: 19
Open Issues: 2
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

awesome-RLAIF - Code & Prompts

README

# Reward Design with Language Models
The code in this repository is based on the paper [Reward Design with Language Models](https://arxiv.org/pdf/2303.00001.pdf).
This repository contains the prompts that we used for each domain as well as code to train an RL agent with an LLM in the loop using those prompts.
Each domain (Ultimatum Game, Matrix Games, DealOrNoDeal) has its own directory and needs a separate conda/virtual environment.
Please check out the READMEs in each directory for more information on how to run things.

# Using GPT3
We use GPT3 for our experiments. You will need an OpenAI API key saved in your `~/.bashrc` or `~/.zshrc` under the variable `OPENAI_API_KEY`.
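
As a minimal sketch of how a script might pick up that key, the snippet below reads `OPENAI_API_KEY` from the process environment and fails early with a readable message if it is missing. The helper name `load_openai_api_key` is hypothetical and is not taken from this repository; it also assumes the variable is exported in your shell startup file (for example with `export`), since an unexported shell variable is not visible to Python.

```python
import os


def load_openai_api_key(env=None):
    """Return the OpenAI API key from the environment.

    Hypothetical helper for illustration; not part of this repository.
    `env` defaults to os.environ and can be replaced by a plain dict for testing.
    """
    env = os.environ if env is None else env
    # Treat an unset variable and an empty/whitespace-only value the same way.
    key = env.get("OPENAI_API_KEY", "").strip()
    if not key:
        raise RuntimeError(
            "OPENAI_API_KEY is not set. Export it in ~/.bashrc or ~/.zshrc "
            "and open a new shell before running the experiments."
        )
    return key
```

Calling `load_openai_api_key()` at the start of a training script surfaces a missing key before any environment setup or API calls are attempted.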
