https://github.com/chendrag/sfbc

Codes accompanying the paper "Offline Reinforcement Learning via High-Fidelity Generative Behavior Modeling" (ICLR 2023) https://arxiv.org/abs/2209.14548
https://github.com/chendrag/sfbc

diffusion expressive generative offline reinforcement-learning score-based

Last synced: over 1 year ago
JSON representation

Codes accompanying the paper "Offline Reinforcement Learning via High-Fidelity Generative Behavior Modeling" (ICLR 2023) https://arxiv.org/abs/2209.14548

Host: GitHub
URL: https://github.com/chendrag/sfbc
Owner: ChenDRAG
Created: 2023-02-18T13:50:29.000Z (over 3 years ago)
Default Branch: master
Last Pushed: 2023-10-11T09:15:15.000Z (over 2 years ago)
Last Synced: 2025-03-18T11:11:26.237Z (over 1 year ago)
Topics: diffusion, expressive, generative, offline, reinforcement-learning, score-based
Language: Python
Homepage:
Size: 16.7 MB
Stars: 38
Watchers: 1
Forks: 5
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# Offline Reinforcement Learning via High-Fidelity Generative Behavior Modeling

This is a pytorch implementation of SfBC: [Offline Reinforcement Learning via High-Fidelity Generative Behavior Modeling](https://arxiv.org/abs/2209.14548).

![Algorithm Overview](https://github.com/ChenDRAG/SfBC/blob/master/overview.PNG)

\* **For diffusion-based offline RL, we recommend trying our subsequent work, QGPO([paper](https://arxiv.org/abs/2304.12824); [Github](https://github.com/ChenDRAG/CEP-energy-guided-diffusion)). Compared with SfBC, QGPO has improved computational efficiency and noticeably better performance.**

## Requirements

- See conda requirements in `requirements.yml`

## Quick Start
Train the behavior model:

```shell
$ python3 train_behavior.py
```

Train the critic model and plot evaluation scores with tensorboard:

```shell
$ python3 train_critic.py
```

Evaluation only:

```shell
$ python3 evaluation.py
```

## Citing
If you find this code release useful, please reference in your paper:
```
@inproceedings{
chen2023offline,
title={Offline Reinforcement Learning via High-Fidelity Generative Behavior Modeling},
author={Huayu Chen and Cheng Lu and Chengyang Ying and Hang Su and Jun Zhu},
booktitle={The Eleventh International Conference on Learning Representations },
year={2023},
}
```

## Note
+ Contact us at: chenhuay17@gmail.com if you have any question.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/chendrag/sfbc

Awesome Lists containing this project

README