Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/liuzuxin/DSRL
🔥 Datasets and env wrappers for offline safe reinforcement learning
- Host: GitHub
- URL: https://github.com/liuzuxin/DSRL
- Owner: liuzuxin
- License: apache-2.0
- Created: 2023-06-05T20:53:56.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2024-09-13T17:01:39.000Z (2 months ago)
- Last Synced: 2024-11-12T20:02:34.622Z (4 days ago)
- Topics: benchmark, dataset, imitation-learning, offline-rl, reinforcement-learning, robotics, safe-rl, safety
- Language: Python
- Homepage: https://offline-saferl.org
- Size: 6.79 MB
- Stars: 75
- Watchers: 2
- Forks: 4
- Open Issues: 1
- Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- awesome-decision-transformer - DSRL
README
![Python 3.8+](https://img.shields.io/badge/Python-3.8%2B-brightgreen.svg)
[![License](https://img.shields.io/badge/License-Apache_2.0-blue.svg)](#license)
[![PyPI](https://img.shields.io/pypi/v/dsrl?logo=pypi)](https://pypi.org/project/dsrl)
[![GitHub Repo Stars](https://img.shields.io/github/stars/liuzuxin/dsrl?color=brightgreen&logo=github)](https://github.com/liuzuxin/dsrl/stargazers)
[![Downloads](https://static.pepy.tech/personalized-badge/dsrl?period=total&left_color=grey&right_color=blue&left_text=downloads)](https://pepy.tech/project/dsrl)
---
**DSRL (Datasets for Safe Reinforcement Learning)** provides a rich collection of datasets specifically designed for offline Safe Reinforcement Learning (RL). Created with the objective of fostering progress in offline safe RL research, DSRL bridges a crucial gap in the availability of safety-centric public benchmarks and datasets.
DSRL provides:
1. **Diverse datasets:** 38 datasets across different safe RL environments and difficulty levels in [SafetyGymnasium](https://github.com/PKU-Alignment/safety-gymnasium), [BulletSafetyGym](https://github.com/liuzuxin/Bullet-Safety-Gym), and [MetaDrive](https://github.com/HenryLHH/metadrive_clean), all prepared with safety considerations.
2. **Consistent API with D4RL:** For easy use and evaluation of offline learning methods.
3. **Data post-processing filters:** Allowing alteration of data density, noise level, and reward distributions to simulate various data collection conditions.

This package is part of a comprehensive benchmarking suite that also includes [FSRL](https://github.com/liuzuxin/fsrl) and [OSRL](https://github.com/liuzuxin/osrl), and aims to promote advancements in the development and evaluation of safe learning algorithms.
We provide a detailed breakdown of the datasets, including all the environments we use, the dataset sizes, and the cost-reward-return plot for each dataset. These details can be found in the [docs](https://github.com/liuzuxin/DSRL/tree/main/docs) folder.
To learn more, please visit our [project website](http://www.offline-saferl.org). If you find this code useful, please cite our paper, which has been accepted by the [DMLR journal](https://data.mlr.press/volumes/01.html):
```bibtex
@article{liu2024offlinesaferl,
title={Datasets and Benchmarks for Offline Safe Reinforcement Learning},
author={Zuxin Liu and Zijian Guo and Haohong Lin and Yihang Yao and Jiacheng Zhu and Zhepeng Cen and Hanjiang Hu and Wenhao Yu and Tingnan Zhang and Jie Tan and Ding Zhao},
journal={Journal of Data-centric Machine Learning Research},
year={2024}
}
```

## Installation
### Install from PyPI
DSRL is currently hosted on [PyPI](https://pypi.org/project/dsrl); you can install it with:
```bash
pip install dsrl
```
By default, it will also install the `bullet-safety-gym` and `safety-gymnasium` environments.

If you want to use the `MetaDrive` environment, please install it via:
```bash
pip install git+https://github.com/HenryLHH/metadrive_clean.git@main
```

### Install from source
Clone this repository and install it:
```bash
git clone https://github.com/liuzuxin/DSRL.git
cd DSRL
pip install -e .
```

You can also install the `MetaDrive` package by specifying the optional dependency:
```bash
pip install -e .[metadrive]
```

## How to use DSRL
DSRL uses the [Gymnasium](https://gymnasium.farama.org/) API. Tasks are created via the `gymnasium.make` function. Each task is associated with a fixed offline dataset, which can be obtained with the `env.get_dataset()` method. This method returns a dictionary with:
- `observations`: An N × obs_dim array of observations.
- `next_observations`: An N × obs_dim array of next observations.
- `actions`: An N × act_dim array of actions.
- `rewards`: An N dimensional array of rewards.
- `costs`: An N dimensional array of costs.
- `terminals`: An N dimensional array of episode termination flags. This is true when episodes end due to termination conditions such as falling over.
- `timeouts`: An N dimensional array of termination flags. This is true when episodes end due to reaching the maximum episode length (see the episode-splitting sketch after the example below).

The usage is similar to [D4RL](https://github.com/Farama-Foundation/D4RL). Here is an example:
```python
import gymnasium as gym
import dsrl

# Create the environment
env = gym.make('OfflineCarCircle-v0')

# Each task is associated with a dataset
# dataset contains observations, next_observations, actions, rewards, costs, terminals, timeouts
dataset = env.get_dataset()
print(dataset['observations'])  # An N x obs_dim Numpy array of observations

# dsrl abides by the Gymnasium interface
obs, info = env.reset()
obs, reward, terminal, timeout, info = env.step(env.action_space.sample())
cost = info["cost"]# Apply dataset filters [optional]
# dataset = env.pre_process_data(dataset, filter_cfgs)
```

Datasets are automatically downloaded to the `~/.dsrl/datasets` directory when `get_dataset()` is called. If you would like to change the location of this directory, you can set the `$DSRL_DATASET_DIR` environment variable to the directory of your choosing, or pass the dataset filepath directly to the `get_dataset` method.
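As a minimal sketch (not part of the DSRL API), here is one way to slice the dataset dictionary into episodes and compute each episode's undiscounted reward and cost return, using the `terminals` and `timeouts` flags described above; the task name simply reuses the example above.

```python
import numpy as np
import gymnasium as gym
import dsrl

env = gym.make('OfflineCarCircle-v0')
dataset = env.get_dataset()

# An episode boundary is wherever either termination flag is set.
dones = np.logical_or(dataset['terminals'], dataset['timeouts'])
ends = np.where(dones)[0]

# Undiscounted reward and cost returns per episode.
start = 0
for end in ends:
    ep_reward = dataset['rewards'][start:end + 1].sum()
    ep_cost = dataset['costs'][start:end + 1].sum()
    print(f"reward return = {ep_reward:.1f}, cost return = {ep_cost:.1f}")
    start = end + 1
```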
You can run the following example scripts to explore the offline datasets of all the supported environments:
```bash
python examples/run_mujoco.py --agent [your_agent] --task [your_task]
python examples/run_bullet.py --agent [your_agent] --task [your_task]
python examples/run_metadrive.py --road [your_road] --traffic [your_traffic]
```

We also provide rendering examples for the MetaDrive environments, supporting both third-person view and bird's-eye view:
```bash
python examples/run_metadrive.py --road [your_road] --traffic [your_traffic] --render [bev/3pv/none]
```

### Normalizing Scores
- Set the target cost by using the `env.set_target_cost(target_cost)` function, where `target_cost` is the undiscounted sum of costs of an episode.
- You can use the `env.get_normalized_score(return, cost_return)` function to compute a normalized reward and cost for an episode, where `return` and `cost_return` are the undiscounted sums of rewards and costs of that episode, respectively (see the sketch after this list).
- The individual min and max reference returns used for normalization are stored in `dsrl/infos.py`.
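A minimal sketch of the normalization workflow, assuming `get_normalized_score` returns a (normalized reward, normalized cost) pair as described above; the target cost and the episode sums below are made-up illustrative numbers, not values from the benchmark.

```python
import gymnasium as gym
import dsrl

env = gym.make('OfflineCarCircle-v0')

# Target cost: the undiscounted cost sum used as the episode constraint (illustrative value).
env.set_target_cost(10.0)

# Undiscounted sums of rewards and costs from one evaluation episode (illustrative values).
episode_return, episode_cost_return = 350.0, 40.0

norm_reward, norm_cost = env.get_normalized_score(episode_return, episode_cost_return)
print(f"normalized reward = {norm_reward:.3f}, normalized cost = {norm_cost:.3f}")
```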
## License

All datasets are licensed under the [Creative Commons Attribution 4.0 License (CC BY)](https://creativecommons.org/licenses/by/4.0/), and code is licensed under the [Apache 2.0 License](https://www.apache.org/licenses/LICENSE-2.0.html).