Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/MushroomRL/mushroom-rl

Python library for Reinforcement Learning.
https://github.com/MushroomRL/mushroom-rl

atari ddpg deep-learning deep-reinforcement-learning dqn mujoco openai-gym pybullet pytorch qlearning reinforcement-learning rl sac trpo

Last synced: about 1 month ago
JSON representation

Python library for Reinforcement Learning.

Awesome Lists containing this project

README

        

**********
MushroomRL
**********

.. image:: https://github.com/MushroomRL/mushroom-rl/actions/workflows/continuous_integration.yml/badge.svg?branch=dev
:target: https://github.com/MushroomRL/mushroom-rl/actions/workflows/continuous_integration.yml
:alt: Continuous Integration

.. image:: https://readthedocs.org/projects/mushroomrl/badge/?version=latest
:target: https://mushroomrl.readthedocs.io/en/latest/?badge=latest
:alt: Documentation Status

.. image:: https://api.codeclimate.com/v1/badges/3b0e7167358a661ed882/maintainability
:target: https://codeclimate.com/github/MushroomRL/mushroom-rl/maintainability
:alt: Maintainability

.. image:: https://api.codeclimate.com/v1/badges/3b0e7167358a661ed882/test_coverage
:target: https://codeclimate.com/github/MushroomRL/mushroom-rl/test_coverage
:alt: Test Coverage

**MushroomRL: Reinforcement Learning Python library.**

.. contents:: **Contents of this document:**
:depth: 2

What is MushroomRL
==================
MushroomRL is a Python Reinforcement Learning (RL) library whose modularity allows
to easily use well-known Python libraries for tensor computation (e.g. PyTorch,
Tensorflow) and RL benchmarks (e.g. OpenAI Gym, PyBullet, Deepmind Control Suite).
It allows to perform RL experiments in a simple way providing classical RL algorithms
(e.g. Q-Learning, SARSA, FQI), and deep RL algorithms (e.g. DQN, DDPG, SAC, TD3,
TRPO, PPO).

`Full documentation and tutorials available here `_.

Installation
============

You can do a minimal installation of ``MushroomRL`` with:

.. code:: shell

pip3 install mushroom_rl

Installing everything
---------------------
``MushroomRL`` contains also some optional components e.g., support for ``OpenAI Gym``
environments, Atari 2600 games from the ``Arcade Learning Environment``, and the support
for physics simulators such as ``Pybullet`` and ``MuJoCo``.
Support for these classes is not enabled by default.

To install the whole set of features, you will need additional packages installed.
You can install everything by running:

.. code:: shell

pip3 install mushroom_rl[all]

This will install every dependency of MushroomRL, except the Plots dependency.
For ubuntu>20.04, you may need to install pygame and gym dependencies:

.. code:: shell

sudo apt -y install libsdl-image1.2-dev libsdl-mixer1.2-dev libsdl-ttf2.0-dev \
libsdl1.2-dev libsmpeg-dev libportmidi-dev ffmpeg libswscale-dev \
libavformat-dev libavcodec-dev swig

Notice that you still need to install some of these dependencies for different operating systems, e.g. swig for macOS

Below is the code that you need to run to install the Plots dependencies:

.. code:: shell

sudo apt -y install python3-pyqt5
pip3 install mushroom_rl[plots]

You might need to install external dependencies first. For more information about mujoco-py
installation follow the instructions on the `project page `_

WARNING! when using conda, there may be issues with QT. You can fix them by adding the following lines to the code, replacing ```` with the path to your conda distribution and ```` with the name of the conda environment you are using:

.. code:: python

import os
os.environ['QT_QPA_PLATFORM_PLUGIN_PATH'] = '/envs//bin/platforms'

To use dm_control MushroomRL interface, install ``dm_control`` following the instruction that can
be found `here `_

Using Habitat and iGibson with MushroomRL
-----------------------------------------

`Habitat `__ and `iGibson `__
are simulation platforms providing realistic and sensory-rich learning environments.
In MushroomRL, the agent's default observations are RGB images, but RGBD,
agent sensory data, and other information can also be used.

If you have previous versions of iGibson or Habitat already installed, we recommend to remove them and do clean installs.

iGibson Installation
^^^^^^^^^^^^^^^^^^^^
Follow the `official guide `__ and install its
`assets `__ and
`datasets `__.

For ``/mushroom-rl/examples/igibson_dqn.py`` you need to run

.. code:: shell

python -m igibson.utils.assets_utils --download_assets
python -m igibson.utils.assets_utils --download_demo_data
python -m igibson.utils.assets_utils --download_ig_dataset

You can also use `third party datasets `__.

The scene details are defined in a YAML file, that needs to be passed to the agent.
See ``/igibson/test/test_house.YAML`` for an example.

Habitat Installation
^^^^^^^^^^^^^^^^^^^^
Follow the `official guide `__
and do a **full install** with `habitat_baselines`.
Then you can download interactive datasets following
`this `__ and
`this `__.
If you need to download other datasets, you can use
`this utility `__.

Basic Usage of Habitat
^^^^^^^^^^^^^^^^^^^^^^
When you create a ``Habitat`` environment, you need to pass a wrapper name and two
YAML files: ``Habitat(wrapper, config_file, base_config_file)``.

* The wrapper has to be among the ones defined in ``/mushroom-rl/environments/habitat_env.py``,
and takes care of converting actions and observations in a gym-like format. If your task / robot requires it,
you may need to define new wrappers.

* The YAML files define every detail: the Habitat environment, the scene, the
sensors available to the robot, the rewards, the action discretization, and any
additional information you may need. The second YAML file is optional, and
overwrites whatever was already defined in the first YAML.

If you use YAMLs from ``habitat-lab``, check if they define a YAML for
``BASE_TASK_CONFIG_PATH``. If they do, you need to pass it as ``base_config_file`` to
``Habitat()``. ``habitat-lab`` YAMLs, in fact, use relative paths, and calling them
from outside its root folder will cause errors.

* If you use a dataset, be sure that the path defined in the YAML file is correct,
especially if you use relative paths. ``habitat-lab`` YAMLs use relative paths, so
be careful with that. By default, the path defined in the YAML file will be
relative to where you launched the python code. If your `data` folder is
somewhere else, you may also create a symbolic link.

Rearrange Task Example
^^^^^^^^^^^^^^^^^^^^^^
* Download the ReplicaCAD datasets (``--data-path data`` downloads them in the folder
from where you are launching your code)

.. code:: shell

python -m habitat_sim.utils.datasets_download --uids replica_cad_dataset --data-path data

* For this task we use ``/habitat_baselines/config/rearrange/rl_pick.yaml``.
This YAML defines ``BASE_TASK_CONFIG_PATH: configs/tasks/rearrange/pick.yaml``,
and since this is a relative path we need to overwrite it by passing its absolute path
as ``base_config_file`` argument to ``Habitat()``.

* Then, ``pick.yaml`` defines the dataset to be used with respect to ````.
If you have not used ``--data-path`` argument with the previous download command,
the ReplicaCAD datasets is now in ``/data`` and you need to
make a link to it

.. code:: shell

ln -s /data/ /mushroom-rl/examples/habitat

* Finally, you can launch ``python habitat_rearrange_sac.py``.

Navigation Task Example
^^^^^^^^^^^^^^^^^^^^^^^
* Download and extract Replica scenes

WARNING! The dataset is very large!

.. code:: shell

sudo apt-get install pigz
git clone https://github.com/facebookresearch/Replica-Dataset.git
cd Replica-Dataset
./download.sh replica-path

* For this task we only use the custom YAML file ``pointnav_apartment-0.yaml``.

* ``DATA_PATH: "replica_{split}_apartment-0.json.gz"`` defines the JSON file with
some scene details, such as the agent's initial position and orientation.
The ``{split}`` value is defined in the ``SPLIT`` key.

If you want to try new positions, you can sample some from the set of the scene's navigable points.
After initializing a ``habitat`` environment, for example ``mdp = Habitat(...)``,
run ``mdp.env._env._sim.sample_navigable_point()``.

* ``SCENES_DIR: "Replica-Dataset/replica-path/apartment_0"`` defines the scene.
As said before, this path is relative to where you launch the script, thus we need to link the Replica folder.
If you launch ``habitat_nav_dqn.py`` from its example folder, run

.. code:: shell

ln -s /Replica-Dataset/ /mushroom-rl/examples/habitat

* Finally, you can launch ``python habitat_nav_dqn.py``.

Editable Installation
---------------------

You can also perform a local editable installation by using:

.. code:: shell

pip install --no-use-pep517 -e .

To install also optional dependencies:

.. code:: shell

pip install --no-use-pep517 -e .[all]

How to set and run and experiment
=================================
To run experiments, MushroomRL requires a script file that provides the necessary information
for the experiment. Follow the scripts in the "examples" folder to have an idea
of how an experiment can be run.

For instance, to run a quick experiment with one of the provided example scripts, run:

.. code:: shell

python3 examples/car_on_hill_fqi.py

Cite MushroomRL
===============
If you are using MushroomRL for your scientific publications, please cite:

.. code:: bibtex

@article{JMLR:v22:18-056,
author = {Carlo D'Eramo and Davide Tateo and Andrea Bonarini and Marcello Restelli and Jan Peters},
title = {MushroomRL: Simplifying Reinforcement Learning Research},
journal = {Journal of Machine Learning Research},
year = {2021},
volume = {22},
number = {131},
pages = {1-5},
url = {http://jmlr.org/papers/v22/18-056.html}
}

How to contact us
=================
For any question, drop an e-mail at [email protected].

Follow us on Twitter `@Mushroom_RL `_!