https://github.com/tpbarron/rlflow

A TensorFlow-based framework for learning about and experimenting with reinforcement learning algorithms
https://github.com/tpbarron/rlflow

deep-reinforcement-learning openai-gym openai-universe tensorflow

Last synced: 4 months ago
JSON representation

A TensorFlow-based framework for learning about and experimenting with reinforcement learning algorithms

Host: GitHub
URL: https://github.com/tpbarron/rlflow
Owner: tpbarron
License: mit
Created: 2016-09-17T20:38:11.000Z (almost 10 years ago)
Default Branch: master
Last Pushed: 2017-04-07T20:19:38.000Z (about 9 years ago)
Last Synced: 2025-11-27T13:06:38.322Z (7 months ago)
Topics: deep-reinforcement-learning, openai-gym, openai-universe, tensorflow
Language: Python
Homepage:
Size: 168 MB
Stars: 20
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.rst
- Contributing: docs/contributing.rst
- License: LICENSE

Awesome Lists containing this project

README

          ===============================

RLFlow

===============================

.. image:: https://img.shields.io/pypi/v/rlflow.svg

        :target: https://pypi.python.org/pypi/rlflow

.. image:: https://img.shields.io/travis/tpbarron/rlflow.svg

        :target: https://travis-ci.org/tpbarron/rlflow

.. image:: https://readthedocs.org/projects/rlflow/badge/?version=latest

        :target: https://rlflow.readthedocs.io/en/latest/?badge=latest

        :alt: Documentation Status

A framework for learning about and experimenting with reinforcement learning algorithms.

It is built on top of TensorFlow and interfaces with OpenAI gym (universe should work, too).

It aims to be as modular as possible so that new algorithms and ideas can easily be tested.

I started it to gain a better understanding of core RL algorithms and maybe it can be

useful for others as well.

Features

--------

Algorithms (future algorithms italicized):

  - MDP algorithms

      + Value iteration

      + Policy iteration

  - Temporal Difference Learning

      + SARSA

      + Deep Q-Learning

      + *Policy gradient Q-learning*

  - Gradient algorithms

      + Vanilla policy gradient

      + *Deterministic policy gradient*

      + *Natural policy gradient*

  - Gradient-Free algorithms

      + *Cross entropy method*

Function approximators (defined by TFLearn model):

  - Linear

  - Neural network

  - *RBF*

Works with any OpenAI gym environment.

Future Enhancements

-------------------

* Improved TensorBoard logging

* Improved model snapshotting to include exploration states, memories, etc.

* Any suggestions?

Fixes

------------------

* Errors / warnings on TensorFlow session save

License

------------------

* Free software: MIT license

* Documentation: https://rlflow.readthedocs.io.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/tpbarron/rlflow

Awesome Lists containing this project

README