https://github.com/qzed/irl-maxent

Maximum Entropy and Maximum Causal Entropy Inverse Reinforcement Learning Implementation in Python
https://github.com/qzed/irl-maxent

inverse-reinforcement-learning machine-learning maximum-entropy

Last synced: over 1 year ago
JSON representation

Maximum Entropy and Maximum Causal Entropy Inverse Reinforcement Learning Implementation in Python

Host: GitHub
URL: https://github.com/qzed/irl-maxent
Owner: qzed
License: mit
Created: 2019-05-06T23:57:32.000Z (about 7 years ago)
Default Branch: master
Last Pushed: 2024-04-21T12:57:08.000Z (about 2 years ago)
Last Synced: 2025-03-29T05:09:22.773Z (over 1 year ago)
Topics: inverse-reinforcement-learning, machine-learning, maximum-entropy
Language: Jupyter Notebook
Homepage:
Size: 1.34 MB
Stars: 272
Watchers: 4
Forks: 61
Open Issues: 3
Metadata Files:
- Readme: README.md
- License: LICENSE
- Citation: CITATION.cff

Awesome Lists containing this project

awesome-causal-ai - irl-maxent - Maximum Entropy and Maximum Causal Entropy Inverse Reinforcement Learning Implementation in Python *(Jupyter Notebook)* (🚀 GitHub Repositories / 🌟 **Real-World Magic**)

README

# Maximum Entropy Inverse Reinforcement Learning

This is a python implementation of the Maximum Entropy Inverse Reinforcement Learning (MaxEnt IRL) algorithm based on the similarly named paper by Ziebart et al. and the Maximum Causal Entropy Inverse Reinforcement Learning (MaxCausalEnt IRL) algorithm based on his PhD thesis.
Project for the Advanced Seminar in Imitation Learning, summer term 2019, University of Stuttgart.

This implementation is available as python package at https://pypi.org/project/irl-maxent/ and can be installed via `pip install irl-maxent`.
You may also want to have a look at the accompanying [presentation][presentation].

For an example demonstrating how the Maximum (non-causal) Entropy IRL algorithm works, see the corresponding Jupyter notebook ([`notebooks/maxent.ipynb`][nb-viewer]).
Note that the provided python files (`src/`) contain a slightly more optimized implementation of the algorithms.

To run a demonstration without the notebook, you can directly run `./src/example.py`.
Also have a look at this file on how to use the provided framework.
The framework contains:
- Two GridWorld implementations for demonstration (`irl_maxent.gridworld`)
- The algorithm implementations (`irl_maxent.maxent`)
- A gradient based optimizer framework (`irl_maxent.optimizer`)
- Plotting helper functions (`irl_maxent.plot`)
- A MDP solver framework, i.e. value iteration and corresponding utilities (`irl_maxent.solver`)
- A trajectory/trajectory generation framework (`irl_maxent.trajectory`)

This project solely relies on the following dependencies: `numpy`, `matplotlib`, `itertools`, and `pytest`.

[nb-viewer]: https://nbviewer.jupyter.org/github/qzed/irl-maxent/blob/master/notebooks/maxent.ipynb
[presentation]: https://nbviewer.jupyter.org/github/qzed/irl-maxent/blob/master/Presentation.pdf

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/qzed/irl-maxent

Awesome Lists containing this project

README