https://github.com/devspaceship/madepro

A minimal Rust library for solving finite deterministic Markov decision processes
https://github.com/devspaceship/madepro

markov-decision-processes mdp q-learning reinforcement-learning rust sarsa temporal-difference

Last synced: 2 months ago
JSON representation

A minimal Rust library for solving finite deterministic Markov decision processes

Host: GitHub
URL: https://github.com/devspaceship/madepro
Owner: devspaceship
License: mit
Created: 2023-12-23T13:19:56.000Z (over 2 years ago)
Default Branch: main
Last Pushed: 2024-09-10T00:51:03.000Z (almost 2 years ago)
Last Synced: 2025-09-19T13:13:02.851Z (10 months ago)
Topics: markov-decision-processes, mdp, q-learning, reinforcement-learning, rust, sarsa, temporal-difference
Language: Rust
Homepage: https://crates.io/crates/madepro
Size: 64.5 KB
Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- License: LICENSE

Awesome Lists containing this project

README

# madepro

A minimal Rust library crate for solving finite deterministic
Markov decision processes (MDPs).

The name of the library comes from the first syllable of each word.

## Limitations

This library is still in development and is not ready for production use.
It only implements a few algorithms and one environment.
It is also limited to deterministic MDPs.
Originally, I only created this library in order to refactor my blog post
on [Markov Decision Processes](https://devspaceship.com/posts/gridworld).
However, I decided to probably continue working on it when I have the time.
In the future I would first like to make the library more generic
and then add more algorithms and environments.

## Features

The library currently supports the following algorithms:

- [Policy Iteration](https://en.wikipedia.org/wiki/Markov_decision_process#Policy_iteration)
- [Value Iteration](https://en.wikipedia.org/wiki/Markov_decision_process#Value_iteration)
- [SARSA](https://en.wikipedia.org/wiki/State-Action-Reward-State-Action)
- [Q-Learning](https://en.wikipedia.org/wiki/Q-learning)

## Roadmap

The following features are planned for the future:

- Stochastic MDPs
- Stochastic policies
- State-dependent action spaces
- More algorithms

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/devspaceship/madepro

Awesome Lists containing this project

README