An open API service indexing awesome lists of open source software.

https://github.com/oiricaud/markov-decision-process-ai

Implement decision process for Monte-Carlo, Value Iteration & Q-Learning
https://github.com/oiricaud/markov-decision-process-ai

ai decision markov process

Last synced: 2 months ago
JSON representation

Implement decision process for Monte-Carlo, Value Iteration & Q-Learning

Awesome Lists containing this project

README

        

# Markov-Decision-Process: Artificial Intelligence

Objective
======
To expirement with some of the basic algorithms for solving MDPs on a simple domain.

Groups: You may optionally work in groups of 2 students.

Doomain: The domain is based on a simple MDP originally designed by Rich Sutton at the University of Alberta. The example describes a Markov Decision Porcess that models the life of a student and the decisions one must make to both have a good time and remain in good academic standing.

States
======
R = Rested
T = Tired
D = Homework Done
U = Homework Undone
8p = eight o'clock pm

Actions
======
P = Party
R = Rest
S = Study
any means any action has the same effect

*note: not all actions are possible in all states*

Red numbers are rewards

Green numbers are transition probabilities (all those not labeled are probability 1.0)

The gray rectangle denotes a terminal state.

See below for the diagram of the MDP.

![alt tag](Screenshots/Diagram.png "Diagram")