https://github.com/mattjmattj/php-rl

A basic reinforcement learning library in PHP
https://github.com/mattjmattj/php-rl

artificial-intelligence ddqn double-dqn dqn machine-learning neural-network prioritized-experience-replay qlearning reinforcement-learning rl sarsa

Last synced: 2 months ago
JSON representation

A basic reinforcement learning library in PHP

Host: GitHub
URL: https://github.com/mattjmattj/php-rl
Owner: mattjmattj
License: mit
Created: 2020-04-27T12:09:29.000Z (about 6 years ago)
Default Branch: master
Last Pushed: 2020-05-26T22:34:44.000Z (about 6 years ago)
Last Synced: 2025-10-29T22:53:49.254Z (8 months ago)
Topics: artificial-intelligence, ddqn, double-dqn, dqn, machine-learning, neural-network, prioritized-experience-replay, qlearning, reinforcement-learning, rl, sarsa
Language: PHP
Homepage:
Size: 50.8 KB
Stars: 4
Watchers: 3
Forks: 2
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

          # php-rl

A reinforcement learning library in PHP

## Disclaimer

This library is basically me reimplementing well-known RL algorithms in order to better

understand them.

## Algorithms

### Value-based algorithms

#### SARSA

A standard state-action-reward-state-action implementation based on a Q table

#### Q-Learning

Based on a Q-table implemented as a "max" policy SARSA.

Current API provides a basic epsilon-greedy agent.

See the Tic-Tac-Toe example for some details

#### Deep Q-Learning

Current API provides a basic epsilon-greedy agent, with separated target model, as described

in Mnih, V., Kavukcuoglu, K., Silver, D. _et al_. Human-level control through deep reinforcement learning. _Nature_ **518**, 529–533 (2015). https://doi.org/10.1038/nature14236.

User can choose between a Vanilla DQN ou a Double DQN, (see Hado van Hasselt, Arthur Guez, David Silver. Deep Reinforcement Learning with Double Q-learning. [arXiv:1509.06461](https://arxiv.org/abs/1509.06461) [cs.LG])

Experience replay is available as 2 distinct implementations:

- random minibatch

- prioritized experience replay (Tom Schaul, John Quan, Ioannis Antonoglou, David Silver - Prioritized Experience Replay, [arXiv:1511.05952](https://arxiv.org/abs/1511.05952) [cs.LG], 2015)

### Policy-based algorithms

TODO

## TODO

- ~~Q-learning~~

- ~~SARSA~~

- ~~DQN~~

- ~~Double DQN~~

- ~~[DQN] prioritized experience replay~~

- Vanilla Policy Gradient (REINFORCE)

- Actor-Critic

- real documentation :)

- more examples

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/mattjmattj/php-rl

Awesome Lists containing this project

README