# Easy Java Reinforcement Learning Library (EJRLL)
A simple neural network library for training Deep Q-Networks (DQNs) across a variety of grid environments.

![GitHub Issues or Pull Requests](https://img.shields.io/github/issues/AlanDoesCS/RL-Pathfinding)
![GitHub commit activity](https://img.shields.io/github/commit-activity/t/AlanDoesCS/RL-Pathfinding)
![GitHub contributors](https://img.shields.io/github/contributors/AlanDoesCS/RL-Pathfinding)
![GitHub Repo stars](https://img.shields.io/github/stars/AlanDoesCS/RL-Pathfinding)
![GitHub forks](https://img.shields.io/github/forks/AlanDoesCS/RL-Pathfinding)

---

## Environments
- 2D Perlin noise
- Maze (generated using recursive backtracking; a sketch of the algorithm follows this list)
- Pseudorandom noise
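
The recursive-backtracking generator is a randomized depth-first search: carve the current cell, then recurse into unvisited neighbours in random order, knocking down the wall in between. A minimal standalone sketch of the idea (illustrative only; the library's `MazeGridEnvironment` may represent the grid differently):

```java
import java.util.*;

// Recursive-backtracking maze generator (illustrative sketch only).
public class MazeSketch {
    static boolean[][] walls;          // true = wall, false = carved passage
    static final Random rng = new Random();

    public static void main(String[] args) {
        int w = 11, h = 11;            // odd sizes leave room for walls between cells
        walls = new boolean[h][w];
        for (boolean[] row : walls) Arrays.fill(row, true);
        carve(1, 1);                   // start carving from cell (1, 1)
        for (boolean[] row : walls) {
            StringBuilder sb = new StringBuilder();
            for (boolean wall : row) sb.append(wall ? '#' : ' ');
            System.out.println(sb);
        }
    }

    // Depth-first search over cells two apart, removing the wall in between.
    static void carve(int x, int y) {
        walls[y][x] = false;
        List<int[]> dirs = new ArrayList<>(List.of(
                new int[]{2, 0}, new int[]{-2, 0}, new int[]{0, 2}, new int[]{0, -2}));
        Collections.shuffle(dirs, rng);
        for (int[] d : dirs) {
            int nx = x + d[0], ny = y + d[1];
            if (nx > 0 && nx < walls[0].length - 1 && ny > 0 && ny < walls.length - 1
                    && walls[ny][nx]) {
                walls[y + d[1] / 2][x + d[0] / 2] = false; // knock down the wall between
                carve(nx, ny);
            }
        }
    }
}
```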

---

## RL Algorithms
- DQN
- Double DQN (see the target-computation sketch below)
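
Double DQN (van Hasselt et al., linked below) reduces DQN's overestimation bias by splitting the bootstrap target across the two networks: the online network selects the greedy next action, and the target network evaluates it. A minimal sketch of just that target computation (the array-based Q-value inputs here are hypothetical, not the library's API):

```java
// Double DQN bootstrap target: y = r + gamma * Q_target(s', argmax_a Q_online(s', a)).
public class DdqnTargetSketch {
    static double ddqnTarget(double reward, boolean terminal,
                             double[] qOnlineNext, double[] qTargetNext, double gamma) {
        if (terminal) return reward;                 // no bootstrapping past terminal states
        int best = 0;                                // argmax_a Q_online(s', a)
        for (int a = 1; a < qOnlineNext.length; a++)
            if (qOnlineNext[a] > qOnlineNext[best]) best = a;
        return reward + gamma * qTargetNext[best];   // ...evaluated by the target network
    }

    public static void main(String[] args) {
        double[] qOnline = {0.2, 1.5, 0.7, 0.1};     // online net picks action 1
        double[] qTarget = {0.3, 1.1, 0.9, 0.0};     // target net scores action 1 as 1.1
        System.out.println(ddqnTarget(1.0, false, qOnline, qTarget, 0.999)); // 1 + 0.999 * 1.1
    }
}
```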

## Replay
- Replay Buffer
- Prioritized Experience Replay (see the sampling sketch below)
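
Prioritized Experience Replay (Schaul et al., linked below) replaces uniform buffer sampling with sampling proportional to priority p_i = (|TD error| + ε)^α, and corrects the resulting bias with importance-sampling weights w_i = (N · P(i))^(−β). A toy linear-scan sampler showing both formulas (real buffers, possibly including this library's, typically use a sum-tree for O(log n) sampling):

```java
import java.util.Random;

// Toy proportional sampler for Prioritized Experience Replay (illustration only).
public class PerSketch {
    public static void main(String[] args) {
        double[] tdErrors = {0.5, 2.0, 0.1, 1.0};   // |TD error| per stored transition
        double alpha = 0.6, beta = 0.4;             // prioritisation / IS-correction exponents
        Random rng = new Random(42);

        // p_i = (|delta_i| + eps)^alpha, normalised into sampling probabilities P(i)
        double[] probs = new double[tdErrors.length];
        double sum = 0;
        for (int i = 0; i < tdErrors.length; i++) {
            probs[i] = Math.pow(Math.abs(tdErrors[i]) + 1e-6, alpha);
            sum += probs[i];
        }
        for (int i = 0; i < probs.length; i++) probs[i] /= sum;

        // Inverse-CDF sampling, then the importance-sampling weight for the pick
        double u = rng.nextDouble(), cdf = 0;
        int idx = probs.length - 1;
        for (int i = 0; i < probs.length; i++) {
            cdf += probs[i];
            if (u <= cdf) { idx = i; break; }
        }
        double weight = Math.pow(probs.length * probs[idx], -beta); // w_i = (N * P(i))^-beta
        System.out.printf("sampled index %d, IS weight %.3f%n", idx, weight);
    }
}
```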

## Optimizers
- Adam (see the update-rule sketch below)
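
Adam (Kingma & Ba, linked below) keeps exponential moving averages of the gradient and its square, applies bias correction, and scales the step by their ratio. A single-parameter sketch of the update rule (not the library's optimizer class):

```java
// One Adam update step for a single scalar parameter (sketch of the rule itself).
public class AdamSketch {
    double m = 0, v = 0;                           // first/second moment estimates
    int t = 0;                                     // timestep
    final double lr = 0.1, beta1 = 0.9, beta2 = 0.999, eps = 1e-8; // large lr for the toy demo

    double step(double param, double grad) {
        t++;
        m = beta1 * m + (1 - beta1) * grad;        // EMA of gradients
        v = beta2 * v + (1 - beta2) * grad * grad; // EMA of squared gradients
        double mHat = m / (1 - Math.pow(beta1, t)); // bias correction
        double vHat = v / (1 - Math.pow(beta2, t));
        return param - lr * mHat / (Math.sqrt(vHat) + eps);
    }

    public static void main(String[] args) {
        AdamSketch adam = new AdamSketch();
        double w = 5.0;                            // minimise f(w) = w^2, so grad = 2w
        for (int i = 0; i < 200; i++) w = adam.step(w, 2 * w);
        System.out.println(w);                     // settles near the minimum at 0
    }
}
```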

---

## Example usage

```java
import java.util.*;
// ...plus the EJRLL classes used below (Environment, DDQNAgentTrainer, DDQNAgent,
// MLPLayer, LeakyReLU, Linear, InvalidTypeException)

public class Main {
    public static void main(String[] args) {
        // Configure the shared environment settings before building anything
        Environment.setStateType(Environment.StateType.PositionVectorOnly);
        Environment.setDimensions(10, 10);
        Environment.setActionSpace(4);

        DDQNAgentTrainer trainer;
        try {
            trainer = new DDQNAgentTrainer(Set.of(
                    EmptyGridEnvironment.class,
                    RandomGridEnvironment.class,
                    PerlinGridEnvironment.class,
                    MazeGridEnvironment.class));
        } catch (InvalidTypeException e) {
            e.printStackTrace();
            return;
        }

        List layers = new ArrayList<>();
        LeakyReLU leakyRelu = new LeakyReLU(0.1f);
        float lambda = 0.0001f; // L2 regularisation strength

        // Input and output widths come from the environment configured above
        layers.add(new MLPLayer(Environment.getStateSpace(), 64, leakyRelu, 0, lambda));
        layers.add(new MLPLayer(64, 64, leakyRelu, 0, lambda));
        layers.add(new MLPLayer(64, Environment.getActionSpace(), new Linear(), 0, lambda));

        DDQNAgent ddqnAgent = new DDQNAgent(
                Environment.getActionSpace(), // action space
                layers,                       // network layers
                1,                            // initial epsilon
                0.9999,                       // epsilon decay
                0.01,                         // epsilon minimum
                0.999,                        // gamma (discount factor)
                0.0001,                       // learning rate
                0.99995,                      // learning rate decay
                0.000001f,                    // learning rate minimum
                0.005                         // tau
        );

        trainer.trainAgent(
                ddqnAgent, // agent
                600000,    // number of episodes
                500,       // save period
                1,         // visualiser update period
                "plot", "ease", "axis_ticks", "show_path", "verbose" // display options (varargs)
        );
    }
}
```
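
A note on the last constructor argument: tau = 0.005 is, by convention in DDQN implementations, the soft (Polyak) target-network update rate, θ′ ← τθ + (1 − τ)θ′, applied after each learning step instead of copying the online weights wholesale every N steps. A sketch over flat weight arrays (an assumption about what this parameter does here, not the library's confirmed internals):

```java
// Polyak / soft target update: theta_target <- tau*theta_online + (1-tau)*theta_target
public class SoftUpdateSketch {
    static void softUpdate(double[] target, double[] online, double tau) {
        for (int i = 0; i < target.length; i++)
            target[i] = tau * online[i] + (1 - tau) * target[i];
    }

    public static void main(String[] args) {
        double[] target = {0.0, 0.0};
        double[] online = {1.0, -1.0};
        softUpdate(target, online, 0.005);                // tau from the example above
        System.out.println(target[0] + ", " + target[1]); // 0.005, -0.005
    }
}
```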

## Papers & Resources Used
This list is incomplete; I will try to add all of the sources I used eventually.

### Papers
- O'Shea & Nash, *An Introduction to Convolutional Neural Networks*: https://arxiv.org/pdf/1511.08458
- Schaul et al., *Prioritized Experience Replay*: https://arxiv.org/pdf/1511.05952v4
- Kingma & Ba, *Adam: A Method for Stochastic Optimization*: https://arxiv.org/pdf/1412.6980
- Ioffe & Szegedy, *Batch Normalization*: https://arxiv.org/pdf/1502.03167
- https://proceedings.neurips.cc/paper/2020/file/32fcc8cfe1fa4c77b5c58dafd36d1a98-Paper.pdf
- https://doi.org/10.1109/ACCESS.2019.2941229
- van Hasselt et al., *Deep Reinforcement Learning with Double Q-learning*: https://arxiv.org/pdf/1509.06461
- Mnih et al., *Playing Atari with Deep Reinforcement Learning*: https://arxiv.org/pdf/1312.5602

### Videos
- https://www.youtube.com/watch?v=z9hJzduHToc
- https://www.youtube.com/watch?v=s2coXdufOzE
- https://youtu.be/ECV5yeigZIg