https://github.com/notthestallion/rl_models
https://github.com/notthestallion/rl_models
Last synced: about 1 year ago
JSON representation
- Host: GitHub
- URL: https://github.com/notthestallion/rl_models
- Owner: NotTheStallion
- License: apache-2.0
- Created: 2024-12-02T09:38:30.000Z (over 1 year ago)
- Default Branch: master
- Last Pushed: 2025-01-05T18:37:04.000Z (over 1 year ago)
- Last Synced: 2025-02-07T12:39:42.608Z (over 1 year ago)
- Language: Jupyter Notebook
- Size: 1.79 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# RL_models
This repository contains implementations of three reinforcement learning (RL) algorithms: Monte Carlo, SARSA, and Q-learning. These algorithms were tested on a grid world environment with two different approaches:
1. **Neutral Position Without Penalty**: In this approach, the neutral positions in the grid world do not penalize the agent. The agent is free to explore the grid without any negative reinforcement from neutral positions.
2. **Neutral Position With Penalty**: In this approach, the neutral positions impose a penalty of -1. This penalty encourages the agent to find the shortest path to the goal, as lingering in neutral positions results in a cumulative negative reward.
The goal of these experiments is to compare the performance and behavior of the RL algorithms under different conditions and to observe how the penalty influences the agent's pathfinding strategy.