https://github.com/davidmarne/learningalgorithms

Q-Learning and Value Iteration Learning algorithms used to learn the optimal path around a race track

Host: GitHub
URL: https://github.com/davidmarne/learningalgorithms
Owner: davidmarne
Created: 2014-02-10T00:34:42.000Z
Default Branch: master
Last Pushed: 2014-02-10T00:44:30.000Z
Last Synced: 2025-01-11T00:28:00.283Z
Language: Java
Homepage:
Size: 273 KB
Stars: 0
Watchers: 3
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: readme.rtf

README

File Descriptions:

/data/
All files in the data folder are example outputs. Each one shows every step the agent takes during a given trial. The file names describe the parameters of the test. Each name starts with the track the test was run on (e.g. rtrack), followed by either a 90 or a 95 (signifying the learning rate) for Value Iteration tests, or by _###### (the number of trials run before this test) for Q-learning tests. The final _# gives the trial number of the test.

/src/reinforcement/learning

Action: an object that holds two integer values, one for the agent's acceleration in each direction.

q2: an object that contains all of the tables needed to store values, along with the functions that search those tables.

QLearningAgent: implements the Q-learning algorithm.

ReinforcementLearning: contains the main method.

State: an object consisting of an x position, a y position, an x velocity, and a y velocity.

Track: holds the track that is read into the program at the beginning of main.

ValueIteration: implements the Value Iteration algorithm.
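
To make the roles of State, Action, and the Q-value table more concrete, here is a minimal sketch of how they could fit together in a tabular Q-learning update. It is not the repository's code. The class QLearningSketch, the StateAction key, the HashMap standing in for the q2 tables, the alpha and gamma parameters, and the transition rule in step (acceleration is added to velocity, then velocity to position) are all assumptions made for illustration. The records require Java 16 or later.

```java
import java.util.HashMap;
import java.util.Map;

public class QLearningSketch {
    // Mirrors the README's State: x/y position and x/y velocity.
    record State(int x, int y, int vx, int vy) {}

    // Mirrors the README's Action: one integer acceleration per direction.
    record Action(int ax, int ay) {}

    // Hypothetical key type pairing a state with an action.
    record StateAction(State state, Action action) {}

    // Hypothetical stand-in for the repository's q2 tables.
    private final Map<StateAction, Double> qTable = new HashMap<>();
    private final double alpha; // learning rate (assumed parameter)
    private final double gamma; // discount factor (assumed parameter)

    QLearningSketch(double alpha, double gamma) {
        this.alpha = alpha;
        this.gamma = gamma;
    }

    // Q-value lookup; unseen state-action pairs default to 0.
    double q(State s, Action a) {
        return qTable.getOrDefault(new StateAction(s, a), 0.0);
    }

    // Largest Q-value over the candidate actions (assumed non-empty) in state s.
    double maxQ(State s, Action[] actions) {
        double best = Double.NEGATIVE_INFINITY;
        for (Action a : actions) {
            best = Math.max(best, q(s, a));
        }
        return best;
    }

    // Standard Q-learning update:
    // Q(s,a) += alpha * (reward + gamma * max over a' of Q(s',a') - Q(s,a)).
    void update(State s, Action a, double reward, State next, Action[] actions) {
        double target = reward + gamma * maxQ(next, actions);
        double old = q(s, a);
        qTable.put(new StateAction(s, a), old + alpha * (target - old));
    }

    // Assumed racetrack-style transition: acceleration changes velocity,
    // and the new velocity changes position. Walls and crashes are ignored.
    static State step(State s, Action a) {
        int vx = s.vx() + a.ax();
        int vy = s.vy() + a.ay();
        return new State(s.x() + vx, s.y() + vy, vx, vy);
    }

    public static void main(String[] args) {
        Action[] actions = { new Action(0, 0), new Action(1, 0), new Action(0, 1) };
        QLearningSketch agent = new QLearningSketch(0.5, 0.9);
        State start = new State(0, 0, 0, 0);
        Action accelerate = actions[1];
        State next = step(start, accelerate);
        agent.update(start, accelerate, -1.0, next, actions);
        System.out.println(next + " Q=" + agent.q(start, accelerate));
    }
}
```

In this sketch a per-step reward of -1 is used so that shorter paths score higher. The reward scheme actually used by QLearningAgent is not described in the README, so treat that value as a placeholder.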
