Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/nileshsah/reinforcement-learning-flappybird
In-browser reinforcement learning for flappy bird 🐦
https://github.com/nileshsah/reinforcement-learning-flappybird
flappy-bird javascript qlearning reinforcement-learning
Last synced: 6 days ago
JSON representation
In-browser reinforcement learning for flappy bird 🐦
- Host: GitHub
- URL: https://github.com/nileshsah/reinforcement-learning-flappybird
- Owner: nileshsah
- License: mit
- Created: 2017-08-13T20:50:31.000Z (over 7 years ago)
- Default Branch: master
- Last Pushed: 2018-09-23T10:01:48.000Z (about 6 years ago)
- Last Synced: 2024-08-01T21:40:17.062Z (3 months ago)
- Topics: flappy-bird, javascript, qlearning, reinforcement-learning
- Language: JavaScript
- Homepage: https://nileshsah.github.io/reinforcement-learning-flappybird/
- Size: 118 KB
- Stars: 40
- Watchers: 4
- Forks: 9
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
Reinforcement Learning for Flappy Bird in JS
===================A project aimed to explain reinforcement learning in the most simplistic way ever possible by training a _32px by 32px_ game of flappy bird using Q-learning through a script written purely in JavaScript.
The script [`js/brain.js`](js/brain.js) is where the learning logic resides and has been documented heavily to explain the baseline Q-learning algorithm from scratch and how it can be applied in a real-time scenario.
With everything written solely in JS, the game can be trained and tested right inside our browser with no external dependencies at all. You can witness how the bird learns to play the game in real-time by visiting the link: [`http://nileshsah.github.io/reinforcement-learning-flappybird/`](http://nileshsah.github.io/reinforcement-learning-flappybird/) ¯\\_(ツ)_/¯
In an ideal scenario, the bird learns to operate upon a static non-volatile environment in mere 25 trials of the game while for a randomized environment it might take up to 1000+ trials.
### Further Reading
---`[1]` http://people.revoledu.com/kardi/tutorial/ReinforcementLearning/
`[2]` https://medium.com/emergent-future/simple-reinforcement-learning-with-tensorflow-part-0-q-learning-with-tables-and-neural-networks-d195264329d0
`[3]` https://www.cs.toronto.edu/~vmnih/docs/dqn.pdf