https://github.com/iandanforth/smaction
Softmax Action Detection Visualization
https://github.com/iandanforth/smaction
machine-learning reinforcement-learning visualization
Last synced: about 2 months ago
JSON representation
Softmax Action Detection Visualization
- Host: GitHub
- URL: https://github.com/iandanforth/smaction
- Owner: iandanforth
- Created: 2018-07-30T20:16:31.000Z (almost 8 years ago)
- Default Branch: master
- Last Pushed: 2018-07-30T22:54:47.000Z (almost 8 years ago)
- Last Synced: 2025-03-17T04:11:53.850Z (over 1 year ago)
- Topics: machine-learning, reinforcement-learning, visualization
- Language: JavaScript
- Homepage: https://iandanforth.github.io/smaction/
- Size: 20.6 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Softmax Action Detection Visualization
[Visualization Demo](https://iandanforth.github.io/smaction/)
[Description of Softmax Action Selection](http://www.incompleteideas.net/book/ebook/node17.html)
The impact of temperature (tau) in the softmax equation on the probability of an action being selected may not be immediately obvious.
This visualization is a simple way to see that impact.
### Things to try
- Set temperature = 1
- Set the value of 'a' near the value of 'b'. Notice how small changes in value in this regime have large impacts.
- Set temperature to 1000 and try again.
- Try to fully recover the equiprobable action selection policy.