https://github.com/jianzhnie/rlzero
A clean and easy implementation of MuZero, AlphaZero and Self-Play reinforcement learning algorithms for any game.
https://github.com/jianzhnie/rlzero
alpha-zero mcts multi-agent muzero reinforcement-learning self-play
Last synced: 9 months ago
JSON representation
A clean and easy implementation of MuZero, AlphaZero and Self-Play reinforcement learning algorithms for any game.
- Host: GitHub
- URL: https://github.com/jianzhnie/rlzero
- Owner: jianzhnie
- License: apache-2.0
- Created: 2023-02-09T06:58:45.000Z (over 3 years ago)
- Default Branch: main
- Last Pushed: 2024-10-15T07:27:34.000Z (over 1 year ago)
- Last Synced: 2025-04-03T07:51:24.232Z (about 1 year ago)
- Topics: alpha-zero, mcts, multi-agent, muzero, reinforcement-learning, self-play
- Language: Python
- Homepage:
- Size: 384 KB
- Stars: 14
- Watchers: 3
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# RLZero
A clean and easy implementation of MuZero, AlphaZero and Self-Play reinforcement learning algorithms for any game.
## References
### Unified Toolkits
- https://github.com/datamllab/rlcard
- https://github.com/google-deepmind/open_spiel
- https://github.com/sotetsuk/pgx.git
- https://github.com/Unity-Technologies/ml-agents
- https://github.com/opendilab/LightZero
- https://github.com/datamllab/awesome-game-ai
### Self-Play
- https://github.com/inspirai/TimeChamber
- https://github.com/njustesen/botbowl
- https://github.com/davidADSP/SIMPLE
### MCTS
https://github.com/masouduut94/MCTS-agent-python
### AlpahZero
- https://github.com/suragnair/alpha-zero-general
- https://github.com/geochri/AlphaZero_Chess
- https://github.com/junxiaosong/AlphaZero_Gomoku
- https://github.com/lowrollr/turbozero
- https://github.com/dylandjian/SuperGo
### MuZero
- https://github.com/werner-duvaud/muzero-general
- https://github.com/koulanurag/muzero-pytorch
- https://github.com/YeWR/EfficientZero.git
### DouZero
- https://github.com/kwai/DouZero.git
### Blog
- https://medium.com/@bentou.pub/
- https://github.com/BentouAI/AlphaZero-Chain-Reaction
- https://medium.com/applied-data-science/alphago-zero-explained-in-one-diagram-365f5abf67e0
- http://xtf615.com/categories/%E5%BC%BA%E5%8C%96%E5%AD%A6%E4%B9%A0/
- [How to Build Your Own MuZero Using Python (Part 1/3)](https://medium.com/applied-data-science/how-to-build-your-own-muzero-in-python-f77d5718061a)
- [How to Build Your Own MuZero Using Python (Part 2/3)](https://medium.com/applied-data-science/how-to-build-your-own-deepmind-muzero-in-python-part-2-3-f99dad7a7ad)
- [How to Build Your Own MuZero Using Python (Part 3/3)](https://medium.com/applied-data-science/how-to-build-your-own-deepmind-muzero-in-python-part-3-3-ccea6b03538b)