https://github.com/jianzhnie/rlzero

A clean and easy implementation of MuZero, AlphaZero and Self-Play reinforcement learning algorithms for any game.

Host: GitHub
URL: https://github.com/jianzhnie/rlzero
Owner: jianzhnie
License: apache-2.0
Created: 2023-02-09T06:58:45.000Z (over 3 years ago)
Default Branch: main
Last Pushed: 2024-10-15T07:27:34.000Z (over 1 year ago)
Last Synced: 2025-04-03T07:51:24.232Z (about 1 year ago)
Topics: alpha-zero, mcts, multi-agent, muzero, reinforcement-learning, self-play
Language: Python
Homepage:
Size: 384 KB
Stars: 14
Watchers: 3
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

README

# RLZero

A clean and easy implementation of MuZero, AlphaZero and Self-Play reinforcement learning algorithms for any game.

## References

### Unified Toolkits

- https://github.com/datamllab/rlcard
- https://github.com/google-deepmind/open_spiel
- https://github.com/sotetsuk/pgx.git
- https://github.com/Unity-Technologies/ml-agents
- https://github.com/opendilab/LightZero
- https://github.com/datamllab/awesome-game-ai

### Self-Play

- https://github.com/inspirai/TimeChamber
- https://github.com/njustesen/botbowl
- https://github.com/davidADSP/SIMPLE

### MCTS

- https://github.com/masouduut94/MCTS-agent-python

### AlphaZero

- https://github.com/suragnair/alpha-zero-general
- https://github.com/geochri/AlphaZero_Chess
- https://github.com/junxiaosong/AlphaZero_Gomoku
- https://github.com/lowrollr/turbozero
- https://github.com/dylandjian/SuperGo

### MuZero

- https://github.com/werner-duvaud/muzero-general
- https://github.com/koulanurag/muzero-pytorch
- https://github.com/YeWR/EfficientZero.git

### DouZero

- https://github.com/kwai/DouZero.git

### Blog

- https://medium.com/@bentou.pub/

- https://github.com/BentouAI/AlphaZero-Chain-Reaction

- https://medium.com/applied-data-science/alphago-zero-explained-in-one-diagram-365f5abf67e0

- http://xtf615.com/categories/%E5%BC%BA%E5%8C%96%E5%AD%A6%E4%B9%A0/

- [How to Build Your Own MuZero Using Python (Part 1/3)](https://medium.com/applied-data-science/how-to-build-your-own-muzero-in-python-f77d5718061a)

- [How to Build Your Own MuZero Using Python (Part 2/3)](https://medium.com/applied-data-science/how-to-build-your-own-deepmind-muzero-in-python-part-2-3-f99dad7a7ad)

- [How to Build Your Own MuZero Using Python (Part 3/3)](https://medium.com/applied-data-science/how-to-build-your-own-deepmind-muzero-in-python-part-3-3-ccea6b03538b)
