https://github.com/zjeffer/connect4-deep-rl

Deep reinforcement learning algorithm to solve Connect 4, based on AlphaZero
https://github.com/zjeffer/connect4-deep-rl

ai alphazero deep-learning machine-learning reinforcement-learning

Last synced: 3 months ago
JSON representation

Deep reinforcement learning algorithm to solve Connect 4, based on AlphaZero

Host: GitHub
URL: https://github.com/zjeffer/connect4-deep-rl
Owner: zjeffer
License: gpl-3.0
Created: 2022-08-05T17:57:19.000Z (about 3 years ago)
Default Branch: main
Last Pushed: 2024-06-29T11:11:17.000Z (over 1 year ago)
Last Synced: 2025-02-13T09:46:11.882Z (8 months ago)
Topics: ai, alphazero, deep-learning, machine-learning, reinforcement-learning
Language: C++
Homepage:
Size: 969 KB
Stars: 1
Watchers: 2
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

          # AlphaZero on a Connect 4 environment

A deep reinforcement learning algorithm that plays Connect 4, based on AlphaZero. I'm creating this because [my chess algorithm](https://github.com/zjeffer/chess-deep-rl-cpp) learns too slowly, and I wanted to know if the problem is the amount of data needed, or my implementation of the algorithm itself.

See https://zjeffer.github.io/connect4-deep-rl/ for Doxygen documentation.

## TODO

* [X] Connect 4 environment

* [X] MCTS algorithm

* [X] Neural network

* [X] AlphaZero self-play

* [X] Argument parsing

* [ ] Load settings from file

* [X] Unit tests:

  * [X] Horizontal win

  * [X] Vertical win

  * [X] Diagonal win

  * [X] Easy puzzle

  * [ ] Harder puzzle

  * [ ] ...?

* [X] Save played moves to memory, and memory to file

* [X] AlphaZero training

* [ ] AlphaZero evaluation

* [ ] Automatic pipeline for selfplay, training and evaluation

* [ ] Play against computer

* [ ] GUI?

[![Hits](https://hits.seeyoufarm.com/api/count/incr/badge.svg?url=https%3A%2F%2Fgithub.com%2Fzjeffer%2Fconnect4-deep-rl&count_bg=%235E81AC&title_bg=%23555555&icon=&icon_color=%235E81AC&title=hits&edge_flat=false)](https://hits.seeyoufarm.com)

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/zjeffer/connect4-deep-rl

Awesome Lists containing this project

README