https://github.com/anxingle/alphapig

Implementation of the AlphaZero algorithm for playing the simple board game Gomoku
https://github.com/anxingle/alphapig

alphazero gomoku

Last synced: 3 months ago
JSON representation

Implementation of the AlphaZero algorithm for playing the simple board game Gomoku

Host: GitHub
URL: https://github.com/anxingle/alphapig
Owner: anxingle
Created: 2018-07-21T13:00:46.000Z (about 7 years ago)
Default Branch: master
Last Pushed: 2023-05-22T21:11:54.000Z (over 2 years ago)
Last Synced: 2025-04-08T06:35:04.125Z (6 months ago)
Topics: alphazero, gomoku
Language: Python
Homepage: https://anxingle.github.io
Size: 13.3 MB
Stars: 15
Watchers: 1
Forks: 5
Open Issues: 8
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

## AlphaPig
使用AlphaZero算法在五子棋上的实现。五子棋比围棋简单，训练起来也稍微能够简单一点，所以选择了五子棋来作为AlphaZero的复现。

参考:
1. [junxiaosong/AlphaZero_Gomoku](https://github.com/junxiaosong/AlphaZero_Gomoku)
2. [starimpact/AlphaZero_Gomoku](https://github.com/starimpact/AlphaZero_Gomoku)
3. [yonghenglh6/GobangServer](https://github.com/yonghenglh6/GobangServer)

### 快速启动

```（确保本机安装docker/nvidia-docker）
docker-compose up # (默认使用0号显卡，根据run_ai.sh 进行适配性修改)
```

对 run_ai.sh 进行修改后，修改docker-compose.yml：

```
version: "2.3"

services:
gobang_server:
image: gospelslave/alphapig:v0.1.11
entrypoint: /bin/bash run_server.sh
privileged: true
environment:
- TZ=Asia/Shanghai
volumes:
- $PWD/run_server.sh:/workspace/run_server.sh # 这里修改后的映射
ports:
- 8888:8888
restart: always
logging:
driver: json-file
options:
max-size: "10M"
max-file: "5"

gobang_ai:
image: gospelslave/alphapig:v0.1.11
entrypoint: /bin/bash run_ai.sh
privileged: true
environment:
- TZ=Asia/Shanghai
volumes:
- $PWD/run_ai.sh:/workspace/run_ai.sh # 这里是修改后的映射
runtime: nvidia
restart: always
logging:
driver: json-file
options:
max-size: "10M"
max-file: "5"
```

### 上手入门
+ cd AlphaPig/sgf_data/

+ ```
sh ./download.sh
```

将会下载并解压SGF棋谱数据，解压后应该实在sgf_data/目录下，目录结构AlphaPig/sgf_data/*.sgf。

也可自行下载SGF棋谱数据，并自行处理。

+ 直接运行根目录下的start_train.sh开始训练

+ 或者进入 train_mxnet.py 修改网络结构等参数，其中conf下的.yaml为训练定义的一些参数，可修改为适合自己的相关参数。

+ SGF格式详解

```
FF[4] SGF格式的版本号，4是最新
SZ[15] 棋盘大小，这是15x15
PW[Pig]白棋棋手名称
WR[2a]白棋棋手段位
PB[stupid]黑棋棋手名称
BR[2c]黑棋棋手段位
DT[2018-07-06]棋谱生成日期
PC[CA]棋局所在位置
KM[6.5]贴目数量
RE[B+Resign]B+是黑胜，W+是白胜，Resign是对方GG的
CA[utf-8]棋局编码
TM[0]限时情况，0为无限时
OT[]读秒规则
;B[pp];W[dd];B[pc];W[dq] …… 棋谱下棋顺序
```

+ 如果需要和自己的AI对弈，可以进入evaluate目录，运行

```
python ChessServer.py --port 8888
```

既可与自己的AI进行对弈，或者与yixin对弈。详细说明请参阅evaluate目录下的ReadMe。

对弈例子:

+ 下载我训练的一些模型（还有很多bug）

```
cd AlphaPig/logs
sh ./download_model.sh
```

## 致谢

+ 源工程请移步[junxiaosong/AlphaZero_Gomoku](https://github.com/junxiaosong/AlphaZero_Gomoku) ，特别感谢大V的很多issue和指导。

+ 特别感谢格灵深瞳提供的很多训练帮助（课程与训练资源上提供了很大支持），没有格灵深瞳的这些帮助，训练起来毫无头绪。

+ 感谢[Uloud](https://www.ucloud.cn/) 提供的P40 AI-train服务，1256小时/实例的训练，验证了不少想法。而且最后还免单了，中间没少打扰技术支持。特别感谢他们。

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/anxingle/alphapig

Awesome Lists containing this project

README