https://github.com/ianfab/chess-analysis

Evaluate quality of play metrics for chess
https://github.com/ianfab/chess-analysis

acpl chess epd pandas pgn python python-chess stockfish

Last synced: 3 months ago
JSON representation

Evaluate quality of play metrics for chess

Host: GitHub
URL: https://github.com/ianfab/chess-analysis
Owner: ianfab
License: agpl-3.0
Created: 2022-12-18T17:00:57.000Z (almost 3 years ago)
Default Branch: main
Last Pushed: 2023-02-02T22:35:50.000Z (over 2 years ago)
Last Synced: 2025-04-28T11:50:41.902Z (5 months ago)
Topics: acpl, chess, epd, pandas, pgn, python, python-chess, stockfish
Language: Python
Homepage:
Size: 23.4 KB
Stars: 8
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

# Chess analysis

This project analyses chess games using Stockfish and python-chess, and aggregates statistics such as ACPL and related metrics using pandas.

This can be either used to analyze the quality of play depending on various properties such as Elo, player, etc., or to evaluate the metrics themselves against objective quality of play indicators such as Elo and result.

## Process
1. `pgn2epd.py` generates an EPD file from a given PGN.
2. `analyze.py` analyzes positions from an EPD file and annotates them.
3. `stats.py` aggregates statistics from an annotated EPD file.

## Setup
The scripts require python3 as well as the dependencies from the `requirements.txt`. Install them using
```
pip3 install -r requirements.txt
```
In order to ensure that best and player moves are evaluated within the same search, a [custom version of stockfish](https://github.com/ianfab/Stockfish/tree/ensuremove) and [python-chess](https://github.com/ianfab/python-chess/tree/ensuremove) are used that support this feature.

## Output
The main focus of this project is to quantify quality of play using various metrics comparable to but different from average centipawn loss (ACPL). Metrics can have different strengths and weaknesses depending on their design, e.g.:
* Weighting: Metrics such as (uncapped) ACPL can give extremely large weight to single moves compared to the rest of a game, which can skew results.
* Bias: Using summative metrics can lead to a strong correlation with game length, which is undesirable.
* Exploitability: Using averaging can make metrics susceptible to biases by long sequences of moves not affecting the result, e.g., playing on in dead drawn or losing positions.

Some ways to attempt to make metrics more robust are:
* Capping values/differences can limit the influence of a single move.
* Using expectation values (EV) and their differences (expected loss, EL) instead of raw centipawn (loss) values transforming to a limited range mitigates the weighting problem.
* Normalizing by the potential for loss, e.g., dividing by the maximum possible loss, leads to more equal weighting of decisions.

Some abbreviations used in the names of metrics:
* base metric
* CP/CPL: centipawn / centipawn loss
* EV/EL: expectation value / expected loss
* transformation
* C: capped
* SF: stockfish win-rate model
* L: lichess win-rate model
* aggregation
* T: total
* A: average
* N: normalized
* EW: equal weighted

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/ianfab/chess-analysis

Awesome Lists containing this project

README