Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

https://github.com/alexis-jacq/LOLA_DiCE

Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)
https://github.com/alexis-jacq/LOLA_DiCE

Last synced: 2 months ago
JSON representation

Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)

Lists

README

        

# LOLA_DiCE
Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)

## Quick results:

### Results on IPD using DiCE
[lr_in=0.3, lr_out=0.2, lr_v=0.1, batch_size=128, len_rollout=150, use_baseline=True]
![ipd_with_dice](images/ipd_dice.png)

### Results on IPD using DiCE and opponent modelling
[lr_in=0.3, lr_out=0.2, lr_v=0.1, batch_size=128, len_rollout=150, use_baseline=True]
![ipd_with_dice](images/ipd_dice_om.png)
(It seems that 2 lookaheads is the most stable model with this set of hyperparameters)

### Results on IPD using exact gradients
[lr_in=0.3, lr_out=0.2, batch_size=128, len_rollout=150]
![ipd_with_exact_grads](images/ipd_exact.png)

### Results on IPD using exact gradients and opponent modelling
[lr_in=0.3, lr_out=0.2, batch_size=128, len_rollout=150]
![ipd_with_exact_grads](images/ipd_exact_om.png)

## Authors version:

The authors of the paper have their own version (Tensorflow) available here: https://github.com/alshedivat/lola