https://github.com/hellochick/FICM

:video_game: [IJCAI'20][ICLR'19 Workshop] Flow-based Intrinsic Curiosity Module. Playing SuperMario with RL agent and FICM!
https://github.com/hellochick/FICM

curiosity-driven intrinsic-curiosity-module intrinsic-motivation optical-flow supermario

Last synced: about 1 month ago
JSON representation

:video_game: [IJCAI'20][ICLR'19 Workshop] Flow-based Intrinsic Curiosity Module. Playing SuperMario with RL agent and FICM!

Host: GitHub
URL: https://github.com/hellochick/FICM
Owner: hellochick
Created: 2019-05-01T08:27:48.000Z (about 6 years ago)
Default Branch: master
Last Pushed: 2022-12-08T05:10:39.000Z (over 2 years ago)
Last Synced: 2024-11-13T05:36:01.451Z (7 months ago)
Topics: curiosity-driven, intrinsic-curiosity-module, intrinsic-motivation, optical-flow, supermario
Language: C++
Homepage:
Size: 15.7 MB
Stars: 102
Watchers: 7
Forks: 9
Open Issues: 8
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

        #### Status: Archive. 

## Flow-based Intrinsic Curiosity Module (FICM) 



### [[Paper]](https://www.ijcai.org/Proceedings/2020/286) [[Demo Video]](https://www.youtube.com/watch?v=w-a6akKpWT0&feature=youtu.be) [[Reddit Discuss]](https://www.reddit.com/r/MachineLearning/comments/bu05ua/p_playing_supermario_bros_without_knowing_any/) 

This is a TensorFlow-based implementation for our paper "Flow-based Intrinsic Curiosity Module".

FICM is used for evaluating the novelty of observations, it generates intrinsic rewards based on the prediction errors of optical flow estimation since the rapid change part in consecutive frames usually serve as important signals. 

Without any external reward, FICM can help RL agent to play SuperMario successfully. 



This repo is largely inherited from [large-scale-curiosity](https://github.com/openai/large-scale-curiosity), and we also borrowed `correlation layer` from [flownet2_tf](https://github.com/sampepose/flownet2-tf).

-----

[Flow-based Intrinsic Curiosity Module](https://www.ijcai.org/Proceedings/2020/286)  

[Hsuan-Kung Yang*](https://hellochick.github.io/), Po-Han Chiang*, Min-Fong Hong, and Chun-Yi Lee (* equal contributions)  

ElsaLab, National Tsing Hua University  

In [IJCAI'20 main track](https://ijcai20.org/) and [ICLR'19 TARL workshop](https://tarl2019.github.io/)

### Dependencies

* Python3.5

### Installation

```bash

pip install -r requirement.txt

pip install git+https://github.com/openai/baselines.git@3301089b48c42b87b396e246ea3f56fa4bfc9678

```

#### FICM-C (Optional)

If you want to use FICM-C architecture, you'll need to compile for correlation layer additionally.

```bash

cd correlation_layer

make all

```

> Note: You might encounter some errors raised from the source codes of tensorflow, you can easily change the header file of

'cuda_kernel_helper.h', 'cuda_device_functions.h', and 'cuda_launch_config.h'

#### SuperMario (Optional)

If you want to train an agent to play SuperMario, you need to install `retro` and import `SuperMarioBros-Nes` game.

```bash

pip install retro

```

Read the official guide [here](https://retro.readthedocs.io/en/latest/getting_started.html#importing-roms)

### Example usage

#### Mario

```bash

python run.py --feat_learning flowS --env_kind mario

```

#### Atari

```bash

python run.py --feat_learning flowS --env SeaquestNoFrameskip-v4 --seed 666

```

### Citation

If you find this paper or code implementation helpful, please consider to cite:

```

@inproceedings{flowbasedcuriosity2020,

  title     = {Flow-based Intrinsic Curiosity Module},

  author    = {Hsuan-Kung Yang, Po-Han Chiang, Min-Fong Hong and Chun-Yi Lee},

  booktitle = {Proceedings of the Twenty-Ninth International Joint Conference on

               Artificial Intelligence, {IJCAI-20}},

  publisher = {International Joint Conferences on Artificial Intelligence Organization},             

  editor    = {Christian Bessiere}	

  pages     = {2065--2072},

  year      = {2020},

  month     = {7},

  note      = {Main track}

  doi       = {10.24963/ijcai.2020/286},

  url       = {https://doi.org/10.24963/ijcai.2020/286},

}

```

### Reference

    @inproceedings{largeScaleCuriosity2018,

        Author = {Burda, Yuri and Edwards, Harri and

                  Pathak, Deepak and Storkey, Amos and

                  Darrell, Trevor and Efros, Alexei A.},

        Title = {Large-Scale Study of Curiosity-Driven Learning},

        Booktitle = {arXiv:1808.04355},

        Year = {2018}

    }

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/hellochick/FICM

Awesome Lists containing this project

README