[!["Open Issues"](https://img.shields.io/github/issues-raw/micahcarroll/uniMASK.svg)](https://github.com/micahcarroll/uniMASK)
[![arXiv](https://img.shields.io/badge/arXiv-2211.10869-32CD32.svg)](https://arxiv.org/abs/2211.10869)

# Introduction

uniMASK is a generalization of BERT models with flexible abstractions for performing inference on subportions of
sequences. Masking and prediction can occur both at the token level (as in a traditional transformer) and on
subportions of tokens.

You can find the full paper [here](https://arxiv.org/abs/2211.10869).
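
To make the distinction concrete, here is a minimal PyTorch sketch (not the uniMASK API; the tensor shapes and factor layout are purely illustrative) of masking whole tokens versus masking only one factor within each token:

```python
import torch

# Hypothetical example: a trajectory of 4 timesteps, where each token is
# made of two factors -- a 2-dim state and a 1-dim action.
seq = torch.randn(4, 3)  # rows = tokens, columns = [state_x, state_y, action]

# Token-level masking, as in a traditional transformer: hide whole timesteps.
token_mask = torch.tensor([1, 0, 1, 1], dtype=torch.bool)  # False = masked
masked_tokens = seq * token_mask.unsqueeze(-1)

# Factor-level masking: hide only the action portion of every token,
# e.g., to predict actions from states.
factor_mask = torch.tensor([1, 1, 0], dtype=torch.bool)
masked_factors = seq * factor_mask
```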

# Getting Started
To install uniMASK, run:

```bash
conda create -n uniMASK python=3.7
conda activate uniMASK
pip install -e .
```

uniMASK requires [D4RL](https://github.com/Farama-Foundation/D4RL).
You may install it as detailed [in the D4RL repo](https://github.com/Farama-Foundation/D4RL#setup), e.g., by running:
```bash
pip install git+https://github.com/Farama-Foundation/d4rl@master#egg=d4rl
```
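
To check that D4RL installed correctly, you can try instantiating one of its environments; the snippet below uses `maze2d-medium-v1`, one of the standard D4RL Maze2D tasks:
```bash
python -c "import gym, d4rl; env = gym.make('maze2d-medium-v1'); print(env.observation_space)"
```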

For CUDA support, you may need to reinstall PyTorch with a CUDA-enabled build, for example:
```bash
pip install torch --extra-index-url https://download.pytorch.org/whl/cu116
```
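
You can then confirm that PyTorch sees the GPU:
```bash
python -c "import torch; print(torch.cuda.is_available())"
```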

To verify that the installation was successful, run `pytest`.

# Reproducing results from the paper
### Minigrid heatmap (figure 7)
Note: reproducing all runs can take a long time, so we recommend parallelizing them.
In each script, the first line (a comment) shows how to use GNU Parallel toward this end; a generic sketch follows the list below.

1. Run the commands found in `minigrid_repro.sh`.
2. Fine-tune the pre-trained models generated in the previous step by running the commands in `minigrid_ft_repro.sh`.
3. Generate the heatmaps from these runs by running `minigrid_heatmap.sh` (no parallelization here).
4. The resulting heatmap can be found in `uniMASK/scripts/`.
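
For reference, the GNU Parallel pattern mentioned above looks roughly like this (a sketch only; the exact invocation is given in each script's first-line comment, and `-j 4` is an arbitrary job count):
```bash
# Run each line of the script as an independent job, 4 at a time.
parallel -j 4 < minigrid_repro.sh
```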

### Maze2D results

To reproduce the Maze2D table in the paper:

1. Run the `wandb` sweeps `medium_maze_sweep_all.yaml` and `medium_maze_sweep_DT.yaml` (see the sketch below for the standard sweep/agent flow).
2. Run the fine-tuning runs: `maze_ft_final.sh`.
3. Parse the results with `Parse wandb Maze Experiments.ipynb`.
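
If you have not run `wandb` sweeps before, the standard flow is to register the sweep config and then launch one or more agents (the sweep ID is printed by the first command; the YAML paths here assume you are in the directory containing them):
```bash
wandb sweep medium_maze_sweep_all.yaml       # prints a sweep ID
wandb agent <entity>/<project>/<sweep_id>    # repeat on more machines to parallelize
```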

# File structure
- `scripts/train.py`: the main script for running uniMASK -- start here.
- `data/`: where rollouts (`datasets`) and trained models (`transformer_runs`) are stored.
- `envs/`: data-handling and evaluation for each supported environment (currently Minigrid and Maze2D).
- `scripts/`: scripts for reproducing results from the paper and for running uniMASK in general.
- `batches.py`: contains all data-pipeline processing classes (`FactorSeq`, `TokenSeq`, `FullTokenSeq`, `Batch`, `SubBatch`).
- `sequences.py`:
- `trainer.py`: the Trainer class handles the training loop for all models.
- `transformer.py`: contains the transformer model class itself.
- `transformer_train.py`: interface and config settings for training a transformer through the `Trainer` class.
- `utils.py`: miscellaneous utilities: math functions, GPU handling, profiling, etc.
- `transformer_eval.py`: interface for getting predictions from the transformer (currently empty).