# gym-lattice

[![Build Status](https://travis-ci.org/ljvmiranda921/gym-lattice.svg?branch=master)](https://travis-ci.org/ljvmiranda921/gym-lattice)
![python 3.4+](https://img.shields.io/badge/python-3.4+-blue.svg)
[![DOI](https://zenodo.org/badge/127895338.svg)](https://zenodo.org/badge/latestdoi/127895338)

An HP 2D Lattice Environment with a Gym-like API for the protein folding
problem

This is a Python library that formulates Lau and Dill's (1989)
hydrophobic-polar (HP) two-dimensional lattice model as a reinforcement
learning problem. It follows OpenAI Gym's API, making it easy to plug in
reinforcement learning agents. Only a two-dimensional square lattice is
implemented for now; support for other lattice structures may be added in
the future.

Screenshots from `render()`:

![](/assets/demo1.png)
![](/assets/demo2.png)
![](/assets/demo3.png)

## Dependencies

- numpy==1.14.2
- gym==0.10.4
- six==1.11.0
- pytest==3.2.1 *(dev)*
- setuptools==39.0.1 *(dev)*

## Installation

This package is only compatible with Python 3.4 and above. To install this
package, please follow the instructions below:

1. Install [OpenAI Gym](https://gym.openai.com/docs/#installation) and its dependencies.
2. Install the package itself:

```
git clone https://github.com/ljvmiranda921/gym-lattice.git
cd gym-lattice
pip install -e .
```
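
To verify the install, here's a quick sanity check (a minimal sketch; it
reuses the sequence and the step API shown in the Basic Usage section below):

```python
from gym_lattice.envs import Lattice2DEnv

env = Lattice2DEnv('HHPHH')            # sequence from the example below
env.reset()
obs, reward, done, info = env.step(0)  # one 'left' action (see Actions below)
env.render()                           # draws the lattice, as in the screenshots
```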

## Environment

### Objective

Given a sequence of *H* and *P* molecules, find the configuration with the
highest number of adjacent **H-H** pairs; the number of **H-H** pairs you
create determines your base score.
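
To make the scoring concrete, here is a sketch of how such a contact count
can be computed (illustrative only; the environment scores episodes
internally). It assumes the fold is an ordered list of `((row, col), molecule)`
tuples, and follows the standard HP-model convention of excluding pairs that
are consecutive in the chain, since those are always adjacent:

```python
def count_hh_contacts(chain):
    """Count lattice-adjacent H-H pairs that are not consecutive in the chain."""
    contacts = 0
    for i, ((ri, ci), mi) in enumerate(chain):
        for j in range(i + 2, len(chain)):   # skip self and sequence neighbors
            (rj, cj), mj = chain[j]
            if mi == 'H' and mj == 'H' and abs(ri - rj) + abs(ci - cj) == 1:
                contacts += 1
    return contacts

# 'HHPHH' folded into a U-shape brings the first and fourth molecules
# together, giving one H-H contact
fold = [((0, 0), 'H'), ((0, 1), 'H'), ((1, 1), 'P'), ((1, 0), 'H'), ((2, 0), 'H')]
print(count_hh_contacts(fold))  # 1
```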

### Folding Rules

As with most "games", there are rules that must be observed when folding
proteins. We pattern ours after [Lau and Dill's (1989) lattice statistical
mechanics](https://pubs.acs.org/doi/abs/10.1021/ma00200a030) for protein
conformation. For our purposes, here it is in its simplest form (a short
sketch of rules 2-4 follows the list):

1. You can only perform the following actions: left, down, up, or right.
2. You can only put a molecule adjacent to the previous one you've placed.
3. Assigning a molecule to an occupied space will incur a penalty.
4. Trapping yourself and running out of moves will result in a heavy deduction.
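
In code, rules 2-4 boil down to a neighborhood check (an illustrative
sketch, not the environment's internals): a new molecule may only occupy a
free cell next to the last one placed; choosing an occupied cell is a
collision, and having no free neighbors at all means the chain is trapped.

```python
def valid_moves(last_pos, occupied):
    """Return the free cells adjacent to the last placed molecule (rule 2)."""
    r, c = last_pos
    neighbors = {'L': (r, c - 1), 'D': (r + 1, c), 'U': (r - 1, c), 'R': (r, c + 1)}
    # cells already in `occupied` would be collisions (rule 3)
    return {move: cell for move, cell in neighbors.items() if cell not in occupied}

occupied = {(0, 1), (1, 0), (1, 2), (2, 1)}
print(valid_moves((1, 1), occupied))  # {} -> trapped (rule 4); the episode ends
```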

## Basic Usage

Initializing the environment requires a protein sequence of type `str`.
You can also set the penalty incurred on a collision or whenever the agent
gets trapped.

Let's try this out with a random agent on the protein sequence **HHPHH**!

```python
from gym_lattice.envs import Lattice2DEnv
from gym import spaces
import numpy as np

np.random.seed(42)

seq = 'HHPHH' # Our input sequence
action_space = spaces.Discrete(4) # Choose among [0, 1, 2, 3]
env = Lattice2DEnv(seq)

for i_episode in range(5):
    env.reset()
    while not env.done:
        # Random agent samples from action space
        action = action_space.sample()
        obs, reward, done, info = env.step(action)
        env.render()
        if done:
            print("Episode finished! Reward: {} | Collisions: {} | Actions: {}".format(reward, info['collisions'], info['actions']))
            break
```

Sample output:

```
Episode finished! Reward: -2 | Collisions: 1 | Actions: ['U', 'L', 'U', 'U']
Episode finished! Reward: -2 | Collisions: 0 | Actions: ['D', 'L', 'L', 'U']
Episode finished! Reward: -2 | Collisions: 0 | Actions: ['R', 'U', 'U', 'U']
Episode finished! Reward: -3 | Collisions: 1 | Actions: ['U', 'L', 'D', 'D']
Episode finished! Reward: -2 | Collisions: 2 | Actions: ['D', 'R', 'R', 'D']
```

### Actions

Your agent can perform four possible actions: `0` (left), `1` (down), `2`
(up), and `3` (right). The numbering may seem odd at first, but just
remember that it maps to the standard vim keybindings: `h`, `j`, `k`, `l`.
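
For reference, here's that correspondence as a lookup table you might keep
in your own agent code (a hypothetical helper, not part of the package; the
row/column deltas assume rows grow downward):

```python
# action index -> (vim key, (row_delta, col_delta)); hypothetical helper
ACTION_TO_MOVE = {
    0: ('h', (0, -1)),   # left
    1: ('j', (1, 0)),    # down
    2: ('k', (-1, 0)),   # up
    3: ('l', (0, 1)),    # right
}
```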

### Observations

Observations are represented as a 2-dimensional array with `1` representing
hydrophobic molecules (H), `-1` for polar molecules (P), and `0` for free
spaces. If you wish to obtain the chain itself, you can do so by accessing
`info['state_chain']`.

An episode ends when all molecules have been added to the lattice OR when
the sequence of actions traps the chain (no more valid moves because the
surrounding space is fully occupied). Whenever a collision is detected, the
agent must choose another action.
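
For example, assuming the observation comes back as a NumPy array as
described above:

```python
import numpy as np
from gym_lattice.envs import Lattice2DEnv

env = Lattice2DEnv('HHPHH')
env.reset()
obs, reward, done, info = env.step(0)

print(np.count_nonzero(obs == 1))   # hydrophobic (H) molecules placed so far
print(np.count_nonzero(obs == -1))  # polar (P) molecules placed so far
print(info['state_chain'])          # the chain itself
```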

### Rewards

This environment computes the reward in the following manner:

```python
# Reward at timestep t
reward_t = state_reward + collision_penalty + trap_penalty
```

- The `state_reward` is the number of adjacent **H-H** pairs in the **final** state. In protein folding, this plays the role of the Gibbs free energy, i.e., the thermodynamic assumption that the stable conformation is the one that optimizes it. Its value is 0 at every intermediate timestep and is only computed at the end of the episode.
- The `collision_penalty` at timestep t accounts for collision events, i.e., whenever the agent tries to put a molecule in an already-occupied space. Its default value is -2, but this can be adjusted by setting `collision_penalty` at initialization.
- The `trap_penalty` is only computed when the agent has no moves left and is unable to finish the task. The episode still ends (so the `state_reward` is computed), but a deduction dependent on the length of the sequence is subtracted.
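
Putting the three terms together, here's a paraphrase of the logic above in
code (a sketch, not the environment's actual implementation; the exact size
of the trap deduction, and the `trap_scale` name, are assumptions):

```python
def reward_at_t(done, trapped, collided, hh_contacts, seq_length,
                collision_penalty=-2, trap_scale=0.5):
    """Sketch of: reward_t = state_reward + collision_penalty + trap_penalty."""
    state_reward = hh_contacts if done else 0            # 0 until the final state
    collision = collision_penalty if collided else 0     # default -2 per collision
    trap = -(trap_scale * seq_length) if trapped else 0  # scales with length
    return state_reward + collision + trap
```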

## Cite us!

Are you using `gym-lattice` in your paper or project? Cite us!

```
@misc{miranda2018gymlattice,
  author = {Lester James V. Miranda},
  title  = {gym-lattice: an HP 2D Lattice Environment with a
            Gym-like API for the protein folding problem},
  month  = apr,
  year   = 2018,
  doi    = {10.5281/zenodo.1214967},
  url    = {https://doi.org/10.5281/zenodo.1214967}
}
```