Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/lucidrains/molecule-attention-transformer
Pytorch reimplementation of Molecule Attention Transformer, which uses a transformer to tackle the graph-like structure of molecules
- Host: GitHub
- URL: https://github.com/lucidrains/molecule-attention-transformer
- Owner: lucidrains
- License: MIT
- Created: 2020-11-30T19:06:47.000Z (almost 4 years ago)
- Default Branch: main
- Last Pushed: 2020-12-02T05:25:38.000Z (almost 4 years ago)
- Last Synced: 2024-10-15T00:16:44.059Z (about 1 month ago)
- Topics: artificial-intelligence, deep-learning, molecules, transformers
- Language: Python
- Homepage:
- Size: 90.8 KB
- Stars: 58
- Watchers: 4
- Forks: 7
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
## Molecule Attention Transformer - Pytorch (wip)
Pytorch reimplementation of Molecule Attention Transformer, which uses a slightly modified transformer to tackle the graph-like structure of molecules. The repository is also meant to be educational, to understand the limitations of transformers for processing graphs (or perhaps lack thereof).
Update: I reread the paper and the results do look convincing. However, I do not like that it still takes hyperparameter sweeps over the relative contributions of the distance, adjacency, and self-attention matrices to achieve good results. There must be a more hands-off way.
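For intuition, here is a rough, hypothetical sketch (not the repository's actual code) of the idea: a single attention head whose pairwise weights are a lambda-weighted blend of standard self-attention, a kernelized inter-atomic distance matrix, and the molecular graph's adjacency matrix. The function name and the exponential distance kernel below are illustrative assumptions; the relative weights `La`, `Ld`, `Lg` are the contributions that the update above says currently require sweeping.

```python
import torch
import torch.nn.functional as F

def molecule_attention(q, k, v, adjacency_mat, distance_mat, La = 1., Ld = 0.5, Lg = 0.5):
    # standard scaled dot-product attention weights
    sim = torch.einsum('b i d, b j d -> b i j', q, k) / (q.shape[-1] ** 0.5)
    attn = F.softmax(sim, dim = -1)

    # one possible distance kernel: decay exponentially with inter-atomic distance
    dist = torch.exp(-distance_mat)

    # blend the three sources of pairwise structure, then apply to the values
    weights = La * attn + Ld * dist + Lg * adjacency_mat
    return torch.einsum('b i j, b j d -> b i d', weights, v)
```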
## Install
```bash
$ pip install molecule-attention-transformer
```

## Usage
```python
import torch
from molecule_attention_transformer import MAT

model = MAT(
dim_in = 26,
model_dim = 512,
dim_out = 1,
depth = 6,
Lg = 0.5, # lambda (g)raph - weight for adjacency matrix
Ld = 0.5, # lambda (d)istance - weight for distance matrix
La = 1, # lambda (a)ttention - weight for usual self-attention
dist_kernel_fn = 'exp' # distance kernel fn - either 'exp' or 'softmax'
)

atoms = torch.randn(2, 100, 26)
mask = torch.ones(2, 100).bool()
adjacency_mat = torch.empty(2, 100, 100).random_(2).float()
distance_mat = torch.randn(2, 100, 100)

out = model(
atoms,
mask = mask,
adjacency_mat = adjacency_mat,
distance_mat = distance_mat
) # (2, 1)
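
# --- Not part of the original README: a hypothetical sketch of the kind of
# --- hyperparameter sweep mentioned in the update above, trying a few relative
# --- weightings of the adjacency (Lg) and distance (Ld) terms while keeping
# --- the self-attention weight (La) fixed. The regression target and MSE loss
# --- are illustrative assumptions.
target = torch.randn(2, 1)

for Lg, Ld in [(0.25, 0.25), (0.5, 0.5), (1.0, 0.5)]:
    model = MAT(dim_in = 26, model_dim = 512, dim_out = 1, depth = 6, Lg = Lg, Ld = Ld, La = 1)
    pred = model(atoms, mask = mask, adjacency_mat = adjacency_mat, distance_mat = distance_mat)
    loss = torch.nn.functional.mse_loss(pred, target)
    print(f'Lg={Lg} Ld={Ld} loss={loss.item():.4f}')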
```

## Citations
```bibtex
@misc{maziarka2020molecule,
title={Molecule Attention Transformer},
author={Łukasz Maziarka and Tomasz Danel and Sławomir Mucha and Krzysztof Rataj and Jacek Tabor and Stanisław Jastrzębski},
year={2020},
eprint={2002.08264},
archivePrefix={arXiv},
primaryClass={cs.LG}
}
```