https://github.com/mfinzi/equivariant-MLP

A library for programmatically generating equivariant layers through constraint solving
https://github.com/mfinzi/equivariant-MLP

deep-learning equivariance

Last synced: 3 months ago
JSON representation

A library for programmatically generating equivariant layers through constraint solving

Host: GitHub
URL: https://github.com/mfinzi/equivariant-MLP
Owner: mfinzi
License: mit
Created: 2020-09-11T17:40:49.000Z (almost 5 years ago)
Default Branch: master
Last Pushed: 2023-05-08T00:30:03.000Z (about 2 years ago)
Last Synced: 2024-10-15T03:55:14.325Z (8 months ago)
Topics: deep-learning, equivariance
Language: Jupyter Notebook
Homepage:
Size: 19.8 MB
Stars: 254
Watchers: 9
Forks: 21
Open Issues: 5
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

awesome-jax - Equivariant MLP - Construct equivariant neural network layers. <img src="https://img.shields.io/github/stars/mfinzi/equivariant-MLP?style=social" align="center"> (Libraries / New Libraries)

README

        






# A Practical Method for Constructing Equivariant Multilayer Perceptrons for Arbitrary Matrix Groups

[![Documentation](https://readthedocs.org/projects/emlp/badge/)](https://emlp.readthedocs.io/en/latest/) | [![Paper](https://img.shields.io/badge/arXiv-2104.09459-red)](https://arxiv.org/abs/2104.09459) | [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/mfinzi/equivariant-MLP/blob/master/docs/notebooks/colabs/all.ipynb) | 

[![codecov.io](https://codecov.io/github/mfinzi/equivariant-MLP/coverage.svg)](https://codecov.io/github/mfinzi/equivariant-MLP)

| [![PyPI version](https://img.shields.io/pypi/v/emlp)](https://pypi.org/project/emlp/) 

*EMLP* is a jax library for the automated construction of equivariant layers in deep learning based on the ICML2021 paper [A Practical Method for Constructing Equivariant Multilayer Perceptrons for Arbitrary Matrix Groups](https://arxiv.org/abs/2104.09459). You can read the documentation [here](https://emlp.readthedocs.io/en/latest/).

## What EMLP is great at doing

- Computing equivariant linear layers between finite dimensional

representations. You specify the symmetry group (discrete, continuous,

non compact, complex) and the representations (tensors, irreducibles, induced representations, etc), and we will compute the basis of equivariant

maps mapping from one to the other.

- Automatic construction of full equivariant models for small data. E.g.

if your inputs and outputs (and intended features) are a small collection of elements like scalars, vectors, tensors, irreps with a total dimension less than 1000, then you will likely be able to use EMLP as a turnkey solution for making the model or atleast function as a strong baseline.

- As a tool for building larger models, but where EMLP is just one component in a larger system. For example, using EMLP as the convolution kernel in an equivariant PointConv network.

## What EMLP is not great at doing

- An efficient implementation of CNNs, Deep Sets, typical translation + rotation equivariant GCNNs, graph neural networks.

- Handling large data like images, voxel grids, medium-large graphs, point clouds.

Given the current approach, EMLP can only ever be as fast as an MLP. So if flattening the inputs into a single vector would be too large to train with an MLP, then it will also be too large to train with EMLP.

--------------------------------------------------------------------------------

# Showcasing some examples of computing equivariant bases

We provide a type system for representations. With the operators ρᵤ⊗ρᵥ, ρᵤ⊕ρᵥ, ρ* implemented as `*`,`+` and `.T` build up different representations. The basic building blocks for representations are the base vector representation `V` and tensor representations `T(p,q) = V**p*V.T**q`. 

For any given matrix group and representation formed in our type system, you can get the equivariant basis with [`rep.equivariant_basis()`](https://emlp.readthedocs.io/en/latest/package/emlp.reps.html#emlp.reps.equivariant_basis) or a matrix which projects to that subspace with [`rep.equivariant_projector()`](https://emlp.readthedocs.io/en/latest/package/emlp.reps.html#emlp.reps.equivariant_projector). 

For example to find all O(1,3) (Lorentz) equivariant linear maps from from a 4-Vector Xᶜ to a rank (2,1) tensor Mᵇᵈₐ, you can run

```python

from emlp.reps import V,T

from emlp.groups import *

G = O13()

Q = (T(1,0)>>T(2,1))(G).equivariant_basis()

```

or how about equivariant maps from one Rubik's cube to another?

```python

G = RubiksCube()

Q = (V(G)>>V(G)).equivariant_basis()

```

Using `+` and `*` you can put together composite representations (where multiple representations are concatenated together). For example lets find all equivariant linear maps from 5 node features and 2 edge features to 3 global invariants and 1 edge feature of a graph of size n=5:

```python

G=S(5)

repin = 10*T(1)+5*T(2)

repout = 3*T(0)+T(2)

Q = (repin(G)>>repout(G)).equivariant_basis()

```

From the examples above, there are many different ways of writing a representation like `10*T(1)+5*T(2)` which are all equivalent.

`10*T(1)+5*T(2)` = `10*V+5*V**2` = `5*V*(2+V)` 

You can even mix and match representations from different groups. For example with the cyclic group ℤ₃, the permutation group 𝕊₄, and the orthogonal group O(3)

```python

rep = 2*V(Z(3))*V(S(4))+V(O(3))**2

Q = (rep>>rep).equivariant_basis()

```

Outside of these tensor representations, our type system works with any finite dimensional linear representation and you can even build your own bespoke representations following the instructions [here](https://emlp.readthedocs.io/en/latest/notebooks/4new_representations.html).

You can visualize these equivariant bases with [`vis(repin,repout)`](https://emlp.readthedocs.io/en/latest/package/emlp.reps.html#emlp.reps.vis), such as with the three examples above

   

Checkout our [documentation](https://emlp.readthedocs.io/en/latest/) to see how to use our system and some worked examples.

# Simple example of using EMLP as a full equivariant model

Suppose we want to construct a Lorentz equivariant model for particle physics data that takes in the input and output 4-momentum of two particles

in a collision, as well as a some metadata about these particles like their charge, and we want to classify the output

as belonging to 3 distinct classes of collisions. Since the outputs are simple logits, they should be unchanged by

Lorentz transformation, and similarly with the charges.

```python

import emlp

from emlp.reps import T

from emlp.groups import Lorentz

import numpy as np

repin = 4*T(1)+2*T(0) # 4 four vectors and 2 scalars for the charges

repout = 3*T(0) # 3 output logits for the 3 classes of collisions

group = Lorentz()

model = emlp.nn.EMLP(repin,repout,group=group,num_layers=3,ch=384)

x = np.random.randn(32,repin(group).size()) # Create a minibatch of data

y = model(x) # Outputs the 3 class logits

```

Here we have used the default Objax EMLP, but you can also use our [PyTorch](https://emlp.readthedocs.io/en/latest/notebooks/pytorch_support.html), [Haiku](https://emlp.readthedocs.io/en/latest/notebooks/haiku_support.html), or [Flax](https://emlp.readthedocs.io/en/latest/notebooks/flax_support.html) versions of the models. To see more examples, or how to use your own representations or symmetry groups, check out the documentation.

# Installation instructions

To install as a package, run 

```bash

pip install emlp

```

To run the scripts you will instead need to clone the repo and install it locally which you can do with

```bash

git clone https://github.com/mfinzi/equivariant-MLP.git

cd equivariant-MLP

pip install -e .[EXPTS]

```

# Experimental Results from Paper

Assuming you have installed the repo locally, you can run the experiments we described in the paper. 

To train the regression models on one of the `Inertia`, `O5Synthetic`, or `ParticleInteraction` datasets found in [`emlp.datasets.py`](https://github.com/mfinzi/equivariant-MLP/blob/master/emlp/datasets.py) you can run the script [`experiments/train_regression.py`](https://github.com/mfinzi/equivariant-MLP/blob/master/experiments/train_regression.py) with command line arguments specifying the dataset, network, and symmetry group. For example to train [`EMLP`](https://emlp.readthedocs.io/en/latest/package/emlp.nn.html#emlp.nn.EMLP) with [`SO(3)`](https://emlp.readthedocs.io/en/latest/package/emlp.groups.html#emlp.groups.SO) equivariance on the `Inertia` dataset, you can run

```

python experiments/train_regression.py --dataset Inertia --network EMLP --group "SO(3)"

```

or to train the MLP baseline you can run

```

python experiments/train_regression.py --dataset Inertia --network MLP

```

Other command line arguments such as `--aug=True` for data augmentation or `--ch=512` for number of hidden units and others are available, and you can browse the options and their defaults with `python experiments/train_regression.py -h`. If no group is specified, EMLP will automatically choose the one matched to the dataset, but you can also go crazy with any of the other groups implemented in [`groups.py`](https://github.com/mfinzi/equivariant-MLP/blob/master/emlp/groups.py) provided the dimensions match the data (e.g. for the 3D inertia dataset you could do `--group=` [`"Z(3)"`](https://emlp.readthedocs.io/en/latest/package/emlp.groups.html#emlp.groups.Z) or [`"DkeR3(3)"`](https://emlp.readthedocs.io/en/latest/package/emlp.groups.html#emlp.groups.DkeR3) but not [`"Sp(2)"`](https://emlp.readthedocs.io/en/latest/package/emlp.groups.html#emlp.groups.Sp) or [`"SU(5)"`](https://emlp.readthedocs.io/en/latest/package/emlp.groups.html#emlp.groups.SU)).

For the dynamical systems modeling experiments you can use the scripts

[`experiments/neuralode.py`](https://github.com/mfinzi/equivariant-MLP/blob/master/experiments/neuralode.py) to train (equivariant) Neural ODEs and [`experiments/hnn.py`](https://github.com/mfinzi/equivariant-MLP/blob/master/experiments/hnn.py) to train (equivariant) Hamiltonian Neural Networks.

For the dynamical system task, the Neural ODE and HNN models have special names. [`EMLPode`](https://emlp.readthedocs.io/en/latest/package/emlp.nn.html#emlp.nn.EMLPode) and [`MLPode`](https://emlp.readthedocs.io/en/latest/package/emlp.nn.html#emlp.nn.MLPode) for the Neural ODEs in `neuralode.py` and [`EMLPH`](https://emlp.readthedocs.io/en/latest/package/emlp.nn.html#emlp.nn.EMLPH) and [`MLPH`](https://emlp.readthedocs.io/en/latest/package/emlp.nn.html#emlp.nn.MLPH) for the HNNs in `hnn.py`. For example,

```

python experiments/neuralode.py --network EMLPode --group="O2eR3()"

```

or 

```

python experiments/hnn.py --network EMLPH --group="DkeR3(6)"

```

These models are trained to fit a double spring dynamical system. 30s rollouts of the dataset, along with rollout error on these trajectories, and conservation of angular momentum are shown below.

   

If you find our work helpful, please cite it with

```bibtex

@article{finzi2021emlp,

  title={A Practical Method for Constructing Equivariant Multilayer Perceptrons for Arbitrary Matrix Groups},

  author={Finzi, Marc and Welling, Max and Wilson, Andrew Gordon},

  journal={Arxiv},

  year={2021}

}

```

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/mfinzi/equivariant-MLP

Awesome Lists containing this project

README