https://github.com/janosh/torch-mnf

Multiplicative Normalizing Flows in PyTorch.
https://github.com/janosh/torch-mnf

generative-modeling normalizing-flows pytorch

Last synced: 8 months ago
JSON representation

Multiplicative Normalizing Flows in PyTorch.

Host: GitHub
URL: https://github.com/janosh/torch-mnf
Owner: janosh
License: mit
Created: 2020-02-10T15:10:10.000Z (about 6 years ago)
Default Branch: main
Last Pushed: 2025-07-07T16:31:17.000Z (8 months ago)
Last Synced: 2025-07-07T17:51:17.383Z (8 months ago)
Topics: generative-modeling, normalizing-flows, pytorch
Language: Python
Homepage:
Size: 1.38 MB
Stars: 24
Watchers: 6
Forks: 2
Open Issues: 1
Metadata Files:
- Readme: readme.md
- License: license

Awesome Lists containing this project

README

          # Torch MNF

[![Tests](https://github.com/janosh/torch-mnf/actions/workflows/test.yml/badge.svg)](https://github.com/janosh/torch-mnf/actions/workflows/test.yml)

[![pre-commit.ci status](https://results.pre-commit.ci/badge/github/janosh/torch-mnf/master.svg)](https://results.pre-commit.ci/latest/github/janosh/torch-mnf/master)

![GitHub Repo Size](https://img.shields.io/github/repo-size/janosh/torch-mnf?label=Repo+Size)

PyTorch implementation of Multiplicative Normalizing Flows [[1]](#mnf-bnn).

With flow implementations courtesy of [Andrej Karpathy](https://github.com/karpathy/pytorch-normalizing-flows).

## Files of Interest

New here? Check out the example notebooks:

- [`examples/half_moons.ipynb`](examples/half_moons.ipynb)

- [`examples/mnf_mnist.ipynb`](examples/mnf_mnist.ipynb)

Interested in the implementation? See

- [`models/mnf_lenet.py`](torch_mnf/models/mnf_lenet.py)

- [`flows/*.py`](torch_mnf/flows)

- [`layers/*.py`](torch_mnf/layers)

## MNF Results

### MNIST

Rotating an MNIST 9 by 180° in steps of 20°, the MNF LeNet (left) does not produce overconfident predictions on out-of-sample data unlike the regular LeNet (right), indicating it captures its own uncertainty well. The violin distributions in the top plot were generated by the MNF LeNet predicting each image 500 times. The predictions run in parallel so this is fast. Both models trained for 3 epochs on MNIST with Adam. The MNF model has 696,950 trainable parameters, the regular LeNet 258,582.

| MNF Lenet                                                  | Regular LeNet                                                 |

| ---------------------------------------------------------- | ------------------------------------------------------------- |

| ![RNVP Point Flow](assets/mnf/mnist/rot-9-mnf-lenet-s.png) | ![RNVP x to 2 and z to x](assets/mnf/mnist/rot-9-lenet-s.png) |

## Flow Results

### Real Non-Volume Preserving Flows

Flow: `[RNVP, RNVP, RNVP, RNVP, RNVP, RNVP, RNVP, RNVP, RNVP]`

Final loss: 0.47

| Trained for 1400 steps with Adam (`lr=1e-4, wd=1e-5`) | Parameters: 22,914                                       |

| ----------------------------------------------------- | -------------------------------------------------------- |

| ![RNVP Point Flow](assets/rnvp/moons/point-flow.png)  | ![RNVP x to 2 and z to x](assets/rnvp/moons/z2x+x2z.png) |

### Masked Autoregressive Flow

Flow: `[MAF, MAF, MAF, MAF, MAF, MAF, MAF, MAF, MAF]`

Final loss: 36.21

| Trained for 1400 steps with Adam (`lr=1e-4, wd=1e-5`) | Parameters: 12,348                                     |

| ----------------------------------------------------- | ------------------------------------------------------ |

| ![MAF Point Flow](assets/maf/moons/point-flow.png)    | ![MAF x to 2 and z to x](assets/maf/moons/z2x+x2z.png) |

### Neural Spline Flow Autoregressive Layer

Flow: `[ActNormFlow, Glow, NSF_AR, ActNormFlow, Glow, NSF_AR, ActNormFlow, Glow, NSF_AR]`

Final loss: 19.13

| Trained for 1400 steps with Adam (`lr=1e-4, wd=1e-5`)    | Parameters: 3,012                                            |

| -------------------------------------------------------- | ------------------------------------------------------------ |

| ![NSF-AR Point Flow](assets/nsf_ar/moons/point-flow.png) | ![NSF-AR x to 2 and z to x](assets/nsf_ar/moons/z2x+x2z.png) |

### Neural Spline Flow Coupling Layer

Flow: `[ActNormFlow, Glow, NSF_CL, ActNormFlow, Glow, NSF_CL, ActNormFlow, Glow, NSF_CL]`

Final loss: 6.06

| Trained for 1400 steps with Adam (`lr=1e-4, wd=1e-5`)    | Parameters: 5,844                                            |

| -------------------------------------------------------- | ------------------------------------------------------------ |

| ![NSF-CL Point Flow](assets/nsf_cl/moons/point-flow.png) | ![NSF-CL x to 2 and z to x](assets/nsf_cl/moons/z2x+x2z.png) |

## References

1.  **MNF**: _Multiplicative Normalizing Flows for Variational Bayesian Neural Networks_ | Christos Louizos, Max Welling (Mar 2017) | [1703.01961](https://arxiv.org/abs/1703.01961)

2.  **VI-NF**: _Variational Inference with Normalizing Flows_ | Danilo Rezende, Shakir Mohamed (May 2015) | [1505.05770](https://arxiv.org/abs/1505.05770)

3.  **MADE**: _Masked Autoencoder for Distribution Estimation_ | Mathieu Germain, Karol Gregor, Iain Murray, Hugo Larochelle (Jun 2015) | [1502.03509](https://arxiv.org/abs/1502.03509)

4.  **NICE**: _Non-linear Independent Components Estimation_ | Laurent Dinh, David Krueger, Yoshua Bengio (Oct 2014) | [1410.8516](https://arxiv.org/abs/1410.8516)

5.  **RNVP**: _Density estimation using Real NVP_ | Laurent Dinh, Jascha Sohl-Dickstein, Samy Bengio (May 2016) | [1605.08803](https://arxiv.org/abs/1605.08803)

6.  **MAF**: _Masked Autoregressive Flow for Density Estimation_ | George Papamakarios, Theo Pavlakou, Iain Murray (Jun 2018) | [1705.07057](https://arxiv.org/abs/1705.07057)

7.  **IAF**: _Improving Variational Inference with Inverse Autoregressive Flow_ | Diederik Kingma et al. (Jun 2016) | [1606.04934](https://arxiv.org/abs/1606.04934)

8.  **NSF**: _Neural Spline Flows_ | Conor Durkan, Artur Bekasov, Iain Murray, George Papamakarios (Jun 2019) | [1906.04032](https://arxiv.org/abs/1906.04032)

## Debugging Tips

A great method of checking for infinite or `NaN` gradients is

```py

for name, param in model.named_parameters():

    print(name, torch.isfinite(param.grad).all())

    print(name, torch.isnan(param.grad).any())

```

There's also [`torch.autograd.detect_anomaly()`](https://pytorch.org/docs/stable/autograd.html#torch.autograd.detect_anomaly) used as context manager:

```py

with torch.autograd.detect_anomaly():

    x = torch.rand(10, 10, requires_grad=True)

    out = model(x)

    out.backward()

```

and [`torch.autograd.set_detect_anomaly(True)`](https://pytorch.org/docs/stable/autograd.html#torch.autograd.set_detect_anomaly). See [here](https://discuss.pytorch.org/t/87594) for an issue that used these tools.

## Requirements

[`requirements.txt`](requirements.txt) created with [`pipreqs .`](https://github.com/bndr/pipreqs). Find new dependencies manually with `pipreqs --diff requirements.txt`.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/janosh/torch-mnf

Awesome Lists containing this project

README