# MC-LSTM: Mass-Conserving LSTM
Pieter-Jan Hoedt, Frederik Kratzert, Daniel Klotz, Christina Halmich, Markus Holzleitner, Grey Nearing, Sepp Hochreiter, Günter Klambauer

MC-LSTM is an adaptation of LSTM [(Hochreiter & Schmidhuber, 1997)](#lstm) that makes it possible to enforce conservation laws in regression problems. To test the benefits of this inductive bias in practice, we conducted experiments on a) arithmetic tasks (cf. [Madsen & Johansen, 2020](#nau)), b) traffic forecasting, c) energy prediction for a pendulum (cf. [Greydanus et al., 2019](#hamiltonian)), and d) rainfall-runoff modeling ([Kratzert et al., 2019](#hydrolstm)).
### Sub-repositories
This repository contains code for the LSTM addition, traffic forecasting and pendulum experiments.
The neural arithmetic experiments were conducted on a [fork](https://github.com/hoedt/stable-nalu) of the repository from [Madsen et al. (2020)](#nau).
The experiments in hydrology were conducted using the [neuralhydrology](https://github.com/neuralhydrology/neuralhydrology) framework.

## Paper
[Openreview](https://openreview.net/forum?id=Rld-9OxQ6HU),
[Pre-print](https://arxiv.org/abs/2101.05186),
[ICML](http://proceedings.mlr.press/v139/hoedt21a.html)

### Abstract
The success of Convolutional Neural Networks (CNNs) in computer vision is mainly driven by their strong inductive bias, which is strong enough to allow CNNs to solve vision-related tasks with random weights, meaning without learning. Similarly, Long Short-Term Memory (LSTM) has a strong inductive bias towards storing information over time. However, many real-world systems are governed by conservation laws, which lead to the redistribution of particular quantities, e.g. in physical and economic systems. Our novel Mass-Conserving LSTM (MC-LSTM) adheres to these conservation laws by extending the inductive bias of LSTM to model the redistribution of those stored quantities. MC-LSTMs set a new state-of-the-art for neural arithmetic units at learning arithmetic operations, such as addition tasks, which have a strong conservation law, as the sum is constant over time. Further, MC-LSTM is applied to traffic forecasting, modeling a pendulum, and a large benchmark dataset in hydrology, where it sets a new state-of-the-art for predicting peak flows. In the hydrology example, we show that MC-LSTM states correlate with real-world processes and are therefore interpretable.
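The conservation claim in the abstract can be checked numerically: after any number of steps, the stored mass plus the released mass equals exactly the mass fed into the network. A toy check, reusing the hypothetical `mc_lstm_step` sketch from above (all sizes are arbitrary):

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
batch, n_aux, n_cells = 2, 3, 4
z_dim = n_aux + n_cells
params = (nn.Linear(z_dim, n_cells),        # input gate
          nn.Linear(z_dim, n_cells),        # output gate
          nn.Linear(z_dim, n_cells ** 2))   # redistribution matrix

c = torch.zeros(batch, n_cells)             # start with an empty system
mass_in = torch.zeros(batch, 1)
mass_out = torch.zeros(batch, 1)
with torch.no_grad():
    for _ in range(10):
        x_m = torch.rand(batch, 1)
        a = torch.randn(batch, n_aux)
        h, c = mc_lstm_step(x_m, a, c, params)
        mass_in += x_m
        mass_out += h.sum(-1, keepdim=True)

# stored + released mass equals the mass that entered the system
assert torch.allclose(c.sum(-1, keepdim=True) + mass_out, mass_in, atol=1e-5)
```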
### Citation
To cite this work, you can use the following BibTeX entry:
```bib
@inproceedings{pmlr-v139-hoedt21a,
  title     = {MC-LSTM: Mass-Conserving LSTM},
  author    = {Hoedt, Pieter-Jan and Kratzert, Frederik and Klotz, Daniel and Halmich, Christina and Holzleitner, Markus and Nearing, Grey S and Hochreiter, Sepp and Klambauer, Guenter},
  booktitle = {Proceedings of the 38th International Conference on Machine Learning},
  pages     = {4275--4286},
  year      = {2021},
  editor    = {Meila, Marina and Zhang, Tong},
  volume    = {139},
  series    = {Proceedings of Machine Learning Research},
  month     = {18--24 Jul},
  publisher = {PMLR},
  pdf       = {http://proceedings.mlr.press/v139/hoedt21a/hoedt21a.pdf},
  url       = {http://proceedings.mlr.press/v139/hoedt21a.html},
}
```
## Environment
The code in this repository (excluding the sub-repositories) should run as-is in an environment set up from `requirements.txt`.
When using `conda`, such an environment can be set up using
```
conda create -n mclstm --file requirements.txt -c pytorch
```
**if** you first remove the `autograd` entry from `requirements.txt`:
`autograd` is not available from these channels and must be installed individually with `pip`.
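For example (the environment name `mclstm` comes from the command above; the extra `pip` step is our reading of the intended workflow):

```
conda create -n mclstm --file requirements.txt -c pytorch
conda activate mclstm
pip install autograd
```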
Alternatively, you can use `-c conda-forge`, which does provide `autograd`.

## References
- <a name="hamiltonian"></a>Greydanus, S., Dzamba, M., & Yosinski, J. (2019). [Hamiltonian neural networks](https://proceedings.neurips.cc/paper/2019/hash/26cd8ecadce0d4efd6cc8a8725cbd1f8-Abstract.html). Advances in Neural Information Processing Systems, 32, 15379-15389.
- <a name="lstm"></a>Hochreiter, S., & Schmidhuber, J. (1997). [Long short-term memory](https://www.mitpressjournals.org/doi/abs/10.1162/neco.1997.9.8.1735). Neural Computation, 9(8), 1735-1780. ([pdf](https://www.bioinf.jku.at/publications/older/2604.pdf))
- <a name="hydrolstm"></a>Kratzert, F., Klotz, D., Shalev, G., Klambauer, G., Hochreiter, S., & Nearing, G. (2019). [Towards learning universal, regional, and local hydrological behaviors via machine learning applied to large-sample datasets](https://hess.copernicus.org/articles/23/5089/2019/hess-23-5089-2019.html). Hydrology and Earth System Sciences, 23(12), 5089-5110.
- <a name="nau"></a>Madsen, A., & Johansen, A. R. (2020). [Neural Arithmetic Units](https://openreview.net/forum?id=H1gNOeHKPS). In International Conference on Learning Representations.