https://github.com/epfml/sparsifiedsgd

Sparsified SGD with Memory: https://arxiv.org/abs/1809.07599

Host: GitHub
URL: https://github.com/epfml/sparsifiedsgd
Owner: epfml
License: apache-2.0
Created: 2018-05-24T15:25:14.000Z (almost 8 years ago)
Default Branch: master
Last Pushed: 2018-10-25T09:22:17.000Z (over 7 years ago)
Last Synced: 2024-03-15T21:28:04.928Z (about 2 years ago)
Language: Jupyter Notebook
Homepage:
Size: 3.28 MB
Stars: 53
Watchers: 10
Forks: 11
Open Issues: 1
Metadata Files:
- Readme: README.md
- License: LICENSE

# Sparsified SGD with Memory

Code for the experimental part of the paper [Sparsified SGD with Memory](https://arxiv.org/abs/1809.07599). It contains the code for the following experiments:

- Theoretical convergence with different sparsification operators

- Comparison with QSGD

- Multi-core experiments
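
As a rough illustration of the idea these experiments evaluate, the sketch below shows one step of SGD with top-k sparsification and an error memory, written in dependency-free Python. The names `top_k` and `mem_sgd_step` are hypothetical and are not taken from this repository; the update rule follows the description in the paper, and the actual experiment code may differ.

```python
def top_k(v, k):
    """Keep the k entries of v with the largest magnitude, zero the rest."""
    order = sorted(range(len(v)), key=lambda i: abs(v[i]), reverse=True)
    keep = set(order[:k])
    return [v[i] if i in keep else 0.0 for i in range(len(v))]


def mem_sgd_step(x, memory, grad, lr, k):
    """One sparsified SGD step with memory (illustrative sketch).

    The scaled gradient is added to the memory, only the top-k coordinates
    are applied to the iterate, and the remainder stays in the memory.
    """
    acc = [m + lr * g for m, g in zip(memory, grad)]
    update = top_k(acc, k)
    new_x = [xi - ui for xi, ui in zip(x, update)]
    new_memory = [a - u for a, u in zip(acc, update)]
    return new_x, new_memory


# Toy usage: three coordinates, only one is transmitted per step.
x, memory = [0.0, 0.0, 0.0], [0.0, 0.0, 0.0]
x, memory = mem_sgd_step(x, memory, grad=[1.0, -3.0, 2.0], lr=0.5, k=1)
```

Coordinates that are not selected are not lost: they accumulate in `memory` and can be applied in a later step.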

Use `notebooks/plots.ipynb` to visualize the results.

Please open an issue if you have questions or problems.

### Environment set up

Install [Anaconda](https://anaconda.org) and create the `sparsifedSGD` environment:

```bash
conda env create -f environment.yaml
source activate sparsifedSGD
...
source deactivate # at the end
```

For LaTeX support in plots:

```
sudo apt-get install texlive-full msttcorefonts
```

### Reproduce the results

To reproduce the results, download the datasets from [LibSVM](https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html):

```bash
mkdir data
cd data/
wget https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary/rcv1_test.binary.bz2
wget https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary/epsilon_normalized.bz2
```
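
To sanity-check a download before converting it, you can peek at the first few examples. The sketch below assumes the standard LibSVM text layout (one example per line, `label index:value ...`); the helper names `parse_libsvm_line` and `peek` are hypothetical and not part of this repository.

```python
import bz2
from itertools import islice


def parse_libsvm_line(line):
    """Parse 'label idx:val idx:val ...' into (label, {idx: val})."""
    parts = line.split()
    label = float(parts[0])
    features = {}
    for item in parts[1:]:
        index, value = item.split(":")
        features[int(index)] = float(value)
    return label, features


def peek(path, n=3):
    """Return the first n parsed examples of a bz2-compressed LibSVM file."""
    with bz2.open(path, "rt") as f:
        # Skip empty lines, then take at most n examples.
        lines = (line for line in f if line.strip())
        return [parse_libsvm_line(line) for line in islice(lines, n)]
```

For example, `peek('data/rcv1_test.binary.bz2')` reads only the beginning of the archive, so it returns quickly even for the larger dataset.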

We decompress the LibSVM files and store them in pickle format instead. This takes more space but is faster to load. You can create the files as follows:

```python
import os
import pickle

from sklearn.datasets import load_svmlight_file

# Make sure the data directory exists.
if not os.path.exists('data'):
    os.makedirs('data')

# load_svmlight_file decompresses .bz2 files on the fly.
X, y = load_svmlight_file('data/rcv1_test.binary.bz2')
# Note: the pickle is written to the current working directory.
with open('rcv1.pickle', 'wb') as f:
    pickle.dump((X, y), f)

X, y = load_svmlight_file('data/epsilon_normalized.bz2')
with open('epsilon.pickle', 'wb') as f:
    pickle.dump((X, y), f)
```
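
To check that a conversion worked, the pickle can be read back as a `(X, y)` tuple. This is a minimal sketch; `load_dataset` is a hypothetical helper and not a function from this repository.

```python
import pickle


def load_dataset(path):
    """Load a (features, labels) tuple written with pickle.dump((X, y), f)."""
    with open(path, "rb") as f:
        X, y = pickle.load(f)
    return X, y
```

For a file created as above, `X, y = load_dataset('rcv1.pickle')` should give a sparse matrix and a label array with `X.shape[0] == len(y)`. Only unpickle files you created yourself, since loading a pickle can execute arbitrary code.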

You can run the baseline:

```bash
python experiments/baselines.py ./data results/baselines
```

Run our experiments, for example:

```bash
python experiments/rcv-th.py ./data results/rcv-th
bash experiments/rcv-par.sh ./data results/rcv-par
```

Then visualize the results with the notebooks.

# Reference

If you use this code, please cite the following [paper](https://arxiv.org/abs/1809.07599):

    @inproceedings{scj2018sparseSGD,
      author = {Sebastian U. Stich and Jean-Baptiste Cordonnier and Martin Jaggi},
      title = "{Sparsified {SGD} with Memory}",
      booktitle = {NIPS 2018 - Advances in Neural Information Processing Systems},
      year = 2018,
      url = {https://arxiv.org/abs/1809.07599}
    }
