Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Official [ICLR] Code Repository for "Gradient Projection Memory for Continual Learning"
- Host: GitHub
- URL: https://github.com/sahagobinda/GPM
- Owner: sahagobinda
- License: MIT
- Created: 2021-03-03T19:31:07.000Z (over 3 years ago)
- Default Branch: main
- Last Pushed: 2021-06-25T04:33:01.000Z (over 3 years ago)
- Last Synced: 2024-04-06T20:37:53.556Z (7 months ago)
- Topics: computer-vision, continual-learning, deep-learning, optimization, pytorch
- Language: Python
- Homepage:
- Size: 39.1 KB
- Stars: 74
- Watchers: 3
- Forks: 17
- Open Issues: 4
- Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# GPM
Official Pytorch implementation for "**Gradient Projection Memory for Continual Learning**", **ICLR 2021 (Oral)**.[[Paper]](https://openreview.net/forum?id=3AOj0RCNC2) [[ICLR Presentation Video]](https://slideslive.com/38953615/gradient-projection-memory-for-continual-learning?ref=account-84503-popular)
## Abstract
The ability to learn continually without forgetting past tasks is a desired attribute of artificial learning systems. Existing approaches to enabling such learning in artificial neural networks usually rely on network growth, importance-based weight updates, or replay of old data from memory. In contrast, we propose a novel approach in which a neural network learns new tasks by taking gradient steps orthogonal to the gradient subspaces deemed important for past tasks. We find the bases of these subspaces by analyzing network representations (activations) after learning each task with Singular Value Decomposition (SVD) in a single-shot manner and store them in memory as Gradient Projection Memory (GPM). With qualitative and quantitative analyses, we show that such orthogonal gradient descent induces minimal to no interference with past tasks, thereby mitigating forgetting. We evaluate our algorithm on diverse image classification datasets with short and long task sequences and report better or on-par performance compared to state-of-the-art approaches.
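The two mechanisms the abstract describes are easy to sketch. Below is a minimal, illustrative PyTorch sketch of the pattern: extract new subspace bases with a single-shot SVD after each task, then project later gradients off the stored subspace. The function names (`update_memory`, `project_gradient`) and the energy-retention `threshold` criterion are assumptions for exposition, not this repository's actual API.

```python
# Illustrative sketch of GPM's two core operations (not the repo's API):
# (1) after a task, extract new subspace bases from layer activations via SVD;
# (2) while training later tasks, project gradients off the stored subspace.
from typing import Optional
import torch

def update_memory(acts: torch.Tensor, memory: Optional[torch.Tensor],
                  threshold: float = 0.97) -> torch.Tensor:
    """acts: (in_features, n_samples) activations collected after training a task.
    Returns an orthonormal basis (in_features, k) spanning directions deemed
    important so far; `threshold` is an assumed energy-retention criterion."""
    if memory is not None:
        # Keep only the part of the representation not already spanned by the
        # memory, so each task adds genuinely new directions in a single shot.
        acts = acts - memory @ (memory.T @ acts)
    U, S, _ = torch.linalg.svd(acts, full_matrices=False)
    energy = torch.cumsum(S**2, dim=0) / torch.sum(S**2)
    k = int((energy < threshold).sum().item()) + 1  # smallest k reaching threshold
    new_bases = U[:, :k]
    return new_bases if memory is None else torch.cat([memory, new_bases], dim=1)

def project_gradient(grad: torch.Tensor, memory: torch.Tensor) -> torch.Tensor:
    """grad: (out_features, in_features) weight gradient. Removing its component
    along the memorized input subspace leaves an update orthogonal to the
    directions important for past tasks: g <- g - (g M) M^T."""
    return grad - (grad @ memory) @ memory.T
```

In use, one would apply `project_gradient` to each layer's `weight.grad` before the optimizer step while training a new task, and call `update_memory` on collected activations once that task converges.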
## Authors
Gobinda Saha, Isha Garg, Kaushik Roy
## Experiments
This repository currently contains the experiments reported in the paper on Permuted MNIST, 10-split CIFAR-100, the 20-task CIFAR-100 Superclass benchmark, and 5-Datasets. All of these experiments can be run using the following command:
```bash
source run_experiments.sh
```
## Citation
```bibtex
@inproceedings{
saha2021gradient,
title={Gradient Projection Memory for Continual Learning},
author={Gobinda Saha and Isha Garg and Kaushik Roy},
booktitle={International Conference on Learning Representations},
year={2021},
url={https://openreview.net/forum?id=3AOj0RCNC2}
}
```