https://github.com/titu1994/keras-adabound

Keras implementation of AdaBound
https://github.com/titu1994/keras-adabound

deep-learning keras optimizer

Last synced: 3 months ago
JSON representation

Keras implementation of AdaBound

Host: GitHub
URL: https://github.com/titu1994/keras-adabound
Owner: titu1994
License: mit
Created: 2019-02-27T05:18:04.000Z (over 6 years ago)
Default Branch: master
Last Pushed: 2019-11-04T01:02:07.000Z (over 5 years ago)
Last Synced: 2025-02-27T18:27:00.109Z (4 months ago)
Topics: deep-learning, keras, optimizer
Language: Python
Homepage:
Size: 3.5 MB
Stars: 130
Watchers: 6
Forks: 35
Open Issues: 5
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

        # AdaBound for Keras

Keras port of [AdaBound Optimizer for PyTorch](https://github.com/Luolc/AdaBound), from the paper [Adaptive Gradient Methods with Dynamic Bound of Learning Rate.](https://openreview.net/forum?id=Bkg3g2R9FX)

## Usage

Add the `adabound.py` script to your project, and import it. Can be a dropin replacement for `Adam` Optimizer. 

Also supports `AMSBound` variant of the above, equivalent to `AMSGrad` from Adam.

```python

from adabound import AdaBound

optm = AdaBound(lr=1e-03,

                final_lr=0.1,

                gamma=1e-03,

                weight_decay=0.,

                amsbound=False)

```

## Results

With a wide ResNet 34 and horizontal flips data augmentation, and 100 epochs of training with batchsize 128, it hits 92.16% (called v1).

Weights are available inside the [Releases tab](https://github.com/titu1994/keras-adabound/releases/tag/0.1)

#### NOTE

 - The smaller ResNet 20 models have been removed as they did not perform as expected and were depending on a flaw during the initial implementation. The ResNet 32 shows the actual performance of this optimizer.

> With a small ResNet 20 and width + height data + horizontal flips data augmentation, and 100 epochs of training with batchsize 1024, it hits 89.5% (called v1).

> On a small ResNet 20 with only width and height data augmentations, with batchsize 1024 trained for 100 epochs, the model gets close to 86% on the test set (called v3 below).

### Train Set Accuracy



### Train Set Loss



### Test Set Accuracy



### Test Set Loss



# Requirements

- Keras 2.2.4+ & Tensorflow 1.12+ (Only supports TF backend for now).

- Numpy

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/titu1994/keras-adabound

Awesome Lists containing this project

README