# [Remove that Square Root: A New Efficient Scale-Invariant Version of AdaGrad](https://arxiv.org/abs/2403.02648)

This repository contains the code to reproduce the experiments reported in the paper:
> [Remove that Square Root: A New Efficient Scale-Invariant Version of AdaGrad](https://arxiv.org/pdf/2403.02648.pdf)

In this work, we introduce a novel optimization algorithm called KATE, a scale-invariant adaptation of AdaGrad. Below we provide a screenshot of KATE's pseudocode from the paper.

![KATE pseudocode](image/KATE_pseudocode.png)
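For orientation, here is a minimal NumPy sketch of a standard diagonal AdaGrad step; the square root in its denominator is exactly what KATE removes. Note this sketch shows AdaGrad for reference, not KATE itself; see the pseudocode above for KATE's exact update.

```python
import numpy as np

def adagrad_step(x, grad, accum, lr=0.1, eps=1e-8):
    """One diagonal AdaGrad step (shown for reference, not KATE's update).

    Accumulates squared gradients per coordinate and divides the step
    by their square root -- the square root that KATE removes.
    """
    accum = accum + grad ** 2                    # b_k^2 = b_{k-1}^2 + g_k^2
    x = x - lr * grad / (np.sqrt(accum) + eps)   # the sqrt KATE removes
    return x, accum
```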

In this repository we compare the performance of KATE with well-known algorithms such as AdaGrad and ADAM on logistic regression, image classification, and text classification problems. If you use this code for your research, please cite the paper as follows:

```
@article{choudhury2024remove,
  title={Remove that Square Root: A New Efficient Scale-Invariant Version of AdaGrad},
  author={Choudhury, Sayantan and Tupitsa, Nazarii and Loizou, Nicolas and Horvath, Samuel and Takac, Martin and Gorbunov, Eduard},
  journal={arXiv preprint arXiv:2403.02648},
  year={2024}
}
```

## Table of Contents

* [Requirements](#requirements)
* [Logistic Regression](#logistic-regression)
  * [Scale Invariance](#scale-invariance)
  * [Robustness of KATE](#robustness-of-kate)
  * [Performance of KATE on Real Data](#performance-of-kate-on-real-data)
* [Training of Neural Network](#training-of-neural-network)
  * [Image Classification](#image-classification)
  * [Text Classification](#text-classification)

## Requirements
The Anaconda environment can be created with the following command:
```setup
conda env create -f environment.yml
```
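After creating it, activate the environment before running any of the scripts below. The environment's name is defined in environment.yml; `<env-name>` here is just a placeholder:

```setup
conda activate <env-name>
```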

## Logistic Regression

### Scale Invariance

In Figure 1 of our paper, we compare the performance of KATE on scaled and unscaled data and empirically demonstrate its scale-invariance property. Please run the code in [KATEscaleinvariance.py](logistic_regression/KATEscaleinvariance.py) to reproduce the plots of Figure 1.

![Scale Invariance](image/scale_invariance.png)
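As a rough illustration of what "scaled data" means here, the snippet below rescales each feature column by a random factor; the exact scaling used in the experiment is the one in the script above, so treat these details as assumptions:

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.standard_normal((1000, 20))           # unscaled feature matrix
scales = 10.0 ** rng.uniform(-3, 3, size=20)  # random per-feature factors
X_scaled = X * scales                         # scaled data: columns rescaled

# A scale-invariant method such as KATE produces the same loss trajectory
# on (X, y) and (X_scaled, y); plain AdaGrad generally does not.
```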

### Robustness of KATE

In Figure 2 of our paper, we compare the performance of KATE with AdaGrad, AdaGradNorm, SGD-Decay and SGD-constant to examine the robustness of KATE. Please run the code in [RobustKATE.py](logistic_regression/RobustKATE.py) to reproduce the plots of Figure 2.

![Robustness of KATE](image/robust_KATE.png)
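Assuming the script needs no additional arguments, it can be run from the repository root:

```bash
python logistic_regression/RobustKATE.py
```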

### Performance of KATE on Real Data

In Figure 3 of our paper, we compare the performance of KATE with AdaGrad, AdaGradNorm, SGD-Decay and SGD-constant on real data. Please run the code in [KATEheart.py](logistic_regression/KATEheart.py), [KATEaustralian.py](logistic_regression/KATEaustralian.py) and [KATEsplice.py](logistic_regression/KATEsplice.py) to reproduce the performance of KATE on the heart, australian and splice datasets, respectively.

![KATE on real data](image/KATE_realdata.png)
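Assuming the scripts need no additional arguments, they can be run from the repository root:

```bash
python logistic_regression/KATEheart.py
python logistic_regression/KATEaustralian.py
python logistic_regression/KATEsplice.py
```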

## Training of Neural Network

In Figure 4 of our paper, we compare the performance of KATE with AdaGrad and ADAM on two tasks.

- **Image Classification**: training ResNet18 on the CIFAR10 dataset.

- **Text Classification**: fine-tuning BERT on the emotions dataset from the Hugging Face Hub.

Please run the code in [train.ipynb](train.ipynb) to reproduce the plots for these two tasks.
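One way to open the notebook, assuming Jupyter is installed in the environment:

```bash
jupyter notebook train.ipynb
```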

![KATE on Neural Net](image/neural_net.png)