https://github.com/kyegomez/aoa-torch

Implementation of Attention on Attention in Zeta
https://github.com/kyegomez/aoa-torch

ai artificial-intelligence gpt4 machine-learning multi-modal multi-modality research

Last synced: about 1 year ago
JSON representation

Implementation of Attention on Attention in Zeta

Host: GitHub
URL: https://github.com/kyegomez/aoa-torch
Owner: kyegomez
License: mit
Created: 2023-12-04T22:28:30.000Z (over 2 years ago)
Default Branch: main
Last Pushed: 2025-04-19T12:55:33.000Z (about 1 year ago)
Last Synced: 2025-04-19T20:16:58.201Z (about 1 year ago)
Topics: ai, artificial-intelligence, gpt4, machine-learning, multi-modal, multi-modality, research
Language: Python
Homepage: https://discord.gg/GYbXvDGevY
Size: 2.19 MB
Stars: 4
Watchers: 2
Forks: 0
Open Issues: 1
Metadata Files:
- Readme: README.md
- Funding: .github/FUNDING.yml
- License: LICENSE

Awesome Lists containing this project

README

          [![Multi-Modality](agorabanner.png)](https://discord.gg/qUtxnK2NMf)

# Attention on Attention Implementation

This is a practice implementation after randomly finding it on Lucidrain's repo, I'm implementing the model architecture just for practice!

Basically the architecture is:

x => q, k, v -> multihead attn with residual q -> concat -> 2 linear projects

->sigmoid -> mult -> add -> norm -> ffn -> add -> norm with residual of first add and norm



# Install

`pip3 install --upgrade aoa-torch `

## Usage

### `AoA` Module

```python

import torch

from aoa.main import AoA

x = torch.randn(1, 10, 512)

model = AoA(512, 8, 64, 0.1)

out = model(x)

print(out.shape)

```

### `AoATransformer`

```python

import torch 

from aoa.main import AoATransformer

x = torch.randint(0, 100, (1, 10))

model = AoATransformer(512, 1, 100)

out = model(x)

print(out.shape)

```

## Citations

```bibtex

@misc{rahman2020improved,

    title   = {An Improved Attention for Visual Question Answering}, 

    author  = {Tanzila Rahman and Shih-Han Chou and Leonid Sigal and Giuseppe Carenini},

    year    = {2020},

    eprint  = {2011.02164},

    archivePrefix = {arXiv},

    primaryClass = {cs.CV}

}

```

```bibtex

@misc{huang2019attention,

    title   = {Attention on Attention for Image Captioning}, 

    author  = {Lun Huang and Wenmin Wang and Jie Chen and Xiao-Yong Wei},

    year    = {2019},

    eprint  = {1908.06954},

    archivePrefix = {arXiv},

    primaryClass = {cs.CV}

}

```

# License

MIT

# Todo

- [ ] Create sample training code using enwiki8

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/kyegomez/aoa-torch

Awesome Lists containing this project

README