https://github.com/zabir-nabil/keras-attn_aug_cnn
Extension of the `Attention Augmented Convolutional Networks` paper for 1-D convolution operation.
- Host: GitHub
- URL: https://github.com/zabir-nabil/keras-attn_aug_cnn
- Owner: zabir-nabil
- License: mit
- Created: 2019-09-03T12:09:43.000Z (almost 6 years ago)
- Default Branch: master
- Last Pushed: 2019-09-19T08:56:57.000Z (almost 6 years ago)
- Last Synced: 2025-01-31T03:01:52.112Z (5 months ago)
- Topics: attention, attention-augmented-cnn-1d, attention-augmented-conv, attention-cnn, keras-models, tensorflow-models
- Language: Jupyter Notebook
- Size: 164 KB
- Stars: 25
- Watchers: 2
- Forks: 9
- Open Issues: 1
- Metadata Files:
  - Readme: README.md
  - License: LICENSE
README
# keras-attn_aug_cnn
Extension of the `Attention Augmented Convolutional Networks` paper: an (admittedly hacky) implementation of the attention-augmented convolution for the 1-D case.
It can also be used inside a TensorFlow graph.
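For example, a minimal sketch of my own (not from the repo), assuming `model` is built as in the 1-D example below, showing the forward pass traced into a graph:

```
import tensorflow as tf

@tf.function  # traces the forward pass into a TensorFlow graph
def forward(batch):
    return model(batch)  # `model` as built in the 1-D example below

print(forward(tf.ones((1, 32, 10))).shape)  # (1, 32, 20)
```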
## Properties

```
depth_k | filters, depth_v | filters, Nh | depth_k, Nh | filters-depth_v
```

Here `a | b` reads "`a` divides `b`": `depth_k` and `depth_v` must divide `filters`, `num_heads` (Nh) must divide `depth_k`, and Nh must divide `filters - depth_v`.
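A quick sanity check of my own (not part of the repo) that encodes these constraints, using the parameter values from the examples below:

```
def check_aug_params(filters, depth_k, depth_v, num_heads):
    # Encodes the divisibility constraints listed above.
    assert filters % depth_k == 0, "depth_k must divide filters"
    assert filters % depth_v == 0, "depth_v must divide filters"
    assert depth_k % num_heads == 0, "num_heads must divide depth_k"
    assert (filters - depth_v) % num_heads == 0, "num_heads must divide filters - depth_v"

check_aug_params(filters=20, depth_k=4, depth_v=4, num_heads=4)  # values from the examples
```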
## 1-D CNN

```
from aug_attn import *
import tensorflow as tf
from tensorflow.keras.layers import Input, Conv1D
from tensorflow.keras.models import Model

ip = Input(shape=(None, 10))
cnn1 = Conv1D(filters=10, kernel_size=3, strides=1, padding='same')(ip)
x = augmented_conv1d(cnn1, shape=(32, 10), filters=20, kernel_size=5,
                     strides=1,
                     padding='causal',  # if causal convolution is needed
                     depth_k=4, depth_v=4,
                     num_heads=4, relative_encodings=True)
# depth_k | filters, depth_v | filters, Nh | depth_k, Nh | filters - depth_v
model = Model(ip, x)
model.summary()

x = tf.ones((1, 32, 10))
print(x.shape)
y = model(x)
print(y.shape)
```
```
Model: "model_2"
__________________________________________________________________________________________________
Layer (type) Output Shape Param # Connected to
==================================================================================================
input_3 (InputLayer) [(None, None, 10)] 0
__________________________________________________________________________________________________
conv1d_8 (Conv1D) (None, None, 10) 310 input_3[0][0]
__________________________________________________________________________________________________
conv1d_10 (Conv1D) (None, None, 12) 132 conv1d_8[0][0]
__________________________________________________________________________________________________
reshape_11 (Reshape) (None, 32, 1, 12) 0 conv1d_10[0][0]
__________________________________________________________________________________________________
attention_augmentation2d_2 (Att (None, None, None, N 64 reshape_11[0][0]
__________________________________________________________________________________________________
reshape_12 (Reshape) (None, 32, 4) 0 attention_augmentation2d_2[0][0]
__________________________________________________________________________________________________
conv1d_9 (Conv1D) (None, None, 16) 816 conv1d_8[0][0]
__________________________________________________________________________________________________
conv1d_11 (Conv1D) (None, 32, 4) 20 reshape_12[0][0]
__________________________________________________________________________________________________
reshape_10 (Reshape) (None, 32, 1, 16) 0 conv1d_9[0][0]
__________________________________________________________________________________________________
reshape_13 (Reshape) (None, 32, 1, 4) 0 conv1d_11[0][0]
__________________________________________________________________________________________________
concatenate_2 (Concatenate) (None, 32, 1, 20) 0 reshape_10[0][0]
reshape_13[0][0]
__________________________________________________________________________________________________
reshape_14 (Reshape) (None, 32, 20) 0 concatenate_2[0][0]
==================================================================================================
Total params: 1,342
Trainable params: 1,342
Non-trainable params: 0
__________________________________________________________________________________________________
(1, 32, 10)
(1, 32, 20)
```
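As the summary shows, the 1-D variant is implemented by lifting the `(batch, length, channels)` tensor to a width-1 2-D tensor, running the 2-D attention layer, and squeezing back. A minimal sketch of that reshape trick (shapes taken from the summary above; illustrative only, not the repo's API):

```
import tensorflow as tf

x = tf.ones((1, 32, 12))               # (batch, length, channels) 1-D feature map
x2d = tf.reshape(x, (-1, 32, 1, 12))   # add a dummy width axis, as in reshape_11
# ... AttentionAugmentation2D runs on the 4-D tensor in the real model ...
x1d = tf.reshape(x2d, (-1, 32, 12))    # squeeze back to 1-D
print(x1d.shape)  # (1, 32, 12)
```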
## 2-D CNN

```
from aug_attn import *
import tensorflow as tf
from tensorflow.keras.layers import Input, Conv2D
from tensorflow.keras.models import Model

ip = Input(shape=(32, 32, 10))
cnn1 = Conv2D(filters=10, kernel_size=3, strides=1, padding='same')(ip)
x = augmented_conv2d(cnn1, filters=20, kernel_size=5,  # shape parameter is not needed
                     strides=1,                        # padding is 'same' by default
                     depth_k=4, depth_v=4,
                     num_heads=4, relative_encodings=True)
# depth_k | filters, depth_v | filters, Nh | depth_k, Nh | filters - depth_v
model = Model(ip, x)
model.summary()

x = tf.ones((1, 32, 32, 10))
print(x.shape)
y = model(x)
print(y.shape)
```

```
__________________________________________________________________________________________________
Layer (type) Output Shape Param # Connected to
==================================================================================================
input_16 (InputLayer) (None, 32, 32, 10) 0
__________________________________________________________________________________________________
conv2d_11 (Conv2D) (None, 32, 32, 10) 910 input_16[0][0]
__________________________________________________________________________________________________
conv2d_13 (Conv2D) (None, 32, 32, 12) 132 conv2d_11[0][0]
__________________________________________________________________________________________________
attention_augmentation2d_14 (At (None, 32, 32, 4) 126 conv2d_13[0][0]
__________________________________________________________________________________________________
conv2d_12 (Conv2D) (None, 32, 32, 16) 4016 conv2d_11[0][0]
__________________________________________________________________________________________________
conv2d_14 (Conv2D) (None, 32, 32, 4) 20 attention_augmentation2d_14[0][0]
__________________________________________________________________________________________________
concatenate_14 (Concatenate) (None, 32, 32, 20) 0 conv2d_12[0][0]
conv2d_14[0][0]
==================================================================================================
Total params: 5,204
Trainable params: 5,204
Non-trainable params: 0
__________________________________________________________________________________________________
(1, 32, 32, 10)
(1, 32, 32, 20)
```
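Since the result is a plain Keras model, training works as usual. A minimal sketch of my own (dummy data, placeholder loss and optimizer), continuing from the 2-D example above:

```
import numpy as np

# Dummy data shaped like the 2-D example's input/output; replace with real data.
x_train = np.random.rand(8, 32, 32, 10).astype('float32')
y_train = np.random.rand(8, 32, 32, 20).astype('float32')

model.compile(optimizer='adam', loss='mse')  # placeholder choices
model.fit(x_train, y_train, batch_size=4, epochs=1)
```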
# Implementations

* https://github.com/titu1994/keras-attention-augmented-convs
* https://github.com/gan3sh500/attention-augmented-conv
* https://github.com/leaderj1001/Attention-Augmented-Conv2d