https://github.com/StarostinV/convkan

Convolutional layer for Kolmogorov-Arnold Network (KAN)
https://github.com/StarostinV/convkan

Last synced: 6 months ago
JSON representation

Convolutional layer for Kolmogorov-Arnold Network (KAN)

Host: GitHub
URL: https://github.com/StarostinV/convkan
Owner: StarostinV
License: mit
Created: 2024-05-09T19:19:25.000Z (about 1 year ago)
Default Branch: master
Last Pushed: 2024-06-09T13:41:09.000Z (12 months ago)
Last Synced: 2024-06-10T13:05:25.247Z (12 months ago)
Language: Python
Homepage:
Size: 30.3 KB
Stars: 46
Watchers: 1
Forks: 6
Open Issues: 2
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

awesome-kan - convkan - in replacement of Conv2d) ｜ ![Github stars](https://img.shields.io/github/stars/StarostinV/convkan.svg) (Library / ConvKANs)
awesome-kan - convkan - in replacement of Conv2d) ｜ ![Github stars](https://img.shields.io/github/stars/StarostinV/convkan.svg) (Library / ConvKANs)

README

        # Convolutional KAN layer

## Implementation of the Convolutional Kolmogorov-Arnold Network layer in PyTorch.

A drop-in replacement for the torch.nn.Conv2d layer that uses the Kolmogorov-Arnold Network (KAN) as a convolutional kernel.

Currently, supports grouped convolution, padding with different modes, dilation, and stride. 

The KAN implementation is taken from the https://github.com/Blealtan/efficient-kan/ repository.

## Installation

From PyPI:

```bash

pip install convkan

```

From source:

```bash

git clone [email protected]:StarostinV/convkan.git

cd convkan

pip install .

```

## Usage

Training a simple model on MNIST (96% accuracy after the first epoch):

```python

import torch

from torch import nn

from torchvision import datasets, transforms

from tqdm import tqdm

from convkan import ConvKAN, LayerNorm2D

# Define the model

model = nn.Sequential(

    ConvKAN(1, 32, padding=1, kernel_size=3, stride=1),

    LayerNorm2D(32),

    ConvKAN(32, 32, padding=1, kernel_size=3, stride=2),

    LayerNorm2D(32),

    ConvKAN(32, 10, padding=1, kernel_size=3, stride=2),

    nn.AdaptiveAvgPool2d(1),

    nn.Flatten(),

).cuda()

# Define transformations and download the MNIST dataset

transform = transforms.Compose([

    transforms.ToTensor(),

    transforms.Normalize((0.1307,), (0.3081,))

])

train_dataset = datasets.MNIST(root='./data', train=True, download=True, transform=transform)

test_dataset = datasets.MNIST(root='./data', train=False, download=True, transform=transform)

train_loader = torch.utils.data.DataLoader(train_dataset, batch_size=128, shuffle=True)

test_loader = torch.utils.data.DataLoader(test_dataset, batch_size=128, shuffle=False)

# Define the loss function and optimizer

criterion = nn.CrossEntropyLoss()

optimizer = torch.optim.AdamW(model.parameters(), lr=1e-2)

# Train the model

model.train()

pbar = tqdm(train_loader)

for i, (x, y) in enumerate(pbar):

    x, y = x.cuda(), y.cuda()

    optimizer.zero_grad()

    y_hat = model(x)

    loss = criterion(y_hat, y)

    loss.backward()

    optimizer.step()

    pbar.set_description(f'Loss: {loss.item():.2e}')

model.eval()

correct = 0

total = 0

with torch.no_grad():

    pbar = tqdm(test_loader)

    for x, y in pbar:

        x, y = x.cuda(), y.cuda()

        y_hat = model(x)

        _, predicted = torch.max(y_hat, 1)

        total += y.size(0)

        correct += (predicted == y).sum().item()

        pbar.set_description(f'Accuracy: {100 * correct / total:.2f}%')

```

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/StarostinV/convkan

Awesome Lists containing this project

README