https://github.com/codingfisch/affine_image.np

Pure NumPy implementation of affine image transformations
https://github.com/codingfisch/affine_image.np
Last synced: 7 months ago
JSON representation
Pure NumPy implementation of affine image transformations
Host: GitHub
URL: https://github.com/codingfisch/affine_image.np
Owner: codingfisch
License: mit
Created: 2024-07-28T11:07:50.000Z (over 1 year ago)
Default Branch: main
Last Pushed: 2024-08-02T14:31:10.000Z (over 1 year ago)
Last Synced: 2025-03-06T12:51:17.159Z (8 months ago)
Language: Python
Size: 1.19 MB
Stars: 0
Watchers: 1
Forks: 0
Open Issues: 3
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project

README

          # affine_image.np

Affine transformations on (currently only 3D, maybe later 2D) arrays via NumPy, intended to be

- an **alternative** to [`F.affine_grid`](https://pytorch.org/docs/stable/generated/torch.nn.functional.affine_grid.html), [`F.grid_sample`](https://pytorch.org/docs/stable/generated/torch.nn.functional.grid_sample.html) or [`scipy.ndimage.affine_transform`](https://docs.scipy.org/doc/scipy/reference/generated/scipy.ndimage.affine_transform.html) (with [caveats](https://github.com/codingfisch/affine_image.np?tab=readme-ov-file#compatibility-))

- used as **pseudocode** since it is **<100 lines of code** -> easy to port into other array frameworks

🛠️ Install via: `pip install affine-image`

## Usage 💡

### How-to (Basic)

Given you can read affine matrices, the following conventions (taken from PyTorch) + an example should get you started

- 1️⃣ **batch** and **channel** dim. prior to image dim. -> (batch, channel, x, y, z) for 3D images

- 2️⃣ **affine** acts in **inverse order** on image dim. (first row acts on z, second row acts on y, third row acts on x)

- 3️⃣ **translation** parameter (=value in last column of affine) **of 1 moves the image by half its size** in the respective dim.

```python

import numpy as np

from affine_image import affine_transform_3d

b, c, x, y, z = (1, 3, 5, 5, 5)

im = np.random.rand(b, c, x, y, z)  # 1️⃣ shape: (batch, channels, x, y, z)

affine = np.array([[[1.5, 0, 0, 0],  # 2️⃣ acts on z-dim.: zoom of 150%

                    [0, 1, 0, 1.0],  # 2️⃣ acts on y-dim.: 3️⃣ translation by 2.5 pixels (=y/2)

                    [0, 0, 1, 0]]])  # 2️⃣ acts on x-dim.: zoom of 100% (i.e. no change)

shape = (x, y, z)

# Apply affine ✨ Since we are in the README, show all possible arguments (with default values)

im_out = affine_transform_3d(im, affine, shape, nearest=False, padding='zeros', align_corners=False, scipy_affine=False)

```

`affine_transform_3d` is the main function of this package and takes the arguments

- `im`: Input image array with 5 dimensions (batch, channel, x, y, z)

- `affine`: Affine transformation matrix with 3 dimensions (batch, 3, 4)

- `shape`: Desired output shape 

- `nearest`: Use nearest-neighbor interpolation if `True`, otherwise use linear (=trilinear for 3D) interpolation

- `padding`: Padding mode, either `'zeros'`, `'border'`, `'reflection'` or int/float (=padding value).

  (`'border'` and `'reflection'` are analogous to `'nearest'` and `'mirror'` in scipy)

- `align_corners`: Align corners flag (see [PyTorch's docs](https://pytorch.org/docs/stable/generated/torch.nn.functional.grid_sample.html))

- `scipy_affine`: Use SciPy affine convention if `True`

If `scipy_affine` is set to `True`, the conventions 2️⃣ and 3️⃣ are replaced with

- 2️⃣* **affine** acts in **normal order** on image dim. (first row acts on x, second row acts on y, third row acts on y)

- 3️⃣* **translation** parameter (=value in last column of affine) **of 1 moves the image by one pixel** in the respective dim.

### Why? (Interlude for the Curious 🤓)

This subsection serves readers who are not familiar with PyTorch who probably ask: 

Why did `affine_image` (per default) follow the weird PyTorch conventions?

Let's start with a rewrite of the above example in `torch` (=PyTorch)

```python

import torch

import torch.nn.functional as F

b, c, x, y, z = (1, 3, 5, 5, 5)

im = torch.rand(b, c, x, y, z) 

affine = torch.tensor([[[1.5, 0, 0, 0],

                        [0, 1, 0, 1.0],

                        [0, 0, 1, 0]]])

shape = (x, y, z)

# Apply affine in torch

grid = F.affine_grid(affine, size=[1, 3, *shape], align_corners=True)

im_out = F.grid_sample(im, grid, mode='bilinear', padding_mode='zeros', align_corners=True)

```

Note that `torch` requires two steps to apply an affine to an image

1. Pass `affine` to `F.affine_grid` which returns a `grid`

2. Apply the `grid` to the image using `F.grid_sample`

Let's look at the shape of the grid to understand it

```python

print(grid.shape)  # Output: [1, 5, 5, 5, 3] = [1, *shape, 3]

```

The `grid` contains coordinates w.r.t the input image from which the output image is sampled, e.g.

```python

print(grid[0, 0, 0, 0, :])  # Output: [-1.5000,  0.0000, -1.0000] (align_corners=True in code above made coordinates more understandable here)

```

are the z, y and x coordinate in the input image from which the first (a corner) pixel of the output image are sampled.

Ok, everything is set up to **finally tackle** the **Why...?s**:

- **Why two steps?**: When applying an affine to multiple arrays/tensors, `grid` can be reused to avoid recalculation

- **Why 1️⃣?**: Stacking images along the **batch** dim. enables **parallel** application of multiple affines

- **Why 2️⃣?**: No idea 🤔 Probably something about speed in the underlying C++/CUDA code of `torch`

  - **...but why does `affine-image` follow it anyway?**: To avoid the introduction of another set of conventions

- **Why 3️⃣?**: Since `grid` coordinates of -1/1 indicate edges of the input images with 0 indicating the center...

  - **...but why?**: Makes `grid` coordinates more general since they are independent of image shapes

### How-to (Advanced)?

Similar to PyTorch, `affine_transform_3d` behind the scenes uses a `grid` to resample the image. 

Let's rewrite the first example to explicitly work with a `grid` via `affine_image`

```python

import numpy as np

from affine_image import affine_grid_3d, sample_linear_3d, sample_nearest_3d

b, c, x, y, z = (1, 3, 5, 5, 5)

im = np.random.rand(b, c, x, y, z)

affine = np.array([[[1.5, 0, 0, 0],

                    [0, 1, 0, 1.0],

                    [0, 0, 1, 0]]])

shape = (x, y, z)

grid = affine_grid_3d(affine, shape, align_corners=False)

im_out = sample_linear_3d(im, grid, padding='zeros', align_corners=False)

```

To run nearest-neighbor interpolation, replace `sample_linear_3d` with `sample_nearest_3d`

```python

im_out = sample_nearest_3d(im, grid, padding='zeros', align_corners=False)

```

If you have read the full **Usage 💡** section, here, take a cookie 🍪

## Speed 💨

Compared to `torch` and `scipy`, `affine-image` runs at ~25% the speed for trilinear interpolation and ~50% speed for nearest interpolation 

🤓 Pretty OK for being a naive NumPy implementation!

*Runtimes on AMD Ryzen 9 5950X CPU with 16 cores*

### Default runtime (in seconds)

| Image size (Interpolation) | torch     | scipy | affine-image |

|-------------|-----------|-------|--------------|

| 64³ (nearest) | **0.004** | 0.005 | 0.009 |

| 64³ (trilinear) | **0.006** | 0.008 | 0.029 |

| 128³ (nearest) | **0.029** | 0.043 | 0.096 |

| 128³ (trilinear) | **0.048** | 0.064 | 0.262 |

| 256³ (nearest) | **0.306** | 0.355 | 0.749 |

| 256³ (trilinear) | **0.434** | 0.529 | 2.237 |

### Single-thread runtime (in seconds)

| Image size (Interpolation) | torch     | scipy     | affine-image |

|-------------|-----------|-----------|--------------|

| 64³ (nearest) | **0.005** | **0.005** | 0.009 |

| 64³ (trilinear) | **0.007** | 0.009     | 0.031 |

| 128³ (nearest) | **0.042** | 0.043     | 0.093 |

| 128³ (trilinear) | **0.062** | 0.064     | 0.251 |

| 256³ (nearest) | 0.413     | **0.353** | 0.757 |

| 256³ (trilinear) | 0.569     | **0.536** | 2.262 |

## Compatibility 📏

`affine_grid_3d` is compatible with `F.affine_grid` (meaning their respective outputs match) 🎉

Besides that, the outputs of `affine-image` currently slightly differ from the outputs of `torch` and `scipy`.

For `nearest=True` (especially with `align_corners=True`) `affine-image` almost matches `torch`:

![issue1](https://github.com/user-attachments/assets/26653dcc-149a-4830-87c0-475367deb5a2)

The test script offers plots (like the one above) and colorful terminal output to chase the remaining mismatches.

Contributions (see [Issues](https://github.com/codingfisch/affine_image.np/issues)) are much appreciated 🤗
ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/codingfisch/affine_image.np

Awesome Lists containing this project

README