https://github.com/jaxony/unet-pytorch

U-Net implementation for PyTorch based on https://arxiv.org/abs/1505.04597
https://github.com/jaxony/unet-pytorch

deep-learning neural-network python pytorch semantic-segmentation

Last synced: 2 months ago
JSON representation

U-Net implementation for PyTorch based on https://arxiv.org/abs/1505.04597

Host: GitHub
URL: https://github.com/jaxony/unet-pytorch
Owner: jaxony
License: mit
Created: 2017-08-13T01:35:37.000Z (almost 8 years ago)
Default Branch: master
Last Pushed: 2017-12-04T03:23:16.000Z (over 7 years ago)
Last Synced: 2025-03-31T11:21:58.660Z (3 months ago)
Topics: deep-learning, neural-network, python, pytorch, semantic-segmentation
Language: Python
Homepage:
Size: 41 KB
Stars: 275
Watchers: 9
Forks: 63
Open Issues: 1
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

# U-Net implementation in PyTorch

The U-Net is an encoder-decoder neural network used for **semantic segmentation**. The implementation in this repository is a modified version of the U-Net proposed in [this paper](https://arxiv.org/abs/1505.04597).

![U-Net Architecture](unet-architecture.png)

## Features

1. **You can alter the U-Net's depth.**
The original U-Net uses a depth of 5, as depicted in the diagram above. The word "depth" specifically
refers to the number of *different* spatially-sized convolutional outputs. With this U-Net implementation, you can easily vary the depth.

2. **You can merge decoder and encoder pathways in two ways.**
In the original U-Net, the decoder and encoder activations are merged by concatenating channels.
I've implemented a ResNet-style merging of the decoder and encoder activations by adding
these activations. This was easy to code up, but it may not make sense theoretically and has not been tested.

## Pixel-wise loss for semantic segmentation
I had some trouble getting the pixel-wise loss working correctly for a semantic segmentation task.
Here's how I got it working in the end.

```python
from model import UNet

model = UNet()

# set up dataloaders, etc.

output = model(some_input_data)

# permute is like np.transpose: (N, C, H, W) => (H, W, N, C)
# contiguous is required because of this issue: https://github.com/pytorch/pytorch/issues/764
# view: reshapes the output tensor so that we have (H * W * N, num_class)
# NOTE: num_class == C (number of output channels)
output = output.permute(2, 3, 0, 1).contiguous().view(-1, num_classes)
```

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/jaxony/unet-pytorch

Awesome Lists containing this project

README