https://github.com/IbrahimSobh/Segmentation

In this tutorial, you will perform inference across 10 well-known pre-trained semantic segmentors and fine-tune on a custom dataset. Design and train your own segmentor.

computer-vision deep-learning semantic-segmentation


Host: GitHub
URL: https://github.com/IbrahimSobh/Segmentation
Owner: IbrahimSobh
License: mit
Created: 2022-03-19T17:44:06.000Z (over 2 years ago)
Default Branch: main
Last Pushed: 2022-04-24T00:14:21.000Z (over 2 years ago)
Last Synced: 2024-04-17T22:09:04.514Z (7 months ago)
Topics: computer-vision, deep-learning, semantic-segmentation
Language: Jupyter Notebook
Homepage:
Size: 5.87 MB
Stars: 69
Watchers: 2
Forks: 10
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

README

# Semantic segmentation

![seg_cov](images/segcover.png)

Semantic image segmentation is the task of assigning each pixel in an image to one class from a predefined set of classes.
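
As a minimal illustration of this definition, the sketch below turns per-class scores into a label map by picking the highest-scoring class for every pixel. The class names, the score layout (`scores[c][y][x]`) and the helper name `label_map_from_scores` are assumptions made for illustration; plain lists are used so the snippet has no dependencies, whereas real pipelines would use an array library.

```python
from typing import List, Sequence

# Hypothetical class set used only for this illustration.
CLASSES = ("sky", "tree", "road")


def label_map_from_scores(scores: Sequence[Sequence[Sequence[float]]]) -> List[List[int]]:
    """Return an H x W map of class indices from C x H x W per-class scores."""
    num_classes = len(scores)
    height = len(scores[0])
    width = len(scores[0][0])
    return [
        [
            # Index of the class with the highest score at pixel (y, x).
            max(range(num_classes), key=lambda c: scores[c][y][x])
            for x in range(width)
        ]
        for y in range(height)
    ]


# A 2 x 2 image with one score plane per class.
scores = [
    [[0.9, 0.1], [0.2, 0.1]],  # sky
    [[0.05, 0.8], [0.1, 0.2]],  # tree
    [[0.05, 0.1], [0.7, 0.7]],  # road
]
label_map = label_map_from_scores(scores)
named = [[CLASSES[c] for c in row] for row in label_map]
print(named)
```

Every pixel receives exactly one label, which is what distinguishes this output from image-level classification.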

In this tutorial, you will:

- Perform inference with 10 well-known semantic segmentors 

- Fine-tune semantic segmentors on a custom dataset

- Design and train your own semantic segmentation model 

This work is based on [MMSegmentation](https://github.com/open-mmlab/mmsegmentation), the [OpenMMLab](https://github.com/open-mmlab) semantic segmentation toolbox and benchmark.

## 10 Semantic segmentors

| Segmentor     | Paper         |
| ------------- |:-------------:|
| [FCN](https://arxiv.org/abs/1411.4038) | Fully Convolutional Networks for Semantic Segmentation (2017) |
| [U-Net](https://arxiv.org/abs/1505.04597) | Convolutional Networks for Biomedical Image Segmentation (2015) |
| [PSPNet](https://arxiv.org/abs/1612.01105) | Pyramid Scene Parsing Network (2017) |
| [DeepLabV3](https://arxiv.org/abs/1706.05587) | Rethinking Atrous Convolution for Semantic Image Segmentation (2017) |
| [UPerNet](https://arxiv.org/pdf/1807.10221.pdf) | Unified Perceptual Parsing for Scene Understanding (2018) |
| [CCNet](https://arxiv.org/abs/1811.11721) | Criss-Cross Attention for Semantic Segmentation (2019) |
| [FastFCN](https://arxiv.org/abs/1903.11816) | Rethinking Dilated Convolution in the Backbone for Semantic Segmentation (2019) |
| [SETR](https://arxiv.org/abs/2012.15840) | Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers (2020) |
| [Segmenter](https://arxiv.org/abs/2105.05633) | Transformer for Semantic Segmentation (2021) |
| [SegFormer](https://arxiv.org/abs/2105.15203) | Simple and Efficient Design for Semantic Segmentation with Transformers (2021) |

![res_yolof](https://github.com/open-mmlab/mmsegmentation/raw/master/resources/seg_demo.gif)

-----

## Perform inference 



Here is how to load a pre-trained model, perform inference, and visualize the results.

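```
from mmseg.apis import inference_segmentor, init_segmentor

# config / checkpoint: paths to the model's config file and pre-trained weights
# img: path to the input image; m_name: display name of the model
# Build the segmentor and load its weights onto the first GPU
model = init_segmentor(config, checkpoint, device='cuda:0')
# Run inference on a single image
result = inference_segmentor(model, img)
# Overlay the predicted segmentation on the image and save it to disk
model.show_result(img, result, out_file='result.jpg', win_name=m_name)
```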
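
To inspect a prediction numerically rather than visually, one option is to count how much of the image each class covers. The sketch below assumes the prediction is available as a 2D grid of class indices (in MMSegmentation 0.x, `inference_segmentor` is expected to return a list with one such array per image, which would need `.tolist()` first). The helper name `class_coverage`, the class names and the toy values are hypothetical.

```python
from collections import Counter
from typing import Dict, Sequence


def class_coverage(label_map: Sequence[Sequence[int]], classes: Sequence[str]) -> Dict[str, float]:
    """Return the fraction of pixels assigned to each class that appears in the map."""
    counts = Counter(label for row in label_map for label in row)
    total = sum(counts.values())
    return {classes[idx]: n / total for idx, n in sorted(counts.items())}


# Toy 2 x 4 prediction standing in for a real label map (assumed values).
toy_classes = ("sky", "tree", "road")
toy_prediction = [
    [0, 0, 1, 1],
    [2, 2, 2, 1],
]
coverage = class_coverage(toy_prediction, toy_classes)
print(coverage)
```

A summary like this makes it easier to compare segmentors on the same image than looking at overlays alone.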

### Results

| Segmentor     | Result         |
| ------------- |:-------------:|
| [FCN](https://arxiv.org/abs/1411.4038) | ![res_yolof](images/fcn.png) |
| [U-Net](https://arxiv.org/abs/1505.04597) | ![res_yolof](images/unet.png) |
| [PSPNet](https://arxiv.org/abs/1612.01105) | ![res_yolof](images/pspnet.png) |
| [DeepLabV3](https://arxiv.org/abs/1706.05587) | ![res_yolof](images/deeplab.png) |
| [UPerNet](https://arxiv.org/pdf/1807.10221.pdf) | ![res_yolof](images/upernet.png) |
| [CCNet](https://arxiv.org/abs/1811.11721) | ![res_yolof](images/ccnet.png) |
| [FastFCN](https://arxiv.org/abs/1903.11816) | ![res_yolof](images/fastfcn.png) |
| [SETR](https://arxiv.org/abs/2012.15840) | ![res_yolof](images/setr.png) |
| [Segmenter](https://arxiv.org/abs/2105.05633) | ![res_yolof](images/segmenter.png) |
| [SegFormer](https://arxiv.org/abs/2105.15203) | ![res_yolof](images/segformer.png) |

-----

## Fine-tune semantic segmentors on a custom dataset



- Select a semantic segmentation model: **SEgmentation TRansformer** [SETR](https://arxiv.org/abs/2012.15840)

- Add a new dataset class: [Scene Understanding Datasets](http://dags.stanford.edu/projects/scenedataset.html)

- Create a config file.

- Conduct training and evaluation.
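
The config step usually comes down to overriding a few fields of a base config, chiefly the dataset type and the number of classes in every head. The sketch below shows that idea on a plain dictionary standing in for a loaded config. The helper `set_num_classes`, the dataset class name `StanfordBackgroundDataset`, and the shape of `base_cfg` are assumptions for illustration, not the tutorial's actual config; the class count of 8 follows the label set of the Stanford Background dataset.

```python
import copy
from typing import Any, Dict


def set_num_classes(cfg: Dict[str, Any], num_classes: int) -> Dict[str, Any]:
    """Return a copy of cfg with num_classes set on the decode and auxiliary heads."""
    new_cfg = copy.deepcopy(cfg)
    model = new_cfg["model"]
    heads = [model["decode_head"]]
    aux = model.get("auxiliary_head")
    # The auxiliary head may be absent, a single head, or a list of heads.
    if isinstance(aux, dict):
        heads.append(aux)
    elif isinstance(aux, (list, tuple)):
        heads.extend(aux)
    for head in heads:
        head["num_classes"] = num_classes
    return new_cfg


# Simplified stand-in for a base config (hypothetical values).
base_cfg = {
    "dataset_type": "ADE20KDataset",
    "model": {
        "decode_head": {"type": "SETRUPHead", "num_classes": 150},
        "auxiliary_head": [
            {"type": "SETRUPHead", "num_classes": 150},
            {"type": "SETRUPHead", "num_classes": 150},
        ],
    },
}

custom_cfg = set_num_classes(base_cfg, num_classes=8)
custom_cfg["dataset_type"] = "StanfordBackgroundDataset"  # hypothetical dataset class name
```

Working on a copy keeps the base config reusable, and updating every head in one place avoids a mismatch between the decode head and the auxiliary heads.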

![setr_arch](images/setr_arch.png)

-----

## Design and train your own semantic segmentor



- Select a semantic segmentation model: [UPerNet](https://arxiv.org/pdf/1807.10221.pdf) Unified Perceptual Parsing for Scene Understanding

- Replace the ResNet backbone with a new one: [ConvNeXt](https://arxiv.org/pdf/2201.03545.pdf) A ConvNet for the 2020s

- Configure the heads.

- Add a new dataset class: [Scene Understanding Datasets](http://dags.stanford.edu/projects/scenedataset.html)

- Conduct training and evaluation.
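
When swapping backbones, the main thing to get right in the head configuration is that each head's input channels match the feature maps the new backbone produces. The sketch below writes a model config in the nested-dict style that MMSegmentation configs use and checks that consistency. The stage widths (96, 192, 384, 768) are those of ConvNeXt-Tiny; the `type` strings, the remaining values, and the helper `check_head_channels` are assumptions for illustration rather than the tutorial's exact config.

```python
from typing import Any, Dict, List

# Output channels of the four ConvNeXt-Tiny stages.
CONVNEXT_TINY_DIMS: List[int] = [96, 192, 384, 768]

model = dict(
    type="EncoderDecoder",
    backbone=dict(type="ConvNeXt", out_indices=[0, 1, 2, 3]),  # assumed registry name
    decode_head=dict(
        type="UPerHead",
        in_channels=CONVNEXT_TINY_DIMS,  # one entry per backbone stage
        in_index=[0, 1, 2, 3],
        channels=512,
        num_classes=8,  # assumed: 8 classes in the custom dataset
    ),
    auxiliary_head=dict(
        type="FCNHead",
        in_channels=384,  # reads the third stage only
        in_index=2,
        channels=256,
        num_classes=8,
    ),
)


def check_head_channels(model_cfg: Dict[str, Any], stage_dims: List[int]) -> None:
    """Raise ValueError if a head expects channels the backbone does not provide."""
    for name in ("decode_head", "auxiliary_head"):
        head = model_cfg[name]
        index = head["in_index"]
        if isinstance(index, int):
            expected = stage_dims[index]
        else:
            expected = [stage_dims[i] for i in index]
        if head["in_channels"] != expected:
            raise ValueError(f"{name}: in_channels {head['in_channels']} != {expected}")


check_head_channels(model, CONVNEXT_TINY_DIMS)  # passes silently
```

A check like this catches the most common error after a backbone swap, where the heads still carry the ResNet channel counts.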

![convnext](images/convnext.png)
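
Both workflows end with evaluation, which for semantic segmentation is commonly reported as mean Intersection over Union (mIoU). The sketch below computes it from a predicted and a ground-truth label map. The function name `mean_iou` and the toy data are illustrative, and skipping classes that appear in neither map is one common convention rather than the only one.

```python
from typing import List, Sequence


def mean_iou(pred: Sequence[Sequence[int]], target: Sequence[Sequence[int]], num_classes: int) -> float:
    """Mean IoU over the classes that occur in the prediction or the ground truth."""
    intersection = [0] * num_classes
    union = [0] * num_classes
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            if p == t:
                intersection[p] += 1
                union[p] += 1
            else:
                # A mismatched pixel counts toward the union of both classes.
                union[p] += 1
                union[t] += 1
    ious: List[float] = [i / u for i, u in zip(intersection, union) if u > 0]
    return sum(ious) / len(ious) if ious else 0.0


pred = [
    [0, 0, 1],
    [1, 1, 2],
]
target = [
    [0, 1, 1],
    [1, 1, 2],
]
score = mean_iou(pred, target, num_classes=3)
print(round(score, 4))
```

Here the per-class IoU values are 1/2, 3/4 and 1, so a single mislabeled pixel lowers the score of both classes involved.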

Regards!