https://github.com/navalnica/stablelungssegmentation
Diploma Thesis project to segment lungs on 3D CT scans
- Host: GitHub
- URL: https://github.com/navalnica/stablelungssegmentation
- Owner: navalnica
- Created: 2019-12-09T21:51:41.000Z (about 5 years ago)
- Default Branch: master
- Last Pushed: 2021-03-31T19:39:51.000Z (almost 4 years ago)
- Last Synced: 2023-06-25T21:07:25.242Z (over 1 year ago)
- Language: Python
- Homepage:
- Size: 1.44 MB
- Stars: 1
- Watchers: 3
- Forks: 0
- Open Issues: 8
- Metadata Files:
  - Readme: readme.md
# Stable Lungs Segmentation
This repository contains the pipeline to train and use
deep learning model to segment lungs on 3D CT scans.
The code is written with the [PyTorch](https://pytorch.org/) framework.

There are 2 available backbones for segmentation models:
* [Unet](https://arxiv.org/abs/1505.04597)
* [MobileNetV2](https://arxiv.org/abs/1801.04381), used as an encoder together with a Unet-like decoder

This project is part of my Diploma Thesis dedicated to solving
the problem of Stable Lungs Segmentation in cases of:
* pathologies present in the thoracic cavity
* lungs damaged by various diseases
* irregular lung shape

### Example of segmentation improvements obtained during the work on the Diploma Thesis
![comparison](img/comparison.png)
Left: the **original** lungs mask for a 512x512 scan.
Right: the **new mask** for a 128x128 scan
(zoom was used to reduce the model training time).

### Requirements and installation
To use the project you will need Python 3
with additional packages installed.

To create a virtual environment with all the packages, use:
```
python3 -m venv venv
source venv/bin/activate
pip3 install -r requirements.txt
```
Or simply execute `venv_creation.sh` if you use Linux.

To run the code you will need to activate the virtual environment:
`source venv/bin/activate`
### 1-cycle learning
The project is __going to implement__ the 1-cycle learning policy proposed by
[Leslie Smith](https://arxiv.org/abs/1803.09820)
that significantly decreases the time required to train the model
and acts as a regularization method allowing for training at high learning rates.

This part consists of 2 steps:
* [x] Add LR-finder module to be able to choose optimal learning rates for 1-cycle policy
* [ ] Implement 1-cycle learning rate scheduler (change LR after each batch).

LR-finder is implemented as a separate endpoint (usage is described below).
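For intuition, the LR-finder step boils down to sweeping the learning rate across many orders of magnitude during a single training pass. A minimal sketch of such a sweep, assuming an exponential (log-uniform) schedule; the project's actual module may differ:

```python
def lr_sweep(lr_min: float = 1e-8, lr_max: float = 1e2, n_steps: int = 100):
    """Yield one learning rate per training batch, growing from lr_min to lr_max."""
    ratio = lr_max / lr_min
    for step in range(n_steps):
        # exponential interpolation: equal multiplicative increments per batch
        yield lr_min * ratio ** (step / (n_steps - 1))

lrs = list(lr_sweep())
# lrs[0] is lr_min, lrs[-1] is lr_max, values strictly increasing
```

During the sweep, the loss is recorded for each learning rate; the region just before the loss diverges suggests the maximum LR for the 1-cycle policy.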
The finder performs training for only 1 epoch with the learning rate increasing from a very low
value (~1e-8) to a very high one (~1e2). Results are stored as a `.csv` dataframe
and a `.png` plot of the loss dynamics that looks like this:
![lr_finder_plots](img/lr_finder_plots.png)

### Pipeline endpoints
There are a couple of command-line endpoints implemented with the
[Click](https://click.palletsprojects.com/en/7.x/) module
that make it easy to use the pipeline for different tasks.

To call any endpoint, use:
```
(venv) $ python main.py
```

All the endpoints are to be called from the project root folder.
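As an illustration of how such endpoints are typically wired up with Click, here is a hypothetical, trimmed-down sketch (not the project's actual `main.py`; only two of the options from the help output below are mirrored):

```python
import click

@click.group()
def cli():
    """Stable Lungs Segmentation pipeline (illustrative sketch)."""

@cli.command()
@click.option("--device", type=click.Choice(["cpu", "cuda:0", "cuda:1"]),
              default="cuda:0", help="device to use")
@click.option("--epochs", type=int, required=True,
              help="max number of epochs to train")
def train(device: str, epochs: int):
    """Build and train the model."""
    click.echo(f"training for {epochs} epochs on {device}")

if __name__ == "__main__":
    cli()
```

Each subcommand registered on the group becomes an endpoint (`python main.py train ...`), and Click generates the `--help` text shown below from the decorators.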
1. `train`
Build and train the model. Heavy augs and warm start are supported.
```
Usage: main.py train [OPTIONS]

Options:
  --launch [local|server]         launch location. used to determine default
                                  paths  [default: server]
  --architecture [unet|mnet2]     model architecture (unet, mnet2)
                                  [default: unet]
  --device [cpu|cuda:0|cuda:1]    device to use  [default: cuda:0]
  --dataset [nifti|numpy]         dataset type  [default: numpy]
  --heavy-augs / --no-heavy-augs  whether to apply different number of
                                  augmentations for hard and regular train
                                  images (uses docs/hard_cases_mapping.csv to
                                  identify hard cases)  [default: True]
  --epochs INTEGER                max number of epochs to train  [required]
  --out TEXT                      directory path to store artifacts
  --max-batches INTEGER           max number of batches to process. use as
                                  sanity check. if no value passed than will
                                  process the whole dataset.
  --checkpoint TEXT               path to initial .pth checkpoint for warm
                                  start
  --help                          Show this message and exit.
```

2. `segment-scans`
Segment Nifti `.nii.gz` scans with an already trained model stored in a `.pth` file.
```
Usage: main.py segment-scans [OPTIONS]

Options:
  --launch [local|server]       launch location. used to determine default
                                paths  [default: server]
  --architecture [unet|mnet2]   model architecture (unet, mnet2)
                                [default: unet]
  --device [cpu|cuda:0|cuda:1]  device to use  [default: cuda:0]
  --checkpoint TEXT             path to checkpoint .pth file
  --scans TEXT                  path to directory with nifti scans
  --subset [all|validation]     what scans to segment under --scans dir:
                                either all, or the ones from "validation"
                                dataset  [default: all]
  --out TEXT                    path to output directory with segmented masks
  --postfix TEXT                postfix to set for segmented masks
                                [default: autolungs]
  --help                        Show this message and exit.
```

3. `lr-find`
Find optimal LR for training with 1-cycle policy.
```
Usage: main.py lr-find [OPTIONS]

Options:
  --launch [local|server]       launch location. used to determine default
                                paths  [default: server]
  --architecture [unet|mnet2]   model architecture (unet, mnet2)
                                [default: unet]
  --device [cpu|cuda:0|cuda:1]  device to use  [default: cuda:0]
  --dataset [nifti|numpy]       dataset type  [default: numpy]
  --out TEXT                    directory path to store artifacts
  --help                        Show this message and exit.
```

4. `create-numpy-dataset`
Create a `numpy` dataset from the initial Nifti `.nii.gz` scans to speed up training.
```
Usage: main.py create-numpy-dataset [OPTIONS]

Options:
  --launch [local|server]  launch location. used to determine default paths
                           [default: server]
  --scans TEXT             path to directory with nifti scans
  --masks TEXT             path to directory with nifti binary masks
  --zoom FLOAT             zoom factor for output images  [default: 0.25]
  --out TEXT               path to output directory with numpy dataset
  --help                   Show this message and exit.
```
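For intuition on the `--zoom` option: a factor of 0.25 turns a 512x512 slice into a 128x128 one, as in the comparison image above. A crude nearest-neighbour sketch using plain NumPy striding (the function name is illustrative; a real pipeline would likely use an interpolating resampler such as `scipy.ndimage.zoom` instead):

```python
import numpy as np

def zoom_slice(slice_2d: np.ndarray, factor: float = 0.25) -> np.ndarray:
    """Downsample a 2D slice by integer striding (nearest-neighbour).

    Only handles factors of the form 1/k for integer k; this is a
    simplification for illustration, not an interpolating resampler.
    """
    step = int(round(1 / factor))
    return slice_2d[::step, ::step]

# a dummy 512x512 CT slice
scan_slice = np.arange(512 * 512, dtype=np.float32).reshape(512, 512)
small = zoom_slice(scan_slice, factor=0.25)  # shape (128, 128)
```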