Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/ranpy13/portrait-blur-bokeh

Attempt to recreate the DSLR like portrait bokeh blur effect on any normal image using deep learning, resnet and unet.
https://github.com/ranpy13/portrait-blur-bokeh

bokeh deep-neural-networks opencv resnet unet-image-segmentation

Last synced: about 2 months ago
JSON representation

Attempt to recreate the DSLR like portrait bokeh blur effect on any normal image using deep learning, resnet and unet.

Awesome Lists containing this project

README

        

# Bokeh Effect

## Deep Learning Framework for Depth Estimation
We aim to build a robust model that produces bokeh effect from an input image using Deep Learning without using advanced camera lenses or dual lenses. We approach this problem with GANs for depth estimation of input image. We therefore model the Bokeh mode problem to background-foreground seperation (segmentation) problem.
### NYU Depth Dataset
For this we used NYU Depth Dataset V2 to train both models. This dataset consists of 1400 images annotated with depth values. We process this depth values to generate depth maps.


![enter image description here](https://lh3.googleusercontent.com/gvOy1fJS1P8zSchk5yBWAPB5SKej8Bl0m0r-w0AsuQ8gZx7cwD2keq1q5nota5hJjSPz_omsgycw)![enter image description here](https://lh3.googleusercontent.com/kNSPA7VaOGPiubHXan0Bpyus2RIqhHTvIDY5dEQxcNYk1mVFLIm5KPrOkCceR60qLWudGgm5BI5C)
### Pix2Pix
We have trained both Resnet and Unet based Pix2Pix models (but Resnet model is used for final testing). In one step generator is trained twice and discriminator is trained once. It performs very well on NYU Depth Dataset but fails to seperate boundaries for other images (black and white appear in patches). This is possibly because it overfits the dataset (since number of parameters in pix2pix are quite less).

* Model Architecture

![enter image description here](https://lh3.googleusercontent.com/QADfPo1O5AFOKMplFnPiSpEPYCNg2Rxzwqy6s8MNTJIMj1clcWO8jsxCV5B9t9fReRj2hPQ4Go77 "Pix2Pix")

* Model Results




### Cycle GAN
Cycle-GAN consists of two generators (Resnet Generator) and two discrimantors (Patch-GAN Discriminator). We trained on both 6-block and 9-block generators (9-block generator was found to be better). In one step generator is trained twice and discriminator is trained once.
Cycle-GAN outperforms Pix2Pix on real world dataset in terms of boundary seperation though it sometimes mis-identifies foreground and background (especially when there is very less color change from background to foreground i.e. low image gradient at the point).

* Model Architecture

![enter image description here](https://lh3.googleusercontent.com/OrKT0VSEgIMstHKxzW5DW2JY25HtAJxafb_4XB_3Bcnc8Dw45puzHt_bqKsLqxup_LtzY9ms7DJH)

* Model losses





* Model Results




## Results
|Models |Abs Rel Error |RMS Error |
|----------------|-------------------------------|-----------------------------|
|Pix2Pix| 0.2448 | 2.0075 |
|Cycle-GAN |0.3728 |3.1207 |

Here are some of the loss graphs for our models.

Pix2Pix (using Resnet-9 Generator)






CycleGAN (also using Resnet-9 Generator)



## Bokeh from Depth + Input

After depth estimation is completed we use it to generate bokeh effect on input image. For this we heuristically manipulate Depth data to compensate for errors in it to get a good bokeh effect. We apply disc blurring (averaging pixels over a circle) on the input image and use that image to get the background pixels and use original image to get foreground pixels (boundary between foreground and background is typically set between 100-150 for pixel range 0-255). Now for the model to perform better on human images we use opencv implementation of face detection (using haar features) and penalize the regions containing human images to bring them to foreground and prevent them from blurring (in case the model wrongly predicts human faces).

### Bokeh Testing from Human Images

Even though our model was not trained on the images of humans, it still manages to find the boundaries and detect foreground and background in some cases

* DepthMap Generated






* Bokeh generated



## Requirements

Kindly run the requirements.txt to install all the required libraries for replicating this project.

## Running the code

For training the models from scratch

Run`bash get_dataset.sh` in order to download the NYUv-2 dataset and save it as .png images


Then run `bash make_folders.sh` to create the required directorie


After that run `python3 src/train.py <'CycleGAN/P2P'> [ ]` for training the CycleGAN or Pix2Pix model (The results will be visible in the
results folder)

Finding the Bokeh-Image from the given Image


Run `python3 src/bokehEffect.py ` to get the resultant Bokeh Image (bokeh.jpg)

Finding the Depthmap for the given RGB image


Run `python3 src/testModel.py ` to get the resultant depthMap (testOutput.jpeg)

## Downloading preTrained Models

The weights of the CycleGAN model can be downloaded from [here](https://drive.google.com/open?id=15vmPYVIL53xLbGQY5YYJpDmqeIfq3x2s)

Ther weights of the Pix2Pix model can be downloaded from [here](https://drive.google.com/open?id=1i4d4VFz4caLrcJ415T2uX2td_CweJaDz)

## References

--> https://neurohive.io/en/popular-networks/pix2pix-image-to-image-translation/

--> https://cs.nyu.edu/~silberman/datasets/nyu_depth_v2.html