# Texture Networks + Instance normalization: Feed-forward Synthesis of Textures and Stylized Images

In the paper [Texture Networks: Feed-forward Synthesis of Textures and Stylized Images](http://arxiv.org/abs/1603.03417) we describe a faster way to generate textures and stylize images. It requires learning a feedforward generator with a loss function proposed by [Gatys et al.](http://arxiv.org/abs/1505.07376). When the model is trained, a texture sample or stylized image of any size can be generated instantly.

[Improved Texture Networks: Maximizing Quality and Diversity in Feed-forward Stylization and Texture Synthesis](https://arxiv.org/abs/1701.02096) presents a better architectural design for the generator network. By switching from `batch_norm` to `Instance Norm`, we make the learning process easier, which results in much better quality.
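Intuitively, instance normalization replaces batch statistics with per-image statistics: each channel of each sample is normalized using only its own mean and standard deviation, so nothing is shared across the batch. The Torch snippet below is only an illustrative sketch of that computation, not the module used in this repository (the real layer is more elaborate, e.g. it needs a backward pass for training):

```lua
require 'torch'

-- Illustrative sketch only: normalize each (sample, channel) plane of a
-- 4D tensor with its own statistics. Assumes x is a contiguous
-- n x c x h x w tensor.
local function instanceNorm(x, eps)
  eps = eps or 1e-5
  local n, c = x:size(1), x:size(2)
  local flat = x:view(n, c, -1)                -- n x c x (h*w)
  local mean = flat:mean(3):expandAs(flat)     -- per-image, per-channel mean
  local std  = flat:std(3):expandAs(flat)      -- per-image, per-channel std
  return (flat - mean):cdiv(std + eps):viewAs(x)
end
```

By contrast, batch normalization computes one mean and standard deviation per channel over the whole batch, so the result for one image depends on the other images it is batched with.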

This repository also implements the stylization part of [Perceptual Losses for Real-Time Style Transfer and Super-Resolution](https://arxiv.org/abs/1603.08155).

You can find an online demo [here](https://riseml.com/DmitryUlyanov/texture_nets) (thanks to RiseML).

# Prerequisites
- [Torch7](http://torch.ch/docs/getting-started.html) + [loadcaffe](https://github.com/szagoruyko/loadcaffe)
- cudnn + torch.cudnn (optional)
- [display](https://github.com/szym/display) (optional)

Download VGG-19.
```
cd data/pretrained && bash download_models.sh && cd ../..
```

# Stylization

![](data/readme_pics/all.jpg " ")

### Training

#### Preparing image dataset

You can use an image dataset of any kind. For my experiments I tried the `ImageNet` and `MS COCO` datasets. The folder structure should be as follows:
```
dataset/train
dataset/train/dummy
dataset/val/
dataset/val/dummy
```

The dummy folders should contain images. The dataloader is based on the one used in [fb.resnet.torch](https://github.com/facebook/fb.resnet.torch).

Here is a quick example for MSCOCO:
```
wget http://msvocds.blob.core.windows.net/coco2014/train2014.zip
wget http://msvocds.blob.core.windows.net/coco2014/val2014.zip
unzip train2014.zip
unzip val2014.zip
mkdir -p dataset/train
mkdir -p dataset/val
ln -s `pwd`/val2014 dataset/val/dummy
ln -s `pwd`/train2014 dataset/train/dummy
```

#### Training a network

Basic usage:
```
th train.lua -data <path/to/dataset> -style_image path/to/img.jpg
```

These parameters work for me:
```
th train.lua -data <path/to/dataset> -style_image path/to/img.jpg -style_size 600 -image_size 512 -model johnson -batch_size 4 -learning_rate 1e-2 -style_weight 10 -style_layers relu1_2,relu2_2,relu3_2,relu4_2 -content_layers relu4_2
```
Check out the issues tab; you will find some useful advice there.

To achieve the results from the paper you need to play with `-image_size`, `-style_size`, `-style_layers`, `-content_layers`, `-style_weight`, `-tv_weight`.

Do not hesitate to set `-batch_size` to one, but remember: the larger the `-batch_size`, the larger the `-learning_rate` you can use.
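
For example, a single-image batch with a correspondingly smaller learning rate could look like this (the learning-rate value is only a plausible starting point, not a tuned setting):
```
th train.lua -data <path/to/dataset> -style_image path/to/img.jpg -batch_size 1 -learning_rate 5e-3
```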

### Testing

```
th test.lua -input_image path/to/image.jpg -model_t7 data/checkpoints/model.t7
```

Play with `-image_size` here. Pass the `-cpu` flag to process on the CPU.
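
For example, to render a larger image on the CPU (the size value here is only an illustration):
```
th test.lua -input_image path/to/image.jpg -model_t7 data/checkpoints/model.t7 -image_size 1024 -cpu
```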

You can find a **pretrained model** [here](https://yadi.sk/d/GwL9jNJovBwQg). It is *not* the model from the paper.

## Generating textures

soon

# Hardware
- The code was tested on a 12 GB NVIDIA Titan X GPU under Ubuntu 14.04.
- You may decrease `batch_size` and `image_size` if the model does not fit in your GPU memory.
- The pretrained models do not need much memory to sample from.

# Credits

The code is based on [Justin Johnson's great code](https://github.com/jcjohnson/neural-style) for artistic style.

The work was supported by [Yandex](https://www.yandex.ru/) and [Skoltech](http://sites.skoltech.ru/compvision/).