Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/DmitryUlyanov/texture_nets
Code for "Texture Networks: Feed-forward Synthesis of Textures and Stylized Images" paper.
https://github.com/DmitryUlyanov/texture_nets
neural-style style-transfer texture-networks torch
Last synced: 2 months ago
JSON representation
Code for "Texture Networks: Feed-forward Synthesis of Textures and Stylized Images" paper.
- Host: GitHub
- URL: https://github.com/DmitryUlyanov/texture_nets
- Owner: DmitryUlyanov
- License: apache-2.0
- Created: 2016-03-07T11:53:05.000Z (almost 9 years ago)
- Default Branch: master
- Last Pushed: 2018-01-07T06:56:34.000Z (about 7 years ago)
- Last Synced: 2024-11-11T12:44:42.013Z (2 months ago)
- Topics: neural-style, style-transfer, texture-networks, torch
- Language: Lua
- Homepage:
- Size: 9.31 MB
- Stars: 1,224
- Watchers: 58
- Forks: 217
- Open Issues: 42
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- awesome-neural-art - texture_nets - Code for "Texture Networks: Feed-forward Synthesis of Textures and Stylized Images". (Style Transfer)
README
# Texture Networks + Instance normalization: Feed-forward Synthesis of Textures and Stylized Images
In the paper [Texture Networks: Feed-forward Synthesis of Textures and Stylized Images](http://arxiv.org/abs/1603.03417) we describe a faster way to generate textures and stylize images. It requires learning a feedforward generator with a loss function proposed by [Gatys et al.](http://arxiv.org/abs/1505.07376). When the model is trained, a texture sample or stylized image of any size can be generated instantly.
[Improved Texture Networks: Maximizing Quality and Diversity in Feed-forward Stylization and Texture Synthesis](https://arxiv.org/abs/1701.02096) presents a better architectural design for the generator network. By switching `batch_norm` to `Instance Norm` we facilitate the learning process resulting in much better quality.
This also implements the stylization part from [Perceptual Losses for Real-Time Style Transfer and Super-Resolution](https://arxiv.org/abs/1603.08155).
You can find an oline demo [here](https://riseml.com/DmitryUlyanov/texture_nets) (thanks to RiseML).
# Prerequisites
- [Torch7](http://torch.ch/docs/getting-started.html) + [loadcaffe](https://github.com/szagoruyko/loadcaffe)
- cudnn + torch.cudnn (optionally)
- [display](https://github.com/szym/display) (optionally)Download VGG-19.
```
cd data/pretrained && bash download_models.sh && cd ../..
```# Stylization
![](data/readme_pics/all.jpg " ")
### Training
#### Preparing image dataset
You can use an image dataset of any kind. For my experiments I tried `Imagenet` and `MS COCO` datasets. The structure of the folders should be the following:
```
dataset/train
dataset/train/dummy
dataset/val/
dataset/val/dummy
```The dummy folders should contain images. The dataloader is based on one used in[fb.resnet.torch](https://github.com/facebook/fb.resnet.torch).
Here is a quick example for MSCOCO:
```
wget http://msvocds.blob.core.windows.net/coco2014/train2014.zip
wget http://msvocds.blob.core.windows.net/coco2014/val2014.zip
unzip train2014.zip
unzip val2014.zip
mkdir -p dataset/train
mkdir -p dataset/val
ln -s `pwd`/val2014 dataset/val/dummy
ln -s `pwd`/train2014 dataset/train/dummy
```#### Training a network
Basic usage:
```
th train.lua -data -style_image path/to/img.jpg
```These parameters work for me:
```
th train.lua -data -style_image path/to/img.jpg -style_size 600 -image_size 512 -model johnson -batch_size 4 -learning_rate 1e-2 -style_weight 10 -style_layers relu1_2,relu2_2,relu3_2,relu4_2 -content_layers relu4_2
```
Check out issues tab, you will find some useful advices there.To achieve the results from the paper you need to play with `-image_size`, `-style_size`, `-style_layers`, `-content_layers`, `-style_weight`, `-tv_weight`.
Do not hesitate to set `-batch_size` to one, but remember the larger `-batch_size` the larger `-learning_rate` you can use.
### Testing
```
th test.lua -input_image path/to/image.jpg -model_t7 data/checkpoints/model.t7
```Play with `-image_size` here. Raise `-cpu` flag to use CPU for processing.
You can find a **pretrained model** [here](https://yadi.sk/d/GwL9jNJovBwQg). It is *not* the model from the paper.
## Generating textures
soon
# Hardware
- The code was tested with 12GB NVIDIA Titan X GPU and Ubuntu 14.04.
- You may decrease `batch_size`, `image_size` if the model do not fit your GPU memory.
- The pretrained models do not need much memory to sample.# Credits
The code is based on [Justin Johnson's great code](https://github.com/jcjohnson/neural-style) for artistic style.
The work was supported by [Yandex](https://www.yandex.ru/) and [Skoltech](http://sites.skoltech.ru/compvision/).