https://github.com/visionscaper/stateful_multi_gpu
Experimental utility to build stateful RNN models for multi GPU training.
- Host: GitHub
- URL: https://github.com/visionscaper/stateful_multi_gpu
- Owner: visionscaper
- License: MIT
- Created: 2017-12-07T19:32:37.000Z (almost 8 years ago)
- Default Branch: master
- Last Pushed: 2018-04-05T19:38:49.000Z (over 7 years ago)
- Last Synced: 2025-04-19T21:24:32.538Z (6 months ago)
- Topics: gru, keras, keras-models, keras-tensorflow, lstm, multi-gpu, rnn
- Language: Python
- Size: 5.86 KB
- Stars: 4
- Watchers: 3
- Forks: 1
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
README
# stateful_multi_gpu
Experimental utility, in development, to build stateful RNN models for multi-GPU training.

## How to use `stateful_multi_gpu()`
See `example.py` for a toy example of how the utility function can be used.

To make stateful models in Keras you need to provide the batch size you are using; `stateful_multi_gpu()` ensures that the right batch size is used throughout the model.

To use this utility function you need to create an `inputs_generator` method that can make inputs for your model for any batch size, and a `model_generator` method that can make your model for any batch size. For instance, as follows:
```python
from keras.layers import Input

def inputs_generator(batch_size):
    # A fixed batch_shape (rather than only a feature shape) is what
    # stateful RNN layers require.
    rnn_input = Input(
        name="rnn-input-%d" % batch_size,
        batch_shape=(batch_size, seq_len, num_symbols))

    return rnn_input
```

```python
from keras.layers import Dense, TimeDistributed
from keras.models import Model

def model_generator(batch_size):
    inputs = inputs_generator(batch_size)

    # RNNLayer stands in for any recurrent layer class, e.g. LSTM or GRU.
    layer_output = RNNLayer(
        state_size,
        stateful=stateful_model,
        return_sequences=True)(inputs)

    outputs = TimeDistributed(Dense(num_classes))(layer_output)

    return Model(inputs=inputs, outputs=outputs)
```

`stateful_multi_gpu()` uses these generator methods to create model inputs that expect batch size `batch_size`, and model replicas (one per GPU) that expect batch size `batch_size // num_gpus`. E.g.:

```python
parallel_model = stateful_multi_gpu(inputs_generator, model_generator, training_batch_size, num_gpus)
```

It is important that `batch_size` is evenly divisible by `num_gpus`: with `batch_size = 64` and `num_gpus = 4`, for example, each replica processes sub-batches of 16.
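
Below is a minimal sketch of how a full training run could look, assuming the `inputs_generator` and `model_generator` defined above. The import path, hyperparameter values, `mse` loss, and random data are made up for illustration, and whether state resets propagate through the parallel wrapper depends on the utility's internals.

```python
import numpy as np

from stateful_multi_gpu import stateful_multi_gpu  # assumed import path

# Illustrative values only; pick these to match your data.
seq_len, num_symbols, num_classes = 50, 128, 128
state_size = 256
stateful_model = True
num_gpus = 2
training_batch_size = 64  # must be evenly divisible by num_gpus

parallel_model = stateful_multi_gpu(
    inputs_generator, model_generator, training_batch_size, num_gpus)
parallel_model.compile(optimizer="adam", loss="mse")

# Dummy sequence data, just to make the sketch runnable.
x = np.random.rand(training_batch_size, seq_len, num_symbols)
y = np.random.rand(training_batch_size, seq_len, num_classes)

for epoch in range(10):
    # shuffle=False preserves sequence order, which statefulness relies on.
    parallel_model.fit(x, y, batch_size=training_batch_size,
                       epochs=1, shuffle=False)
    # Clear carried-over RNN states before the next pass over the data.
    parallel_model.reset_states()
```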