Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/hwalsuklee/tensorflow-mnist-mlp-batch_normalization-weight_initializers
MNIST classification using Multi-Layer Perceptron (MLP) with 2 hidden layers. Some weight-initializers and batch-normalization are implemented.
- Host: GitHub
- URL: https://github.com/hwalsuklee/tensorflow-mnist-mlp-batch_normalization-weight_initializers
- Owner: hwalsuklee
- Created: 2017-01-02T13:18:47.000Z (almost 8 years ago)
- Default Branch: master
- Last Pushed: 2017-01-20T11:18:35.000Z (almost 8 years ago)
- Last Synced: 2023-10-20T20:16:55.286Z (about 1 year ago)
- Topics: batch-normalization, he, he-initializer, mnist, mnist-classification, tensorflow, weight-initializers, xavier, xavier-initializer
- Language: Python
- Homepage:
- Size: 1.62 MB
- Stars: 52
- Watchers: 4
- Forks: 17
- Open Issues: 0
- Metadata Files:
  - Readme: README.md
README
# Various initializers and batch normalization
An implementation of weight/bias initializers and batch normalization in TensorFlow.
The MNIST database is used for a performance comparison.
## Network architecture
In order to examine the effect of initializers and batch normalization, a simple network architecture, the multilayer perceptron (MLP), is employed.
The MLP has the following architecture (a minimal code sketch follows the list).
+ input layer : 784 nodes (MNIST image size)
+ first hidden layer : 256 nodes
+ second hidden layer : 256 nodes
+ output layer : 10 nodes (number of classes for MNIST)
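The layer sizes above translate roughly into the following tf.keras sketch. The repository itself is written against TensorFlow r0.12 and the ReLU hidden-layer activation is an assumption, so this only illustrates the topology, not the repository's actual code.

```python
import tensorflow as tf

# Rough sketch of the 784-256-256-10 MLP described above (modern tf.keras API).
# The hidden-layer activation (ReLU) is an assumption; the original TF r0.12
# implementation may differ.
def build_mlp():
    return tf.keras.Sequential([
        tf.keras.layers.Dense(256, activation="relu", input_shape=(784,)),  # first hidden layer
        tf.keras.layers.Dense(256, activation="relu"),                      # second hidden layer
        tf.keras.layers.Dense(10, activation="softmax"),                    # one node per MNIST class
    ])
```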
## Various initializers
The following initializers for the weights/biases of the network are considered (see the sketch after this list).
+ Weight Initializer
    * normal, truncated normal
    * xavier : [Understanding the difficulty of training deep feedforward neural networks](http://jmlr.org/proceedings/papers/v9/glorot10a/glorot10a.pdf)
* he : [Delving Deep into Rectifiers](http://arxiv.org/pdf/1502.01852v1.pdf)
+ Bias Initializer
* normal, constant
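As a rough illustration of these options (not the repository's TF r0.12 code), the names above could map onto tf.keras initializers as below; the stddev values and the normal-vs-uniform variants are assumptions.

```python
import tensorflow as tf

# Possible mapping from the option names above to tf.keras initializers.
# Exact parameters are assumptions; the original implementation may differ.
WEIGHT_INITIALIZERS = {
    "normal":           tf.keras.initializers.RandomNormal(stddev=0.01),
    "truncated_normal": tf.keras.initializers.TruncatedNormal(stddev=0.01),
    "xavier":           tf.keras.initializers.GlorotUniform(),  # Glorot & Bengio, 2010
    "he":               tf.keras.initializers.HeNormal(),       # He et al., 2015
}

BIAS_INITIALIZERS = {
    "normal": tf.keras.initializers.RandomNormal(stddev=0.01),
    "zero":   tf.keras.initializers.Zeros(),
}

# Example: a hidden layer with He weights and zero biases.
layer = tf.keras.layers.Dense(
    256,
    kernel_initializer=WEIGHT_INITIALIZERS["he"],
    bias_initializer=BIAS_INITIALIZERS["zero"],
)
```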
### Simulation results
+ Weight Initializer : he > truncated normal = xavier > normal
+ Bias Initializer : zero > normal

Sample results are shown below.
![weight_init](fig/results_weight_initializers.jpg)
|Index|Weight Initializer|Bias Initializer|Accuracy|
|:---:|:---:|:---:|:---:|
|normal_w_normal_b_0.9451|normal|normal|0.9451|
|normal_w_zero_b_0.9485|normal|zero|0.9485|
|truncated_normal_w_normal_b_0.9788|truncated_normal|normal|0.9788|
|truncated_normal_w_zero_b_0.9790|truncated_normal|zero|0.9790|
|xavier_w_normal_b_0.9800|xavier|normal|0.9800|
|xavier_w_zero_b_0.9806|xavier|zero|0.9806|
|he_w_normal_b_0.9798|he|normal|0.9798|
|he_w_zero_b_0.9811|he|zero|0.9811|

## Batch normalization
Batch normalization improves the performance of the network in terms of both final accuracy and convergence rate.
In this simulation, the bias initializer is the zero-constant initializer and the weight initializers are xavier / he. Sample results follow the sketch below.
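For reference, here is a minimal sketch of the batch-normalized variant in the modern tf.keras API. The repository targets TensorFlow r0.12, and placing batch normalization before the activation is an assumption about the original code.

```python
import tensorflow as tf

# Rough sketch of the batch-normalized MLP (modern tf.keras API, not the
# repository's TF r0.12 code).
def build_mlp(use_batch_norm=True):
    model = tf.keras.Sequential()
    model.add(tf.keras.layers.InputLayer(input_shape=(784,)))  # flattened MNIST image
    for _ in range(2):  # two hidden layers of 256 nodes each
        model.add(tf.keras.layers.Dense(256, kernel_initializer="he_normal",
                                        bias_initializer="zeros"))
        if use_batch_norm:
            # Normalize the pre-activations; placement relative to the
            # activation is an assumption about the original implementation.
            model.add(tf.keras.layers.BatchNormalization())
        model.add(tf.keras.layers.Activation("relu"))
    model.add(tf.keras.layers.Dense(10, activation="softmax"))
    return model
```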
![batch_norm](fig/results_batch_normalization.jpg)
|Index|Weight Initializer|Batch Normalization|Accuracy|
|:---:|:---:|:---:|:---:|
|xavier_woBN_0.9806|xavier|Unused|0.9806|
|xavier_withBN_0.9812|xavier|Used|0.9812|
|he_woBN_0.9811|he|Unused|0.9811|
|he_withBN_0.9837|he|Used|0.9837|

## Usage
`python run_main.py --weight-init <weight_initializer> --bias-init <bias_initializer> --batch-norm <True or False>`

`<weight_initializer>` must be selected from [normal, truncated_normal, xavier, he].

`<bias_initializer>` must be selected from [normal, zero].

For example: `python run_main.py --weight-init xavier --bias-init zero --batch-norm True`.
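As a hypothetical sketch only (the repository's run_main.py may parse its arguments differently), the three flags above could be handled with argparse like this:

```python
import argparse

# Hypothetical argument parsing for the flags documented above; the actual
# run_main.py in the repository may differ.
parser = argparse.ArgumentParser()
parser.add_argument("--weight-init", default="he",
                    choices=["normal", "truncated_normal", "xavier", "he"])
parser.add_argument("--bias-init", default="zero",
                    choices=["normal", "zero"])
parser.add_argument("--batch-norm", type=lambda s: s.lower() == "true", default=True,
                    help="True/False flag for batch normalization")
args = parser.parse_args()
print(args.weight_init, args.bias_init, args.batch_norm)
```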
## Acknowledgement
This implementation has been tested on TensorFlow r0.12.