https://github.com/zsdonghao/stackednet

轻量、易于修改、可组层训练的神经网络 / Lightweight Greedy Layer-Wise Training Neural Network
https://github.com/zsdonghao/stackednet

deep-learning deep-learning-library multi-layer-perceptron theano

Last synced: 14 days ago
JSON representation

轻量、易于修改、可组层训练的神经网络 / Lightweight Greedy Layer-Wise Training Neural Network

Host: GitHub
URL: https://github.com/zsdonghao/stackednet
Owner: zsdonghao
License: mit
Created: 2016-04-28T12:10:46.000Z (about 9 years ago)
Default Branch: master
Last Pushed: 2016-05-01T15:51:05.000Z (about 9 years ago)
Last Synced: 2025-03-31T18:05:47.523Z (about 2 months ago)
Topics: deep-learning, deep-learning-library, multi-layer-perceptron, theano
Language: Python
Homepage:
Size: 55.7 KB
Stars: 7
Watchers: 2
Forks: 5
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

# StackedNet - Lightweight greedy layer-wise training

> The "StackedNet" class implemented Staked Sparse Autoencoders, on the top
> of Theano so as to use GPU acceleration. It's behaviour can be modified
> easily according to your demand. For advanced version, please contact author.

## Motivation

Most of the open access libraries do not support greedy layer-wise
initialization, developers and researches are suffer from implement it.
For this reason, I release this lightweight code which is easy to
modify.

## start_greedy_layer_training (unsupervised)

This method is used to initialize the weights in the layer-wise manner, using Sparse
Autoencoder, sigmoid and mean square error by default. This class is
easy to modify to any expected behaviour.

## start_fine_tune_training (supervised)

This method is used to train the whole network after greedy layer-wise training, using
softmax output and cross-entropy by default, without any dropout and
regularization. However, this example will save all parameters' value in the end, so the
author suggests you to design your own fine-tune behaviour if you want
to use dropout or dropconnect.

## This code available for python 2 and python 3

stacked_autoencoder.py took about 2 seconds when training the 1st hidden layer
on NVIDIA GeForce GT 750M.

## Author:

Hao Dong

Department of Computing & Data Science Institute

Imperial College London

# StackedNet - 一个轻量级的逐层训练代码

>"StackedNet" 类，基于Theano的GPU加速，实现了栈式稀疏自编码器，代码可以很容易地根据你的需求修改。
>若需要高级版本代码，请联系作者

## 目的

目前开源的深度学习库基本都不提供逐层训练的功能，开发者和研究者需要花很多精力来自己
开发这些功能。因此，发布这个非常容易做二次修改的轻量级的代码，可以帮助大家提高效率。

## start_greedy_layer_training (unsupervised)

该方法用于隐层的逐层预训练，以初始化weights。每个隐层默认为稀疏自编码器，使用sig-
moid函数和mean square error作为损失函数。用户可以自行修改损失函数，以实现自己想
要的效果。

## start_fine_tune_training (supervised)

该方法用于微调逐层训练后的整个神经网络，使用softmax作为输出层，cross-entropy作为
损失函数，没用使用任何的dropout和regularization。
然而，该代码例子会保存所有weights和bias的值，若用户想要设计自己的微调方法，作者建
议自己重写微调算法，或者用其他开源库来微调之。

## 该代码支持 python 2 和 python 3

stacked_autoencoder.py 在使用NVIDIA GeForce GT 750M显卡训练第一层时，大约要
2秒一个epoch。

## 作者:

董豪

计算机系及数据科学研究所

伦敦帝国理工

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/zsdonghao/stackednet

Awesome Lists containing this project

README