Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Neural Network implementation with NumPy
- Host: GitHub
- URL: https://github.com/mmz33/nn-from-scratch
- Owner: mmz33
- Created: 2019-04-25T21:38:04.000Z (over 5 years ago)
- Default Branch: master
- Last Pushed: 2022-06-25T09:37:13.000Z (over 2 years ago)
- Last Synced: 2023-03-05T13:33:48.773Z (over 1 year ago)
- Topics: backpropagation-algorithm, deep-learning, neural-networks
- Language: Python
- Homepage:
- Size: 82 KB
- Stars: 1
- Watchers: 1
- Forks: 1
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# Neural Network From Scratch
The purpose of this code is to give a taste of what really goes on inside a neural network (NN). It was written for educational purposes and for fun :) Briefly, a NN is a function that maps an input to some output, depending on the task. In between lie the so-called "hidden layers", and that is where all the magic happens. These layers have trainable parameters. The goal is to minimize some loss function that depends on the network parameters. To find the optimal parameters, we use some kind of gradient descent optimizer (e.g. stochastic gradient descent, Momentum, etc.) together with the backpropagation algorithm.
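As a loose illustration of these ideas (not the repository's actual classes), here is a minimal NumPy sketch of one forward pass, one backpropagation pass, and one plain gradient descent update for a tiny network with a single hidden layer; all names and dimensions are made up for the example:

```python
import numpy as np

# Toy setup: map 2-dimensional inputs to 1 output through one hidden layer.
rng = np.random.default_rng(0)
x = rng.standard_normal((4, 2))                      # batch of 4 inputs
y = rng.standard_normal((4, 1))                      # target outputs
W1, b1 = rng.standard_normal((2, 3)), np.zeros(3)    # hidden layer parameters
W2, b2 = rng.standard_normal((3, 1)), np.zeros(1)    # output layer parameters
lr = 0.1                                             # learning rate for plain SGD

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# Forward pass: input -> hidden layer -> output, then a mean squared error loss.
h = sigmoid(x @ W1 + b1)
y_hat = h @ W2 + b2
loss = np.mean((y_hat - y) ** 2)

# Backward pass (backpropagation): apply the chain rule layer by layer.
d_y_hat = 2.0 * (y_hat - y) / y.shape[0]
dW2, db2 = h.T @ d_y_hat, d_y_hat.sum(axis=0)
d_h = d_y_hat @ W2.T
d_z1 = d_h * h * (1.0 - h)                           # sigmoid derivative
dW1, db1 = x.T @ d_z1, d_z1.sum(axis=0)

# Gradient descent step: move each parameter against its gradient.
W1 -= lr * dW1; b1 -= lr * db1
W2 -= lr * dW2; b2 -= lr * db2
```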
## Datasets
Currently, the `datasets` folder contains a Python script called `mnist.py` that downloads (if necessary) and automatically prepares the MNIST dataset for you. This can be extended later to support other datasets, and of course you can implement your own dataset class and integrate it with the current code, e.g. along the lines of the sketch below.
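For illustration only, a custom dataset class might look like the following; the class name and interface are assumptions and do not necessarily match what `datasets/mnist.py` or the engine actually expect:

```python
import numpy as np

class RandomDataset:
    """Hypothetical dataset class; the real reference implementation is datasets/mnist.py."""

    def __init__(self, num_samples=1000, input_dim=784, num_classes=10, seed=0):
        rng = np.random.default_rng(seed)
        self.inputs = rng.standard_normal((num_samples, input_dim))
        self.labels = rng.integers(0, num_classes, size=num_samples)

    def batches(self, batch_size=32):
        # Yield (inputs, labels) mini-batches, e.g. for a training loop.
        for start in range(0, len(self.labels), batch_size):
            yield (self.inputs[start:start + batch_size],
                   self.labels[start:start + batch_size])
```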
## Components
- `main.py`: the main entry point.
- `config.py`: parses the JSON config file that contains the network definition and other (hyper)parameters.
- `engine.py`: backend engine that extracts the content of the parsed JSON config, constructs the network layers, implements the train and test functions, etc.
- `nn_module.py`: represents a NN module such as a layer, activation function, loss function, etc.
- `model.py`: represents the NN model, which is a stack of modules (see the sketch after this list).
- `log.py`: a logger that controls log output via an integer verbosity level.
- `utils.py`: contains some helper functions.
- `tests.py`: contains functionality test functions.
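To illustrate the module/model split described above, here is a self-contained sketch of the "stack of modules" idea; the class names and interfaces are assumptions and may differ from the real `nn_module.py` and `model.py`:

```python
import numpy as np

# Hypothetical illustration only: a module exposes forward(), and the model
# simply chains its modules in order.
class Linear:
    def __init__(self, in_dim, out_dim, seed=0):
        rng = np.random.default_rng(seed)
        self.W = rng.standard_normal((in_dim, out_dim)) * 0.01
        self.b = np.zeros(out_dim)

    def forward(self, x):
        return x @ self.W + self.b

class ReLU:
    def forward(self, x):
        return np.maximum(x, 0.0)

class Model:
    def __init__(self, modules):
        self.modules = modules            # the model is just an ordered stack

    def forward(self, x):
        for module in self.modules:       # data flows through each module in turn
            x = module.forward(x)
        return x

model = Model([Linear(784, 128), ReLU(), Linear(128, 10)])
logits = model.forward(np.zeros((1, 784)))  # shape (1, 10)
```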
## Dependencies
It is recommended to create a virtual environment and run `pip3 install -r requirements.txt`. The exact versions are not critical, since the code only uses basic methods. Note that `matplotlib` is only needed if you want to plot some images.
## Training
To train your model, define the network and the other parameters in a JSON file (see `configs/network1.json` for an example). For training, `task` should be set to `train` in the JSON config file. After that, you can simply run:
`python3 main.py json_file`
The model will be saved to `model_file`, which is defined in the JSON config file. Models are dumped as pickle files and can be loaded again for testing, for example with the snippet below.
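As a quick sanity check, a saved model can be loaded back with Python's standard `pickle` module; the path below is only a placeholder, the real one is whatever `model_file` points to in your config:

```python
import pickle

# Hypothetical path; use the "model_file" value from your JSON config instead.
with open("models/network1.pkl", "rb") as f:
    model = pickle.load(f)
print(type(model))  # should be the NN model object defined in model.py
```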
## Testing
For testing, just change `task` in the JSON config to `test`. The results on the MNIST dataset with the `configs/network1.json` config are:
```
Number of errors: 156/10000
Test accuracy: 98.44%
```