https://github.com/ashvegeta/neural-network
working of neural network
- Host: GitHub
- URL: https://github.com/ashvegeta/neural-network
- Owner: ashvegeta
- License: MIT
- Created: 2020-06-04T06:52:31.000Z (about 5 years ago)
- Default Branch: master
- Last Pushed: 2023-12-27T18:17:47.000Z (over 1 year ago)
- Last Synced: 2023-12-27T21:07:58.560Z (over 1 year ago)
- Language: Jupyter Notebook
- Size: 252 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
README
# Neural Network ([to access the latest work click here](https://github.com/ashvegeta/Neural-Network/blob/master/vectorised_nn.ipynb))
## What is a neural network?
A neural network is a simplified representation of the connections made by neurons in the brain. Each unit is called a neuron and has inputs, assigned weights, and an output; these neurons are organised into layers, and the layers together make up the network.
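As a concrete illustration of this layered structure, the sketch below builds a tiny network as a list of weight matrices; the layer sizes and variable names are illustrative assumptions, not taken from the notebook.

```python
import numpy as np

# hypothetical layer sizes: 2 inputs, one hidden layer of 3 neurons, 1 output
layer_sizes = [2, 3, 1]

# one weight matrix per pair of consecutive layers;
# weights[l] has shape (neurons in layer l+1, neurons in layer l)
weights = [np.random.randn(layer_sizes[l + 1], layer_sizes[l])
           for l in range(len(layer_sizes) - 1)]

print([W.shape for W in weights])   # [(3, 2), (1, 3)]
```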
## What is forward propagation?
Forward propagation takes the dot product of a layer's activations with its assigned weights and applies the activation function to produce the activations of the next layer.
## Working of a neural network
A neural network consists of an input layer, hidden layers, and an output layer. Starting from the input activations, we forward propagate through each layer in turn, and this process continues until we reach the output layer.
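Continuing the sketch above, a minimal (assumed) implementation of forward propagation with a sigmoid activation; bias terms are introduced in the next section.

```python
def sigmoid(z):
    # activation function f(z) = 1 / (1 + exp(-z))
    return 1.0 / (1.0 + np.exp(-z))

def forward(x, weights):
    # each step computes a(L) = f(W . a(L-1)), starting from the input activations
    a = x
    for W in weights:
        a = sigmoid(W @ a)
    return a

x = np.array([0.5, -1.0])
hyp = forward(x, weights)   # activation of the output layer (the hypothesis)
```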
## What is backpropagation?
After forward propagating the activations from the input layer, through the hidden layers, to the output layer, we compute the errors in the output layer's activations and propagate those errors back towards the input layer in order to update the weights.

## What is a bias unit?
A bias unit is a neuron whose activation is fixed at 1 and which receives no input, i.e. its activation is independent of the previous layer's activations.
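The sketch below shows two equivalent ways of handling the bias: adding a separate bias vector, or appending a bias unit with constant activation 1 to the previous layer and folding the bias into the weight matrix (names and sizes are illustrative).

```python
W = np.random.randn(3, 2)
b = np.random.randn(3)
a_prev = np.array([0.5, -1.0])

# explicit bias vector
z1 = W @ a_prev + b

# bias as an extra neuron whose activation is always 1
W_aug = np.hstack([W, b.reshape(-1, 1)])   # bias column appended to the weights
a_aug = np.append(a_prev, 1.0)             # bias unit's activation
z2 = W_aug @ a_aug

assert np.allclose(z1, z2)
```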
## Steps in backpropagation
1. Compute the error E = sum((hyp - y)^2), the sum of squared differences between the hypothesis and the actual output. Our objective is to find the rate at which this error changes w.r.t. the weights, i.e. dE/dW(L).
2. We use the chain rule to find this derivative so that we can update the weights of each layer.
   Given E = sum((hyp - y)^2), the activation function f(z) = 1/(1 + exp(-z)), and z = sum(W . a(L-1)) (where a(L-1) is the activation of the previous layer):
   dE/dW(L) = dE/da * da/dz * dz/dW   (where a = f(z) is the activation of layer L, z is its weighted input, and W are its weights)
   1. dE/da = d(sum((hyp - y)^2))/da = 2 * (hyp - y)
   2. da/dz = d(f(z))/dz = f(z) * (1 - f(z))
   3. dz/dW = d(sum(W . a(L-1)))/dW = a(L-1)
   Combining all three we get
   dE/dW = 2 * (hyp - y) * f(z) * (1 - f(z)) * a(L-1)
   dE/dW = delta * a(L-1)   [where delta = 2 * (hyp - y) * f(z) * (1 - f(z))]
   For the biases, dE/dB = delta.
3. Update the weights by moving against the gradient; the learning rate decides the step size of gradient descent (a sketch of this update follows after these steps).
   W -= learning_rate * dE/dW
   B -= learning_rate * dE/dB
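A sketch of this delta computation and update for a single layer, assuming the squared error and sigmoid activation defined above; the function name and parameters are illustrative.

```python
def backprop_step(a_prev, W, b, y, learning_rate=0.1):
    # forward pass through one layer: z = W . a(L-1) + b, hyp = f(z)
    z = W @ a_prev + b
    hyp = sigmoid(z)

    # delta = 2 * (hyp - y) * f(z) * (1 - f(z))
    delta = 2.0 * (hyp - y) * hyp * (1.0 - hyp)

    # dE/dW = delta * a(L-1), dE/dB = delta
    dE_dW = np.outer(delta, a_prev)
    dE_dB = delta

    # gradient descent: move against the gradient
    W = W - learning_rate * dE_dW
    b = b - learning_rate * dE_dB
    return W, b
```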
## Steps in optimizing a neural network
1. Perform forward propagation.
2. Perform backpropagation and update the weights.
3. Repeat the above two steps until you reach the required degree of convergence (see the sketch below).
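Putting the two steps together into a training loop; this sketch reuses the hypothetical sigmoid and backprop_step functions from above and fits a single-layer network to one training example.

```python
rng = np.random.default_rng(0)
W = rng.standard_normal((1, 2))   # assumed single-layer network: 2 inputs -> 1 output
b = rng.standard_normal(1)
x = np.array([0.5, -1.0])
y = np.array([1.0])

for step in range(10_000):
    # step 1: forward propagation
    hyp = sigmoid(W @ x + b)
    err = np.sum((hyp - y) ** 2)
    # step 3: stop once the error has converged to the required degree
    if err < 1e-4:
        break
    # step 2: backpropagation and weight update
    W, b = backprop_step(x, W, b, y, learning_rate=0.5)

print(step, err)
```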
## References
[1] http://neuralnetworksanddeeplearning.com/
## Backpropagation Notes
1. Backprop equations -> https://drive.google.com/file/d/1zsC6jDtkNNd8Bz0v5LGEgYHqe865HZdM/view?usp=sharing
2. Error Propagation -> https://drive.google.com/file/d/1k4-R81M2RxBF5-DSWnA4lV8KL6gzjkUd/view?usp=sharing