https://github.com/shreyb2091/pclub_recruitment_task
- Host: GitHub
- URL: https://github.com/shreyb2091/pclub_recruitment_task
- Owner: ShreyB2091
- Created: 2022-08-16T16:16:15.000Z (almost 3 years ago)
- Default Branch: main
- Last Pushed: 2022-08-16T18:30:54.000Z (almost 3 years ago)
- Last Synced: 2025-02-05T19:43:25.027Z (5 months ago)
- Language: Jupyter Notebook
- Size: 25.4 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# TASK 4 - Under The Hood!
## About the Task
I have implemented a neural network from scratch. It is a multi-class classification neural network with a single hidden layer. I have used only the NumPy, Pandas and Matplotlib libraries to implement this algorithm.

## Data
The training data is in the file `Surgical-deepnet.csv`. This dataset was taken from Kaggle. To know more, [click here](https://www.causeweb.org/tshs/datasets/Surgery%20Timing%20Data%20Dictionary.pdf).
## Approach
First, I load the data into a Pandas DataFrame and create the training data. Then I wrote a class to set up the network. To initialize the network you provide the learning rate, the number of nodes in the hidden layer (more nodes in the hidden layer generally give higher accuracy, but the running time of the code also increases!) and the number of iterations you want to carry out; a small sketch of such an initializer follows.
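The class itself lives in the notebook; the following is only a minimal sketch of what such an initializer could look like. The names here (`NeuralNetwork`, `lr`, `n_hidden`, `n_iter`, `init_params`) are assumptions for illustration, not necessarily the repository's actual identifiers.

```python
import numpy as np

class NeuralNetwork:
    """Minimal sketch of a one-hidden-layer network (hypothetical names)."""

    def __init__(self, lr=0.02, n_hidden=30, n_iter=5000):
        self.lr = lr              # learning rate
        self.n_hidden = n_hidden  # nodes in the hidden layer
        self.n_iter = n_iter      # gradient-descent iterations
        self.param = {}           # weights and biases, filled in init_params
        self.ch = {}              # cache of intermediate values for backprop

    def init_params(self, n_in, n_out):
        # dims[0] = input features, dims[1] = hidden nodes, dims[2] = output nodes
        self.dims = [n_in, self.n_hidden, n_out]
        rng = np.random.default_rng(0)
        self.param["W1"] = rng.standard_normal((self.dims[1], self.dims[0])) * 0.01
        self.param["b1"] = np.zeros((self.dims[1], 1))
        self.param["W2"] = rng.standard_normal((self.dims[2], self.dims[1])) * 0.01
        self.param["b2"] = np.zeros((self.dims[2], 1))
```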
The dimensions of the various matrices are stored in a list `dims`. The weights and biases for each layer of the network (input layer, hidden layer and output layer) are stored in a dictionary `param`. The dictionary `ch` stores the cached intermediate values, which are later used in the back-propagation step. The forward-propagation function uses `tanh` and `sigmoid` as the activation functions. I used the Cross-Entropy Loss function to calculate the loss at each iteration, and the parameters are then updated accordingly.
Here is some of the maths behind the algorithm.
For one example $x^{(i)}$:
$$z^{[1] (i)} = W^{[1]} x^{(i)} + b^{[1]}$$
$$a^{[1] (i)} = \tanh(z^{[1] (i)})$$
$$z^{[2] (i)} = W^{[2]} a^{[1] (i)} + b^{[2]}$$
$$\hat{y}^{(i)} = a^{[2] (i)} = \sigma(z^{[2] (i)})$$

Given the predictions on all the examples, you can also compute the cost $J$ as follows:

$$J = - \frac{1}{m} \sum_{i = 1}^{m} \left( y^{(i)}\log\left(a^{[2] (i)}\right) + (1-y^{(i)})\log\left(1- a^{[2] (i)}\right) \right)$$

I tried running the code for many values of the learning rate, the number of iterations and the number of nodes in the hidden layer.
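As a concrete illustration of these equations, here is a small NumPy sketch of the forward pass and the cost computation. The function and variable names (`forward`, `compute_cost`, `W1`, `b1`, ...) are assumptions and may differ from the notebook's code.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def forward(X, param):
    """Forward pass with tanh hidden activation and sigmoid output.
    X has shape (n_features, m); param holds W1, b1, W2, b2."""
    Z1 = param["W1"] @ X + param["b1"]    # z[1] = W[1] x + b[1]
    A1 = np.tanh(Z1)                      # a[1] = tanh(z[1])
    Z2 = param["W2"] @ A1 + param["b2"]   # z[2] = W[2] a[1] + b[2]
    A2 = sigmoid(Z2)                      # y_hat = a[2] = sigma(z[2])
    ch = {"Z1": Z1, "A1": A1, "Z2": Z2, "A2": A2}  # cache for backprop
    return A2, ch

def compute_cost(A2, Y):
    """Cross-entropy cost J averaged over the m examples."""
    m = Y.shape[1]
    J = -np.sum(Y * np.log(A2) + (1 - Y) * np.log(1 - A2)) / m
    return float(J)
```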
I found the best result with `learning rate = 0.02`, `number of iterations = 5000` and `number of nodes in the hidden layer = 30`.
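Assuming a class interface like the sketch above, training with those hyperparameters might look like the following; the target column (taken here as the last column of the CSV) and the overall workflow are assumptions, not the notebook's exact code.

```python
import pandas as pd

# Load the dataset described in the Data section.
df = pd.read_csv("Surgical-deepnet.csv")
X = df.iloc[:, :-1].to_numpy().T            # features, shape (n_features, m)
Y = df.iloc[:, -1].to_numpy().reshape(1, -1) # labels, shape (1, m)

# Hyperparameters reported as best above.
net = NeuralNetwork(lr=0.02, n_hidden=30, n_iter=5000)
net.init_params(n_in=X.shape[0], n_out=1)
# A training loop (not shown) would repeat: forward pass, cost, backprop, parameter update.
```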