Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/bl0nder/regressify
A multi-task neural network coded from scratch that performs simultaneous regression and classification.
https://github.com/bl0nder/regressify
classification deep-learning multitask-learning neural-networks numpy regression
Last synced: 8 days ago
JSON representation
A multi-task neural network coded from scratch that performs simultaneous regression and classification.
- Host: GitHub
- URL: https://github.com/bl0nder/regressify
- Owner: bl0nder
- License: mit
- Created: 2023-06-24T10:17:09.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2023-06-27T07:51:06.000Z (over 1 year ago)
- Last Synced: 2024-01-31T18:57:57.456Z (11 months ago)
- Topics: classification, deep-learning, multitask-learning, neural-networks, numpy, regression
- Language: Jupyter Notebook
- Homepage:
- Size: 142 KB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# π¦Ύ Regressify
Regressify is a multi-task neural network coded from scratch that performs simultaneous regression and classification.## π οΈ Tools Used
π Data
The neural network has been trained on a dummy dataset of 2 features F1 and F2, and 2 target variables T1 and T2, corresponding to a class label and a regression value respectively. Here is a sample subset of the same:| F1 | F2 | T1 (class label) | T2 (regression value) |
|---|---|---|---|
| 1.4 | 0.2 | 1 | 0.28 |
| 1.6 | 0.2 | 1 | 0.32 |
| 4 | 1.2 | 2 | 4.8 |
| 3.3 | 1 | 2 | 3.3 |
| ... | ... | ... | ... |- There are **150 samples** in the dataset
- Each sample belongs to one of **2 classes** where the class ID is either 1 or 2ποΈ Neural Network Architecture
### Input Layer (Layer 0)
Consists of two neurons, each corresponding to one of the two feature variables, F1 and F2.
### Hidden Layers
- Controlled by the `num_layers` hyperparameter. Note: `num_layers` includes the last (output) layer. Thus, the number of hidden layers is `num_layers - 1`
- Each hidden layer can have either a `sigmoid` or a `tanh` activation function applied to it. Hyperparameter: `layer_activation_fns`
- `num_neurons` controls the number of neurons in each layer
### Output Layer
- Has two neurons, one for a classification prediction, and the other for a regression value
- The matrix output by the penultimate layer undergoes:
+ a `sigmoid` function to obtain the respective probabilities of each sample belonging to one of the two classes
+ a `linear` function to obtain regression values corresponding to each sampleβΎοΈ Math
### Forward Propagation
Used to product an output in the forward direction by sequentially processing the input data through each layer of the neural network.This processing involves mutliplying each layer's input with its corresponding weight matrix and then passing the product to an activation function
For each layer L,
$$Z^{[L]} = W^{[L]}A^{[L-1]} + b^{[L]}$$
$$A^{[L]} = g^{[L]}(Z^{[L]})$$
where
$W^{[L]} =$ Weight matrix of layer L
$b^{[L]} =$ Bias vector of layer L
$A^{[L]} =$ Output matrix of layer L $=$ Input to layer L+1
$g^{[L]} =$ Activation function of layer L
### Backward Propagation
Involves computation of losses in the backward direction which in turn allows for changes in layer weights to make the network produce more accurate outputs.For each layer L,
$$\frac{\partial C}{\partial Z^{[L]}} = W^{[L+1]^T} \frac{\partial C}{\partial Z^{[L+1]}} \odot g'^{[L]}(Z^{[L]})$$
$$\frac{\partial C}{\partial W^{[L]}} = \frac{\partial C}{\partial Z^{[L]}}A^{[L-1]^T}$$
$$\frac{\partial C}{\partial b^{[L]}} = \frac{\partial C}{\partial Z^{[L]}}$$
$$W^{[L]} = W^{[L]} - \alpha \frac{\partial C}{\partial W^{[L]}}$$
$$b^{[L]} = b^{[L]} - \alpha \frac{\partial C}{\partial b^{[L]}}$$
where
$C =$ Cost/loss computed by neural network
$\odot =$ Element-wise multiplication
$\alpha =$ Learning rate
Note for the last layer:
$$\frac{\partial C}{\partial Z^{[Last]}} = \frac{\partial C}{\partial A^{[Last]}} \odot g'^{[Last]}(Z^{[Last]})$$
π Performance Analysis
### Training
Classification accuracy: `0.975`Regression $R^2$ score: `0.9712041992163881`
### Validation
Classification accuracy: `1.0`Regression $R^2$ score: `0.723742056420074`