https://github.com/geekquad/knn-from-scratch

A basic project to implement the KNN classifier from Scratch.
https://github.com/geekquad/knn-from-scratch

knn knn-classifier python scratch-implementation sklearn

Last synced: 3 months ago
JSON representation

A basic project to implement the KNN classifier from Scratch.

Host: GitHub
URL: https://github.com/geekquad/knn-from-scratch
Owner: geekquad
Created: 2020-06-21T18:59:23.000Z (about 6 years ago)
Default Branch: master
Last Pushed: 2020-06-23T19:38:47.000Z (about 6 years ago)
Last Synced: 2025-01-07T05:16:26.847Z (over 1 year ago)
Topics: knn, knn-classifier, python, scratch-implementation, sklearn
Language: Jupyter Notebook
Homepage:
Size: 12.7 KB
Stars: 0
Watchers: 2
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# KNN-from-Scratch
KNN which stands for K-Nearest Neighbours is a simple algorithm that is used for **classification** and **regression** problems in Machine Learning. KNN is a **non-parametric** and **lazy learning algorithm**. Non-parametric means there is no assumption for underlying data distribution. In other words, the model structure determined from the dataset. This will be very helpful in practice where most of the real world datasets do not follow mathematical theoretical assumptions. Lazy algorithm means it does not need any training data points for model generation. All training data used in the testing phase. This makes training faster and testing phase slower and costlier. Costly testing phase means time and memory.

In the worst case, KNN needs more time to scan all data points and scanning all data points will require more memory for storing training data.

### Working of KNN:
In KNN, K is the number of nearest neighbors. The number of neighbors is the core deciding factor. K is generally an odd number if the number of classes is 2. When K=1, then the algorithm is known as the nearest neighbor algorithm. This is the simplest case.

### Documentation of KNN:
https://scikit-learn.org/stable/modules/generated/sklearn.neighbors.KNeighborsClassifier.html

### Parameters:

n_neighbors (default value = 5)

p: (default=”minkowski”)

> When p = 2, this is equivalent to using euliddean_distance(l2)

### Evaluation of the model (without parameters tuning):

precision recall f1-score support
0 0.77 0.83 0.80 12
1 0.72 0.54 0.62 24
2 0.48 0.61 0.54 18
avg / total 0.65 0.63 0.63 54

#### Accuracy: 0.62

### Evaluation of the model (after parameters tuning):

precision recall f1-score support
0 0.77 0.83 0.80 12
1 0.71 0.62 0.67 24
2 0.50 0.56 0.53 18
avg / total 0.66 0.65 0.65 54

#### Accuracy: 0.64
As we can see, after parameters tuning, the accuracy of the model **increased.**

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/geekquad/knn-from-scratch

Awesome Lists containing this project

README