https://github.com/alishobeiri/applied-machine-learning
Several projects exploring different classical machine learning concepts
https://github.com/alishobeiri/applied-machine-learning
Last synced: 6 months ago
JSON representation
Several projects exploring different classical machine learning concepts
- Host: GitHub
- URL: https://github.com/alishobeiri/applied-machine-learning
- Owner: alishobeiri
- Created: 2018-12-14T05:14:36.000Z (almost 7 years ago)
- Default Branch: master
- Last Pushed: 2018-12-14T05:50:20.000Z (almost 7 years ago)
- Last Synced: 2025-02-14T13:50:32.741Z (8 months ago)
- Language: Jupyter Notebook
- Homepage:
- Size: 51.7 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Applied Machine Learning COMP 551
This repo is a collection of all assignments completed as part of an applied machine learning course I took at McGill University. The description of each individual assignment is shown below:#### Assignment 1: Linear Regression, Data Imputation
* Linear regression implementation by hand (using only Numpy)
* L2 Regularization
* Stochastic Gradient Descent Implementation by hand
* Data imputation, missing features
* Linear regression on [communities and crime dataset](http://archive.ics.uci.edu/ml/datasets/communities+and+crime)#### Assignment 2: GDA, KNN
* Gaussian Discriminant Analysis by hand (using only Numpy)
* Shared covariance
* Non shared covariance
* K nearest neighbors by hand (using only Numpy)
* Comparison of decision boundary and performance for all classifiers#### Assignment 3: Sentiment Analysis, BBoW, FBoW
Preprocessed data into frequency bag of words representation and binary bag of words representation, comparing performance of each on the classifiers listed below.
Classifiers used and cross validated for best hyperparameter performance:
* SVM (linear)
* Decision Trees
* Gaussian Naive Bayes* Natural Language Processing on:
* YELP Dataset:
* 10000 reviews
* 5 class problem (ratings 1 to 5)
* IMDB Dataset:
* 50000 reviews
* 2 class problem (1 positive, 0 negative)