# Supervised learning superstitions cheat sheet

This notebook contains my notes and beliefs about several commonly used supervised learning algorithms. My dream is that it will be useful as a quick reference, or for people who are studying for machine learning interviews, quizzes, etc.

After some setup code, the methods discussed are:
+ Logistic regression
+ Decision trees
+ Support vector machines
+ K-nearest neighbors
+ Naive Bayes
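
To make the list concrete, here is a minimal sketch of how these five classifiers might be instantiated in scikit-learn; the hyperparameter values below are illustrative assumptions, not necessarily the settings used in the notebook.

```python
# Hypothetical setup for the five classifiers discussed in the notebook.
# Hyperparameter values are assumptions for illustration only.
from sklearn.linear_model import LogisticRegression
from sklearn.tree import DecisionTreeClassifier
from sklearn.svm import SVC
from sklearn.neighbors import KNeighborsClassifier
from sklearn.naive_bayes import GaussianNB

classifiers = {
    "Logistic regression": LogisticRegression(),
    "Decision tree": DecisionTreeClassifier(max_depth=5),
    "Support vector machine": SVC(kernel="rbf", gamma=2.0, C=1.0),
    "K-nearest neighbors": KNeighborsClassifier(n_neighbors=5),
    "Naive Bayes": GaussianNB(),
}
```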

To better understand each classifier, we train on various versions of the "two moons" dataset and plot empirical decision boundaries. Each plot shows the training data on top of a few thousand randomly chosen points that have been colored by the output of the learned model. *Superstition #1:* The plots suggest that linear classifiers are often outperformed by the more flexible models on high-quality training sets, but still produce sane results on small, noisy datasets. **Note: not all the plots have the same xy dimensions.**
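
As a rough illustration of the plotting approach described above (the noise level, sample counts, and choice of classifier are assumptions, not the notebook's exact values), one could do something like:

```python
# Sketch: train on "two moons", then color a few thousand randomly chosen
# points by the model's prediction and overlay the training data.
import numpy as np
import matplotlib.pyplot as plt
from sklearn.datasets import make_moons
from sklearn.svm import SVC

X_train, y_train = make_moons(n_samples=200, noise=0.3, random_state=0)
clf = SVC(kernel="rbf", gamma=2.0, C=1.0).fit(X_train, y_train)

# A few thousand random points covering the plotting region,
# colored by the learned model's output.
rng = np.random.RandomState(0)
lo, hi = X_train.min(axis=0) - 0.5, X_train.max(axis=0) + 0.5
pts = rng.uniform(lo, hi, size=(5000, 2))
plt.scatter(pts[:, 0], pts[:, 1], c=clf.predict(pts),
            cmap="coolwarm", alpha=0.2, s=10)

# Training data plotted on top of the colored background.
plt.scatter(X_train[:, 0], X_train[:, 1], c=y_train,
            cmap="coolwarm", edgecolors="k")
plt.title("Empirical decision boundary (SVM with RBF kernel)")
plt.show()
```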

Other resources:
+ http://blog.echen.me/2011/04/27/choosing-a-machine-learning-classifier/ (good blog about choosing a classifier)
+ http://hunch.net/?p=22 (about overfitting)
+ http://www.dataschool.io/comparing-supervised-learning-algorithms/ (table of superstitions)
+ https://github.com/soulmachine/machine-learning-cheat-sheet (more like 100 cheat sheets)
+ http://scott.fortmann-roe.com/docs/BiasVariance.html (blog about the bias/variance problem)