Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/rcompton/ml_cheat_sheet
My notes and superstitions about common machine learning algorithms
https://github.com/rcompton/ml_cheat_sheet
Last synced: about 2 months ago
JSON representation
My notes and superstitions about common machine learning algorithms
- Host: GitHub
- URL: https://github.com/rcompton/ml_cheat_sheet
- Owner: rcompton
- Created: 2015-05-03T19:34:30.000Z (over 9 years ago)
- Default Branch: master
- Last Pushed: 2017-05-12T19:40:15.000Z (over 7 years ago)
- Last Synced: 2024-07-31T19:15:32.791Z (5 months ago)
- Language: Jupyter Notebook
- Size: 18.7 MB
- Stars: 364
- Watchers: 43
- Forks: 190
- Open Issues: 2
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
- my-awesome-starred - ml_cheat_sheet - My notes and superstitions about common machine learning algorithms (Others)
README
# Supervised learning superstitions cheat sheet
This notebook contains my notes and beliefs about several commonly-used supervised learning algorithms. My dream is that it will be useful as a quick reference or for people who are studying for machine learning interviews/quizzes/etc..
After some setup code, the methods discussed are:
+ Logistic regression
+ Decision trees
+ Support vector machines
+ K Nearest neighbors
+ Naive BayesTo better understand each classifier we train on various versions of the "two moons" dataset and plot empirical decision boundaries. Each plot shows the training data on top of a few thousand randomly chosen points which have been colored by the output of the learned model. *Superstition #1:* The plots suggest that linear classifiers are often out performed on high quality training sets but still produce sane results on noisy small datasets. **Note: not all the plots have the same
xy dimensions.**Other resources:
+ http://blog.echen.me/2011/04/27/choosing-a-machine-learning-classifier/ (good blog about choosing a classifier)
+ http://hunch.net/?p=22 (about overfitting)
+ http://www.dataschool.io/comparing-supervised-learning-algorithms/ (table of superstitions)
+ https://github.com/soulmachine/machine-learning-cheat-sheet (more like cheat 100 sheets)
+ http://scott.fortmann-roe.com/docs/BiasVariance.html (blog about the bias/variance problem)