Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/howardyclo/kmeans-dbscan-tutorial
A clustering tutorial with scikit-learn for beginners.
https://github.com/howardyclo/kmeans-dbscan-tutorial
clustering-algorithm dbscan ipython-notebook kmeans scikit-learn tutorial
Last synced: 3 months ago
JSON representation
A clustering tutorial with scikit-learn for beginners.
- Host: GitHub
- URL: https://github.com/howardyclo/kmeans-dbscan-tutorial
- Owner: howardyclo
- Created: 2017-02-03T06:28:58.000Z (almost 8 years ago)
- Default Branch: master
- Last Pushed: 2017-02-07T14:11:31.000Z (almost 8 years ago)
- Last Synced: 2024-09-28T23:24:03.041Z (4 months ago)
- Topics: clustering-algorithm, dbscan, ipython-notebook, kmeans, scikit-learn, tutorial
- Language: HTML
- Homepage:
- Size: 13.7 MB
- Stars: 21
- Watchers: 4
- Forks: 13
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# kmeans-dbscan-tutorial
A clustering tutorial with **scikit-learn** for beginners.## Contents
1. Introduction to **k-means**, **k-means++** and **DBSCAN (Density-Based Spatial Clustering Algorithm with Noise)**.2. Explore common drawbacks of k-means, such as:
- Need to choose the right number of clusters.
- Cannot handle Noise Data and Outliers.
- Cannot handle Non-spherical Data.
And of course, present solutions for the above drawbacks.3. Introduction to supervised and unsupervised methods for measuring cluster quality such as homogeneity, completeness and the Silhouette Coefficient (part of section 2).
4. Two simple exercises (k-means & DBSCAN) along with the tutorial.
## Get Started
- Please refer to the slides in `slides/` or review then on google drive, there are [Chinese version](https://docs.google.com/presentation/d/1sgo4Bx0mF9fZXGZoD6F8wEUBPRWhR90ucoKwz8aLmCM/edit?usp=sharing) and [English version](https://docs.google.com/presentation/d/1o_rTjzkK7_q672rociNBu11R5dEDlACtrWrfR34FQ3s/edit?usp=sharing).
- Codes are in `tutorial_and_labs/`, each `.ipynb` has its corresponding `.html`.