https://github.com/mongshil553/scikit-learn-clustering
Sleep Health and Lifestyle Clustering using Scikit-Learn
clustering machine-learning scikit-learn-python
- Host: GitHub
- URL: https://github.com/mongshil553/scikit-learn-clustering
- Owner: mongshil553
- Created: 2024-07-18T15:20:51.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2024-07-25T03:06:11.000Z (over 1 year ago)
- Last Synced: 2025-01-13T14:18:54.601Z (10 months ago)
- Topics: clustering, machine-learning, scikit-learn-python
- Language: Python
- Homepage:
- Size: 3.14 MB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# Sleep Health and Lifestyle Clustering using Scikit-Learn
Data
https://www.kaggle.com/datasets/informateur234/sleep-health-and-lifestyle-dataset
Data Distribution
The images above show the data distribution after preprocessing. Linear discriminant analysis (LDA) was applied to separate the classes more clearly, which should make the data easier for the algorithms to learn. The training set was oversampled.
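The preprocessing described above could look roughly like the sketch below. It is not the repository's code: the synthetic data from `make_classification`, the `oversample` helper, the choice of random oversampling via `sklearn.utils.resample`, and the order of the steps (split, oversample the training set, then fit LDA) are all assumptions made for illustration.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.model_selection import train_test_split
from sklearn.utils import resample

# Synthetic, imbalanced stand-in for the preprocessed dataset (assumption).
X, y = make_classification(
    n_samples=600, n_features=8, n_informative=5, n_redundant=0,
    n_classes=3, n_clusters_per_class=1, weights=[0.6, 0.3, 0.1],
    random_state=0,
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)


def oversample(X, y, random_state=0):
    """Hypothetical helper: pad each minority class with randomly
    duplicated rows until it matches the majority class size."""
    classes, counts = np.unique(y, return_counts=True)
    target = counts.max()
    X_parts, y_parts = [], []
    for cls, count in zip(classes, counts):
        X_cls = X[y == cls]
        n_extra = int(target - count)
        if n_extra > 0:
            extra = resample(X_cls, replace=True, n_samples=n_extra,
                             random_state=random_state)
            X_cls = np.vstack([X_cls, extra])
        X_parts.append(X_cls)
        y_parts.append(np.full(len(X_cls), cls))
    return np.vstack(X_parts), np.concatenate(y_parts)


# Oversample the training set only, so the test set keeps its real distribution.
X_train_bal, y_train_bal = oversample(X_train, y_train)

# LDA is supervised: it uses the class labels to find separating directions.
# With 3 classes it can produce at most 2 components.
lda = LinearDiscriminantAnalysis(n_components=2)
X_train_lda = lda.fit_transform(X_train_bal, y_train_bal)
X_test_lda = lda.transform(X_test)

print(X_train_lda.shape, X_test_lda.shape)
```

Fitting LDA on the training data only and then transforming the test data keeps information from the test set out of the projection.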
Clustering Models
The following models were selected:
1. KMeans Clustering
2. MeanShift Clustering
3. Gaussian Mixture Clustering
4. DBSCAN Clustering
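As a minimal sketch, the four models listed above can be fitted through scikit-learn's common `fit_predict` interface. The synthetic data from `make_blobs` and every hyperparameter value here (`n_clusters=3`, `eps=0.5`, and so on) are placeholders chosen for illustration, not the settings used in this project.

```python
from sklearn.cluster import DBSCAN, KMeans, MeanShift
from sklearn.datasets import make_blobs
from sklearn.mixture import GaussianMixture
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for the projected features (assumption).
X, _ = make_blobs(n_samples=300, centers=3, cluster_std=0.8, random_state=0)
X = StandardScaler().fit_transform(X)

# Placeholder hyperparameters; the project's actual values are not stated.
models = {
    "KMeans": KMeans(n_clusters=3, n_init=10, random_state=0),
    "MeanShift": MeanShift(),  # bandwidth is estimated from the data by default
    "Gaussian Mixture": GaussianMixture(n_components=3, random_state=0),
    "DBSCAN": DBSCAN(eps=0.5, min_samples=5),
}

labels = {name: model.fit_predict(X) for name, model in models.items()}

for name, lab in labels.items():
    # DBSCAN marks noise points with the label -1; do not count it as a cluster.
    n_clusters = len(set(lab.tolist()) - {-1})
    print(f"{name}: {n_clusters} clusters")
```

KMeans and Gaussian Mixture need the number of clusters up front, while MeanShift and DBSCAN infer it from the data (via bandwidth and density parameters, respectively), which is one reason their results can differ noticeably before tuning.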
Clustering Result Before Hyper-Tuning
Clustering Result After Hyper-Tuning
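The README does not say how the hyperparameters were tuned. One common approach, shown here purely as an assumed example, is to scan a parameter range and keep the value with the best silhouette score. The data, the range of `k`, and the use of `silhouette_score` are illustrative choices, not the project's method.

```python
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.metrics import silhouette_score

# Synthetic stand-in data (assumption).
X, _ = make_blobs(n_samples=300, centers=4, cluster_std=0.6, random_state=0)

# Score each candidate number of clusters; silhouette lies in [-1, 1],
# and higher values indicate tighter, better separated clusters.
scores = {}
for k in range(2, 9):
    labels = KMeans(n_clusters=k, n_init=10, random_state=0).fit_predict(X)
    scores[k] = silhouette_score(X, labels)

best_k = max(scores, key=scores.get)
print(f"best n_clusters by silhouette: {best_k}")
```

The same loop works for other models by swapping the estimator and the scanned parameter, for example `eps` for DBSCAN or `bandwidth` for MeanShift, as long as each run yields at least two clusters so the silhouette score is defined.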
Model Analysis
KMeans:
MeanShift:
Gaussian Mixture:
DBSCAN: