https://github.com/msikorski93/seed-clustering

Performing basic clustering on a seeds dataset.
agglomerative clustering dbscan gaussian-mixture-model gmm mini-batch-kmeans scikit-learn seeds

# Seed-Clustering
![License: MIT](https://img.shields.io/badge/license-MIT-green?style=&logo=)
![Python](https://img.shields.io/badge/Python-3776AB?logo=python&logoColor=fff)
![Jupyter](https://img.shields.io/badge/-Jupyter-F37626?logo=Jupyter&logoColor=white)
![pandas](https://img.shields.io/badge/-pandas-150458?logo=Pandas&logoColor=white)
![SciPy](https://img.shields.io/badge/-SciPy-8CAAE6?logo=SciPy&logoColor=fff)
![scikit-learn](https://img.shields.io/badge/-scikit--learn-F7931E?logo=scikitlearn&logoColor=white)

This repository performs a basic cluster analysis on a seeds dataset. The dataset contains geometric properties of kernels belonging to three varieties of wheat. We applied four clustering approaches (agglomerative clustering, DBSCAN, a Gaussian mixture model, and mini-batch k-means) and obtained these results:



Agglomerative clustering turned out to be the best choice. DBSCAN had the lowest performance; however, it appears better suited to detecting outliers in datasets.
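
A comparison of this kind could be sketched roughly as follows with scikit-learn. This is an illustration only, not the notebook's actual code: the data is a synthetic stand-in generated with `make_blobs` (sized as 210 samples, 7 features, and 3 groups by assumption), the hyperparameters (`eps`, `min_samples`, `n_init`) are guesses, and the adjusted Rand index is an assumed choice of score. The last step counts the points DBSCAN labels as `-1` (noise), which is the behavior that makes it useful for spotting outliers.

```python
import numpy as np
from sklearn.cluster import AgglomerativeClustering, DBSCAN, MiniBatchKMeans
from sklearn.datasets import make_blobs
from sklearn.metrics import adjusted_rand_score
from sklearn.mixture import GaussianMixture
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for the seeds data (assumption): 210 samples,
# 7 numeric features, 3 groups. Swap in the real feature matrix and labels.
X, y_true = make_blobs(n_samples=210, n_features=7, centers=3, random_state=0)
X = StandardScaler().fit_transform(X)

# Hyperparameters are illustrative guesses, not the repository's settings.
models = {
    "agglomerative": AgglomerativeClustering(n_clusters=3),
    "dbscan": DBSCAN(eps=1.0, min_samples=5),
    "gmm": GaussianMixture(n_components=3, random_state=0),
    "mini_batch_kmeans": MiniBatchKMeans(n_clusters=3, n_init=10, random_state=0),
}

labels = {}
scores = {}
for name, model in models.items():
    # Every estimator used here exposes fit_predict, returning one label per sample.
    labels[name] = model.fit_predict(X)
    # Adjusted Rand index: agreement between cluster labels and the known groups.
    scores[name] = adjusted_rand_score(y_true, labels[name])

# DBSCAN assigns the label -1 to points outside any dense region (noise).
n_noise = int(np.sum(labels["dbscan"] == -1))

for name, score in scores.items():
    print(f"{name:>18}: ARI = {score:.3f}")
print(f"DBSCAN noise points: {n_noise}")
```

The scores printed here describe the synthetic data only and say nothing about the results reported above. DBSCAN in particular is sensitive to `eps` and `min_samples`, so these would need tuning on the real features.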