https://github.com/chapmanjacobd/clustersort-validation
- Host: GitHub
- URL: https://github.com/chapmanjacobd/clustersort-validation
- Owner: chapmanjacobd
- Created: 2024-09-24T04:58:52.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2024-09-24T09:55:10.000Z (over 1 year ago)
- Last Synced: 2025-08-26T09:56:14.444Z (5 months ago)
- Language: Python
- Size: 5.86 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
# clustersort-validation
Validating the clustering output of TF-IDF + k-means and of wordllama's clustering.
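To make the TF-IDF + k-means side of the comparison concrete, here is a minimal standard-library sketch of that pipeline. It is not the code in this repository: the function names `tfidf_vectors`, `sq_dist`, and `kmeans`, the `max_iter` and `seed` parameters, the whitespace tokenizer, and the smoothed IDF formula are all assumptions chosen for illustration. Inertia here means the sum of squared distances from each vector to the centroid of its assigned cluster.

```python
import math
import random
from collections import Counter


def tfidf_vectors(docs):
    """Return one L2-normalised sparse TF-IDF vector (a dict) per document."""
    tokenized = [doc.lower().split() for doc in docs]
    # Document frequency: number of documents each term occurs in.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    n_docs = len(docs)
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        # Smoothed IDF (an assumption) so a term found in every document keeps a positive weight.
        vec = {t: c * (math.log((1 + n_docs) / (1 + df[t])) + 1.0) for t, c in counts.items()}
        norm = math.sqrt(sum(w * w for w in vec.values())) or 1.0
        vectors.append({t: w / norm for t, w in vec.items()})
    return vectors


def sq_dist(a, b):
    """Squared Euclidean distance between two sparse vectors."""
    return sum((a.get(t, 0.0) - b.get(t, 0.0)) ** 2 for t in set(a) | set(b))


def kmeans(vectors, num_clusters, max_iter=100, seed=0):
    """Plain Lloyd's algorithm; returns (labels, inertia)."""
    rng = random.Random(seed)
    centroids = rng.sample(vectors, num_clusters)
    labels = None
    for _ in range(max_iter):
        # Assignment step: each vector goes to its nearest centroid.
        new_labels = [
            min(range(num_clusters), key=lambda k: sq_dist(v, centroids[k]))
            for v in vectors
        ]
        if new_labels == labels:
            break  # assignments stopped changing
        labels = new_labels
        # Update step: each centroid becomes the mean of its members.
        for k in range(num_clusters):
            members = [v for v, lab in zip(vectors, labels) if lab == k]
            if members:  # an empty cluster keeps its previous centroid
                total = {}
                for v in members:
                    for t, w in v.items():
                        total[t] = total.get(t, 0.0) + w
                centroids[k] = {t: w / len(members) for t, w in total.items()}
    inertia = sum(sq_dist(v, centroids[lab]) for v, lab in zip(vectors, labels))
    return labels, inertia


# Hypothetical toy corpus, not the contents of simple.txt.
docs = ["red apple fruit", "green apple fruit", "fast car engine", "slow car engine"]
vectors = tfidf_vectors(docs)
for k in (1, 2, 4):
    labels, inertia = kmeans(vectors, k)
    print(k, labels, round(inertia, 4))
```

Once the number of clusters reaches the number of distinct documents, every vector can sit on its own centroid and the inertia drops to zero.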
    python evaluate_wordllama_cluster.py simple.txt

      num_clusters    min_iter       inertia     speed  ram_usage      label_quality
    --------------  ----------  ------------  --------  -----------  ---------------
                 2           2       36.9962  0.281219  187.8 MiB           0.082634
                 2          26        35.914  0.282369  206.2 MiB           0.157248
                 2          51        35.914  0.314825  187.2 MiB           0.157248
                 2          75        35.914  0.387392  168.3 MiB           0.157248
                 2         100        35.914  0.509942  165.4 MiB           0.157248
                 5           2       21.3687  0.239941  163.6 MiB           0.430233
                 5          26       21.2784  0.270611  165.4 MiB           0.514851
                 5          51       21.2784  0.383254  163.2 MiB           0.514851
                 5          75       21.2784  0.525094  165.4 MiB           0.514851
                 5         100       21.2784   0.75452  163.4 MiB           0.514851
                 8           2       8.30436  0.248233  165.5 MiB           0.778281
                 8          26       8.10929       0.3  162.9 MiB           0.778281
                 8          51       8.10929  0.433403  165.4 MiB           0.778281
                 8          75       8.10929  0.650738  162.7 MiB           0.778281
                 8         100       8.10929  0.984076  165.4 MiB           0.778281
                11           2   7.55382e-14  0.240138  165.4 MiB                  1
                11          26   7.55382e-14  0.324339  165.4 MiB                  1
                11          51   7.55382e-14  0.497623  162.6 MiB                  1
                11          75   7.55382e-14  0.818201  165.5 MiB                  1
                11         100   7.55382e-14   1.20669  162.9 MiB                  1
                14           2   7.55382e-14  0.243643  165.4 MiB                  1
                14          26   7.55382e-14  0.330196  165.4 MiB                  1
                14          51   7.55382e-14  0.561486  165.4 MiB                  1
                14          75   7.55382e-14  0.917747  162.8 MiB                  1
                14         100   7.55382e-14    1.4413  163.4 MiB                  1

    Best Combination: {'num_clusters': np.int64(11), 'min_iter': np.int64(51), 'inertia': 7.553819494823116e-14, 'speed': 0.4976229667663574, 'ram_usage': 170541056, 'label_quality': 1.0}
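
The output does not say how the best combination is chosen. The sketch below shows one ordering that happens to reproduce the reported row from the table above: highest `label_quality` first, then lowest `inertia`, then lowest RAM usage. This is a guess that fits the printed numbers, not the script's confirmed logic. The `rank_key` function and the `ram_mib` field are hypothetical names, and only a subset of rows is copied in.

```python
# A subset of rows copied from the table above; RAM is kept in MiB as printed.
rows = [
    {"num_clusters": 8, "min_iter": 75, "inertia": 8.10929, "speed": 0.650738, "ram_mib": 162.7, "label_quality": 0.778281},
    {"num_clusters": 11, "min_iter": 2, "inertia": 7.55382e-14, "speed": 0.240138, "ram_mib": 165.4, "label_quality": 1.0},
    {"num_clusters": 11, "min_iter": 51, "inertia": 7.55382e-14, "speed": 0.497623, "ram_mib": 162.6, "label_quality": 1.0},
    {"num_clusters": 11, "min_iter": 100, "inertia": 7.55382e-14, "speed": 1.20669, "ram_mib": 162.9, "label_quality": 1.0},
    {"num_clusters": 14, "min_iter": 2, "inertia": 7.55382e-14, "speed": 0.243643, "ram_mib": 165.4, "label_quality": 1.0},
    {"num_clusters": 14, "min_iter": 75, "inertia": 7.55382e-14, "speed": 0.917747, "ram_mib": 162.8, "label_quality": 1.0},
]


def rank_key(row):
    # Assumed ordering: higher label_quality, then lower inertia, then lower RAM.
    return (-row["label_quality"], row["inertia"], row["ram_mib"])


best = min(rows, key=rank_key)
print(best["num_clusters"], best["min_iter"])

# The raw ram_usage in the "Best Combination" line is in bytes; convert to MiB.
print(f"{170541056 / 2**20:.1f} MiB")
```

On these rows the ordering selects `num_clusters=11`, `min_iter=51`, and the byte count converts to the 162.6 MiB shown in that row of the table. Note that speed is not part of this ordering: several rows with `label_quality` of 1 ran faster than the reported best.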