An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with hierarchical-clustering

A curated list of projects in awesome lists tagged with hierarchical-clustering .

https://github.com/shubhamjha97/hierarchical-clustering

A Python implementation of divisive and hierarchical clustering algorithms. The algorithms were tested on the Human Gene DNA Sequence dataset and dendrograms were plotted.

agglomerative-clustering clustering data-mining data-mining-algorithms dendrogram divisive-clustering hierarchical-clustering

Last synced: 26 Mar 2025

https://github.com/gcorso/neuroseed

Implementation of Neural Distance Embeddings for Biological Sequences (NeuroSEED) in PyTorch (NeurIPS 2021)

bioinformatics biological-sequences hierarchical-clustering machine-learning multiple-sequence-alignment neurips-2021 pytorch

Last synced: 07 May 2025

https://github.com/iesl/xcluster

Algorithms and evaluation tools for extreme clustering

clustering extreme-clustering hierarchical-clustering perch

Last synced: 12 Apr 2025

https://github.com/sdq/deepvis

machine learning algorithms in Swift

hierarchical-clustering kmeans lda machine-learning pca unsupervised-learning

Last synced: 17 Dec 2024

https://github.com/div-lab/dendromap

Interactively and visually explore large-scale image datasets used in machine learning using treemaps. VIS 2022

hierarchical-clustering image-dataset treemap visualization

Last synced: 04 Apr 2025

https://github.com/nmonath/graphgrove

A framework for building (and incrementally growing) graph-based data structures used in hierarchical or DAG-structured clustering and nearest neighbor search

clustering hierarchical-clustering incremental-clustering nearest-neighbor-search nearest-neighbors online-clustering

Last synced: 12 Apr 2025

https://github.com/QuaCau-TheSphere/Graphvidian

Obsidian plugin to export Graphviz graphs from vault's notes

graphviz hierarchical-clustering obsidian-md obsidian-plugin plugin

Last synced: 29 Jan 2025

https://github.com/koonimaru/radialtree

A python module to draw a circular dendrogram

circular clustering dendrogram hierarchical-clustering python3 radial

Last synced: 15 Apr 2025

https://github.com/nicolasfguillaume/Strategic-Business-Analytics-with-R

Foundations of Strategic Business Analytics - ESSEC Business school via Coursera.org

dplyr hierarchical-clustering hr predictive-maintenance r scoring strategic-business-analytics survival-analysis

Last synced: 04 Dec 2024

https://github.com/greenelab/hclust

Agglomerative hierarchical clustering in JavaScript

agglomerative-clustering clustering hierarchical-clustering javascript math node package

Last synced: 05 May 2025

https://github.com/ivan-pi/fortran-flann

Fortran bindings to the FLANN library for performing fast approximate nearest neighbor searches in high dimensional spaces.

approximate-nearest-neighbor-search hierarchical-clustering high-dimensional-data kdtree kmeans-clustering nearest-neighbor-search spatial-search

Last synced: 27 Mar 2025

https://github.com/chen0040/java-clustering

Package provides java implementation of various clustering algorithms

clustering-algorithm dbscan dbscan-clustering hierarchical-clustering k-means

Last synced: 16 Dec 2024

https://github.com/wolny/phash-hierarchical-clustering

Hierarchical clustering of images using phash and Hamming distance

hamming-distance hierarchical-clustering image-clusterization perceptual-hashing

Last synced: 16 May 2025

https://github.com/doaa-altarawy/lascad

LASCAD: Language-Agnostic Software Categorization and Similar Application Detection

hierarchical-clustering lda mining-software-repositories software-engineering topic-modeling

Last synced: 13 Apr 2025

https://github.com/moindalvs/assignment_east-west_airlines

Problem Statement Perform clustering (Hierarchical,K means clustering and DBSCAN) for the airlines data to obtain optimum number of clusters

clustering-algorithm data-science dbscan-clustering epsilon-greedy hierarchical-clustering kmeans-clustering

Last synced: 23 Apr 2025

https://github.com/kevalmorabia97/agglomerative-hierarchical-clustering-comparison

Comparison of different linkages like complete, averagem ward on different datasets by plotting data on 2D plane by matplotlib

agglomerative-clustering clustering hierarchical-clustering python3

Last synced: 21 Nov 2024

https://github.com/shuyib/phylogenetic-tree-study

Estimating Phylogenetic trees using six microorganisms 16S rRNA gene with Unsupervised Learning, web based tools and Molecular Evolutionary Genetics Analysis MEGA7

16s-rrna bokehplots hierarchical-clustering jupyter-notebook kenya machine-learning microorganism motif-analysis motif-discovery numpy-arrays pandas phylogenetics python scipy unsupervised-learning

Last synced: 19 Apr 2025

https://github.com/zekeriyyaa/anomaly-detection-based-on-clustering-of-mobile-robot-data

We proposed a new approach to detect anomalies of mobile robot data. We investigate each data seperately with two clustering method hierarchical and k-means. There are two sub-method that we used for produce an anomaly score. Then, we merge these two score and produce merged anomaly score as a result.

agv anomaly clustering currency dtw dynamic-time-warping feature-extraction hierarchical-clustering k-means-clustering mobile-robots normalization python robotics time-series-analysis time-series-data vibration

Last synced: 23 Feb 2025

https://github.com/chandraprakash-bathula/hierarchical-clustering

Hierarchical Clustering Algorithm on Wine Clustering data with dendograms visualization.

clustering hierarchical-clustering ipynb-jupyter-notebook

Last synced: 09 Mar 2025

https://github.com/ggeop/flag-study

Nations Flags Classification & Clustering project. :flags:

cart classification clustering desiciontree hierarchical-clustering r rpart svm

Last synced: 03 Apr 2025

https://github.com/niloth-p/phylogenetic-tree-construction-using-agglomerative-and-divisive-hierarchical-clustering

The prog is written to construct the phylogenetic tree (dendrogram) based on DNA/Protein sequences of species given in a dataset using Agglomerative and Divisive Hierarchical Clustering and to compare Agglomerative and Divisive methods

agglomerative-clustering dendrogram divisive-clustering hierarchical-clustering min-linkage needleman-wunsch-algorithm phylogenetic-tree proximity-matrix

Last synced: 25 Feb 2025

https://github.com/walidbosso/r_data_mining

Extract knowledge from a data using different techniques, including Association Rules Hierarchical Agglomerative Clustering (HAC) K-means Clustering Decision Trees

association-rule-mining association-rules clustering data-analysis data-mining data-science data-visualization decision-tree-classifier decision-trees exportation extract-data hac hierarchical-clustering k-means k-means-clustering k-means-r r-programming r-studio

Last synced: 23 Mar 2025

https://github.com/vidhi1290/malware-detection

Welcome to the Malicious Executable Detection project! This repository explores the world of machine learning and clustering analysis to detect malicious executable files 🔥🔐

clustering-algorithm cybersecurity hierarchical-clustering k-means-clustering machine-learning malware-detection python silhouette

Last synced: 28 Mar 2025

https://github.com/peteprattis/user-clusters-and-k-means-fold-for-classifier-evaluation

A Matlab script that applies the basic sequential clustering to evaluate the number of user groups by using the hierarchical clustering and k-means algorithms. Using the k-means fold the classifiers that are a neural network and the other least squares to evaluate them.

classifier clusters computer-science hierarchical-clustering k-fold k-means-clustering matlab program sequential-clustering student

Last synced: 18 Jan 2025

https://github.com/gandhasiri-rahul-mohan/clustering-q1-crime_data

Perform Clustering(Hierarchical, Kmeans & DBSCAN) for the crime data and identify the number of clusters formed and draw inferences.

clustering data-science dbscan-clustering hierarchical-clustering kmeans-clustering machine-learning python

Last synced: 04 Mar 2025

https://github.com/vhtua/group4_data_analysis

Hierarchical Cluster Analysis: Movie Genres Preferences

data-analysis hierarchical-clustering r unsupervised-learning

Last synced: 29 Mar 2025

https://github.com/pprattis/user-clusters-and-k-means-fold-for-classifier-evaluation

A Matlab script that applies the basic sequential clustering to evaluate the number of user groups by using the hierarchical clustering and k-means algorithms. Using the k-means fold the classifiers that are a neural network and the other least squares to evaluate them.

classifier clusters computer-science hierarchical-clustering k-fold k-means-clustering matlab program sequential-clustering student

Last synced: 29 Mar 2025

https://github.com/jmsquare/canonical-correlation-analysis

This project implements canonical correlation analysis between two data matrices. I first create the latent dimensions between the two data matrices. Then I use Kmeans and hierarchical clustering on principal component to group individuals using the latent dimensions and the distance created by the canonical analysis. Last step, I give a profiling of the different groups using descriptive statistics and provide an automated method to export the results.

canonical-correlation-analysis hierarchical-clustering kmeans kmeans-clustering

Last synced: 25 Feb 2025

https://github.com/gandhasiri-rahul-mohan/clustering-q2-eastwestairline

Perform clustering (hierarchical,K means clustering and DBSCAN) for the airlines data to obtain optimum number of clusters. Draw the inferences from the clusters obtained.

clustering data-science dbscan-clustering hierarchical-clustering kmeans-clustering machine-learning python

Last synced: 04 Mar 2025

https://github.com/priyanshu501/causalgeneanalysis

This repository contains analysis and exploration of causal and non-causal relationships between genes and phenotypes using embeddings generated from GPT-3.5. The project applies vector analysis, dimensionality reduction, and clustering techniques (K-Means, Hierarchical, and DBSCAN) to uncover potential patterns and insights into causality.

clustering-analysis dbscan hierarchical-clustering jupyter-notebook kmeans

Last synced: 18 Mar 2025

https://github.com/nafisalawalidris/machine-learning-with-python

Machine Learning with Python final project: Apply ML algorithms to solve real-world problem. Hands-on experience in data preprocessing, model selection, evaluation. Showcase ML proficiency in Python.

classification data-science hierarchical-clustering machine-learning regression scikit-learn-python scipy

Last synced: 16 Mar 2025

https://github.com/moindalvs/assignment_crime_data_clustering

Content This data set contains statistics, in arrests per 100,000 residents for assault, murder, and rape in each of the 50 US states in 1973. Also given is the percent of the population living in urban areas.This is a systematic approach for identifying and analyzing patterns and trends in crime using USArrest dataset.

clustering-algorithm data-science dbscan-clustering epsilon hierarchical-clustering kmeans-clustering

Last synced: 11 Mar 2025

https://github.com/razamehar/financial-stock-analysis-and-clustering

Analyzed 157 US Energy stocks (Jan-Dec '23), identified Bullish/Bearish trends and risk categories. Used KMeans, Hierarchical, Spectral Clustering, revealing balanced returns and low volatility. Integrated data with Kafka for seamless subscriptions.

financial-analysis hierarchical-clustering kafka-producer kmeans-clustering pandas python rdd stock-analysis

Last synced: 18 Mar 2025

https://github.com/ksmooi/mscs_ml_unsupervised_learning

Unsupervised learning finds patterns or structures in unlabeled data, useful for tasks like clustering and dimensionality reduction. It helps with preprocessing, discovering hidden patterns, and making data easier to visualize and analyze.

hierarchical-clustering k-means-clustering matrix-factorization pca unsupervised-learning

Last synced: 21 Jun 2025

https://github.com/sirinemaaroufi/ml_clustering_explorations

This repository contains a series of notebooks exploring various clustering techniques in machine learning.

clustering-methods comparison dbscan-clustering dimentionality-reduction gmm-clustering hierarchical-clustering kmeans-clustering ml pca python

Last synced: 08 Apr 2025

https://github.com/shridhar1504/mall-customers-clustering-datascience-project

In this project, we analyze a dataset of mall customers to understand their characteristics, preferences, and behaviors. By applying data analysis techniques and clustering algorithms, we aim to identify customer segments based on their shopping patterns and annual income.

clustering data-science hierarchical-clustering k-means-clustering machine-learning unsupervised-learning

Last synced: 09 Apr 2025

https://github.com/cego669/dirtycategoriesencoding

Repository containing two classes (StringAgglomerativeEncoder and StringDistanceEncoder) useful for grouping or visualizing the distance between dirty categorical variables. They are compatible with the scikit-learn API.

category clustering dimensionality-reduction dirty hierarchical-clustering machine-learning scikit-learn singular-value-decomposition svd

Last synced: 07 Apr 2025

https://github.com/gabrielmazzotta/nlp-clustering--movie-similarity-from-plot-summaries

A Python-based movie recommendation system leveraging NLP and clustering techniques. This project includes data processing, vectorization of plot summaries, and the implementation of recommendation algorithms to suggest similar movies based on user input.

clustering cosine-similarity hierarchical-clustering kmeans lemmatization nlp recommendation-engine scikit-learn similarity-score spacy tokenization

Last synced: 07 Apr 2025

https://github.com/ahammadmejbah/create-your-first-hierarchical-clustering-model

A common technique for grouping things is called hierarchical clustering. It divides items into groups such that they are similar to one another and distinct from those in other groups. A dendrogram is a type of hierarchical tree that graphically depicts clusters.

data-science hierarchical-clustering hierarchical-models python

Last synced: 26 Feb 2025

https://github.com/saritaphd/hierarchial-clustering

Hiearchial clustering manual with example

hierarchical-clustering python

Last synced: 06 Mar 2025

https://github.com/samkazan/fraud-detection-ml

Machine learning models for enhanced fraud detection in e-commerce transactions, exploring feature engineering, distance prediction, and clustering analysis.

clustering data-science data-visualization dataanalytics dbscan eda hierarchical-clustering kmeans-clustering knn-imputer matplotlib mlxtend python scikit-learn seaborn xgboost

Last synced: 09 Apr 2025

https://github.com/mehrab-kalantari/supermarket-customers-clustering

Supermarket customers clustering using k-means, DBScan, hierarchical clustering and RFM analysis

dbscan-clustering hierarchical-clustering k-means-clustering machine-learning rfm-analysis t-sne unsupervised-learning

Last synced: 05 Mar 2025

https://github.com/gusenov/reachability-plots-2-cluster-tree-js

Algorithm for constructing a cluster tree from a reachability plot.

algorithm-implementations clustering hierarchical-clustering

Last synced: 20 Feb 2025

https://github.com/aelmahraoui/machine-learning-with-python

IBM Machine Learning with Python course : full description + Projects

classification hierarchical-clustering machine-learning regression scipy-and-scikit-learn

Last synced: 25 Mar 2025

https://github.com/cjunwon/odaq-sda

Applying Categorical Exploratory Data Analysis (CEDA) methods to study audio quality perception

audio audio-processing categorical-cross-entropy categorical-data categorical-data-visualization hierarchical-clustering ranking

Last synced: 28 Mar 2025

https://github.com/saniyaabushakimova/brewing-insights-with-unsupervised-learning

Conducted a comprehensive clustering analysis to categorize beers based on features such as Astringency, Alcohol content, Bitterness, Sourness, and more. Utilized k-medoids and hierarchical agglomerative clustering algorithms to achieve this classification. Tech: Python (numpy, pandas, seaborn, matplotlib, sklearn, scipy)

dendrogram hierarchical-clustering k-medoids-clustering python silhouette-score t-sne unsupervised-learning

Last synced: 05 Mar 2025

https://github.com/mishraanuraagx/finsight

FinSight is a machine learning-driven financial analytics tool designed to explore, cluster, and visualize different financial assets based on their risk and return behaviors.

clustering cryptocurrency data-science data-visualization deep-learning fastapi finance financial-analysis gru hierarchical-clustering investment-analysis lstm machine-learning portfolio-analysis predictive-modeling quantitative-finance risk-management stock-market time-series yfinance

Last synced: 21 Jun 2025

https://github.com/subhashpolisetti/clustering-techniques-and-embeddings

This repository includes Colab notebooks demonstrating various clustering algorithms, from scratch-based methods to advanced deep learning models and embeddings. Each notebook features explanations, visualizations, and quality evaluation metrics for clustering performance.

anomaly-detection clustering-algorithm hierarchical-clustering kmeans-clustering multimodal time-series

Last synced: 02 Apr 2025

https://github.com/felipeversiane/face-cluster

application that receives a dataset of faces and creates a cluster of images that have similarity.

docker face-clustering face-recognition hierarchical-clustering python

Last synced: 15 Jun 2025

https://github.com/giatraskon/hyperspectral-image-clustering

Analysis of the Salinas hyperspectral image dataset using advanced clustering algorithms, focusing on identifying homogeneous regions in the image. Implementations of cost-function optimization and hierarchical clustering techniques, along with evaluations and visualizations in reduced-dimensional spaces.

adjusted-rand-index calinski-harabasz-index clustering data-visualization dimensionality-reduction fuzzy-cmeans-clustering hierarchical-clustering hyperspectral-imaging image-processing k-means-clustering machine-learning matlab pca possibilistic-clustering-algorithms probabilistic-clustering remote-sensing salinas-dataset silhouette-score spectral-bands unsupervised-learning

Last synced: 14 Mar 2025

https://github.com/xhan97/hunger

A python library for evaluating Hierarchical Clustering

dendrogram evaluation-metrics hierarchical-clustering python scipy

Last synced: 14 Apr 2025

https://github.com/srosalino/clustering_on_starbucks_beverages

Grouping of drinks according to their nutritional values, making it easier to categorize them in a future catalog, increasing organization and facilitating the search depending on individual preferences

gaussian-mixture-models hierarchical-clustering k-means-clustering pam pca unsupervised-learning

Last synced: 25 Feb 2025

https://github.com/lefteris-souflas/election-classification-and-clustering-analysis

Creating predictive models to classify Trump's vote share and clustering counties based on demographics and economic variables. Report findings in PDF with detailed methodologies, model assessments, and R code for the project.

agglomerative-algorithm bootstrap-sampling classification clustering cross-validation data-cleaning decision-tree hierarchical-clustering model-evaluation model-interpretation predictive-analytics r random-forest silhouette-analysis statistics support-vector-machine variable-importance

Last synced: 02 Mar 2025

https://github.com/teja-1403/coursera-machine-learning-with-python-honors

This project involves building a classifier to predict rainfall for the next day based on weather data from the Australian Government's Bureau of Meteorology. Various machine learning techniques such as Linear Regression, KNN, Decision Trees, Logistic Regression, and SVM were implemented and evaluated.

classification hierarchical-clustering machine-learning regression scikit-learn scipy

Last synced: 22 Feb 2025

https://github.com/shuyib/skateproject

A report about a contest held on 7th January 2017. It involved skateboarders from all over the world performing an ollie over an obstacle as well recording the heights achieved - this was calibrated on the obstacle. The event was organized by skate-aid (http://www.skate-aid.org). For more information check out the document. It is also published here http://rpubs.com/BMainye/skatecontest_2017

extremesports hierarchical-clustering html involved-skateboarders machine-learning obstacle ollie r rmarkdown rprogramming skateboarding

Last synced: 22 Mar 2025

https://github.com/kmohamedalie/clustering-algorithms

unsupervised machine learning with clustering algorithms

dbscan hierarchical-clustering kmeans-clustering unsupervised-machine-learning

Last synced: 22 Feb 2025

https://github.com/evangks/hierarchical-clustering-mall-customers

A comprehensive machine learning project demonstrating hierarchical clustering for customer segmentation on the Mall Customers dataset. Includes EDA, preprocessing, multiple linkage/distance comparisons, and professional visualizations.

clustering data-science hierarchical-clustering jupyter-notebook machine-learning mall-customers portfolio-project python scikit-learn unsupervised-learning

Last synced: 22 Jun 2025