An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with clustering-algorithm

A curated list of projects in awesome lists tagged with clustering-algorithm .

https://github.com/vidhi1290/malware-detection

Welcome to the Malicious Executable Detection project! This repository explores the world of machine learning and clustering analysis to detect malicious executable files 🔥🔐

clustering-algorithm cybersecurity hierarchical-clustering k-means-clustering machine-learning malware-detection python silhouette

Last synced: 28 Mar 2025

https://github.com/kyrczak/clustering-algorithms-analysis

Clustering Algorithms Analysis is an artificial intelligence project analyzing data clustering algorithms and comparing their pros and cons.

ai artificial-intelligence clustering clustering-algorithm python

Last synced: 23 Feb 2025

https://github.com/george-mountain/data-extraction-integration-and-analysis---clustering-operations

This repository for a project detailing the step by step approach of scraping data, integrating data from various sources, performing analysis on data from various sources for the purpose of analaysis. It also shows how APIs can be harnessed for data engr operations. In this project, the four square API was utilized for the location data.

clustering-algorithm dataingestion dataintegration dataproject datascience datascraping foursquare-api machine-learning

Last synced: 14 Mar 2025

https://github.com/ganeshkbhat/loadbalancer

A simple threaded and clustered load balancer for nodejs with different forwarding algorithms and server request handling options

clustering clustering-algorithm loadbalancer nodejs threading

Last synced: 29 Jul 2025

https://github.com/gauravkoradiya/ticket-clustering

This Repository contains various methodology for cluster unstructured user tickets.

cluster-analysis clustering-algorithm deep-learning machine-learning ticket-management

Last synced: 15 Mar 2025

https://github.com/gu18168/dbscansd

Rust implementation for DBSCANSD, a trajectory clustering algorithm.

clustering-algorithm dbscan-clustering rust

Last synced: 25 Mar 2025

https://github.com/daluisgarcia/euclidean_distance_clustering

Cluster your data using the euclidean distance and watch the distance matrix for each epoch of the algorithm. The program reads the data by a .csv file and plots the results on dendrogram and radar plots.

clustering-algorithm euclidean-distances python python3

Last synced: 20 Feb 2025

https://github.com/hue-jhan/android-qt-client

Client mode for the Quality Treshold clustering algorithm in android (University Project)

android android-client clustering-algorithm quality-threshold

Last synced: 11 Oct 2025

https://github.com/kskbhat/silhouette

Silhouette-Based Diagnostics for Standard, Soft, and Multi-Way Clustering

classification cluster-analysis clustering-algorithm membership-probability proximity-measure silhouette

Last synced: 22 Oct 2025

https://github.com/firoz-thakur/machine-learning

A collection of machine learning projects, including Face Recognition using KNN and various algorithms and techniques. This repo offers practical implementations and resources for exploring machine learning.

artificial-intelligence clustering-algorithm convolutional-neural-networks image-classification image-processing knn-classification machine-learning nlp-machine-learning perceptron-learning-algorithm regression-models

Last synced: 26 Oct 2025

https://github.com/shridhar1504/loan-clustering-datascience-project

This project uses Machine Learning to Cluster loan together based on their similarities. The project uses a dataset of loan application which includes information about the Loan amount and Balance. The project then use the clustering algorithm to group the loan together based on the similarities.

clustering-algorithm data-analysis data-science data-visualization datanalysis eda kmeans-clustering machine-learning python sql sql-server unsupervised-learning

Last synced: 09 Apr 2025

https://github.com/zjiayao/ms-2dpnts

Clustering with Mean Shift

clustering-algorithm mean-shift

Last synced: 11 Apr 2025

https://github.com/kiranvad/Amplitude-Phase-Distance

A light-weight repository to compute Amplitude Phase distance between two functions

clustering-algorithm differential-geometry functional-data-analysis hilbert-spaces machine-learning metric-learning

Last synced: 08 Apr 2025

https://github.com/moindalvs/assignment_crime_data_clustering

Content This data set contains statistics, in arrests per 100,000 residents for assault, murder, and rape in each of the 50 US states in 1973. Also given is the percent of the population living in urban areas.This is a systematic approach for identifying and analyzing patterns and trends in crime using USArrest dataset.

clustering-algorithm data-science dbscan-clustering epsilon hierarchical-clustering kmeans-clustering

Last synced: 11 Mar 2025

https://github.com/hsinlichu/customer-service-data-analysis-with-machine-learning-technique

In this project, I use several machine learning technique both supervised and unsupervised to analyze Cyberlink customer service feedback data.

bert-model clustering-algorithm lda-model machine-learning nlp-machine-learning

Last synced: 05 Mar 2025

https://github.com/otakmager/projectml-clusteringweb

This repo is the result of a project assignment for a machine learning course at my university which was assisted by other group members. This project is to create a website that can cluster from the models that have been made. This model was created using the KMeans algorithm with 3 clusters that were trained with the seed dataset

bahasa-indonesia clustering-algorithm flask jupyter-notebook kmeans-clustering numpy pandas pickle python scikit-learn seed-dataset

Last synced: 26 Mar 2025

https://github.com/chris-santiago/kmeans

A simple implementation of K-Means & K-Medoids Clustering

clustering clustering-algorithm kmeans kmedoids

Last synced: 02 Apr 2025

https://github.com/inazuma110/hcbe.jl

Hypergraph(or bipartite graph) clustering algorithm.

clustering clustering-algorithm hypergraph hypergraph-partitioning

Last synced: 06 Mar 2025

https://github.com/virajbhutada/article-recommendation-system

This project aims to redefine content discovery by delivering personalized article recommendations tailored to individual user preferences. We use advanced machine learning techniques like PCA and K-means clustering to analyze user behavior and article characteristics to provide highly accurate recommendations.

anaconda article-recommendation clustering-algorithm data-analysis data-science keras-tensorflow machine-learning machine-learning-algorithms ml-models numpy pandas plotly python scikit-learn scipy

Last synced: 27 Mar 2025

https://github.com/myriamba/smart-grid-data-analysis-clustering

Exploratory Data Analysis of Smartmeter Data , Visualization, and Consumer Clustering for London Households.

clustering-algorithm data-analytics data-visualization eda unsupervised-learning

Last synced: 09 Sep 2025

https://github.com/antononcube/raku-ml-clustering

Raku package for Machine Learning (ML) clustering algorithms

clustering clustering-algorithm machine-learning machine-learning-algorithms raku

Last synced: 18 Jul 2025

https://github.com/nokia/pattern-clustering

The pattern clustering tool groups similar input log lines according to a set of predefined regular expressions.

clustering-algorithm dynamic-programming language-theory pattern-matching

Last synced: 25 Mar 2025

https://github.com/kelvinleandro/manim-animations

Animations exploring various concepts in computer science, with a special focus on machine learning and statistics

classification-algorithm clustering-algorithm computer-science machine-learning machine-learning-algorithms manim manim-3b1b manim-animations python python3 regression-algorithms statistics

Last synced: 16 Oct 2025

https://github.com/ryancodingg/customer-segmentation-and-clustering-analysis

This project focuses on customer segmentation using unsupervised machine learning techniques. The goal is to analyze customer data, identify distinct customer groups (clusters), and extract useful insights for business decision-making.

clustering-algorithm dbscan-clustering k-means-clustering k-medoids-clustering python

Last synced: 08 Jul 2025

https://github.com/letruongzzio/machine-learning

A place to store knowledge about basic Mathematical programming and Machine Learning. Readers should refer to the books in the reference section of this repository.

artificial-intelligence clustering-algorithm data-science decision-trees dimensionality-reduction ensemble-learning linear-regression machine-learning mathematical-modelling mathematical-programming neural-networks python3 recommendation-system support-vector-machines

Last synced: 22 Feb 2025

https://github.com/fonzy0508/clustering-distribution-visualization-learning

Visualizing clustering distributions on the Digits dataset using UMAP, PCA, t-SNE, and algorithms like K-Means, DBSCAN, and Hierarchical Clustering.

clustering clustering-algorithm data-visualization dbscan dimensionality-reduction kmeans machine-learning pca python tsne umap

Last synced: 07 Aug 2025

https://github.com/yuryalencar/imagesclustering

This project contains a Genetic Algorithm for images Clusterings (Using four Clusters).

clustering clustering-algorithm images machine-learning machine-learning-algorithms python3

Last synced: 26 Aug 2025

https://github.com/indyfree/tailor

Clustering Algorithm for clustering retail products according to custom requirements.

clustering clustering-algorithm retail-data

Last synced: 22 Mar 2025

https://github.com/jenson073/clustering_algorithms

This effectively conveys the focus on applying and comparing two clustering algorithms, K-Means and Agglomerative Clustering, using synthetic data.

agglomerative-clustering clustering-algorithm k-means-clustering machine-learning

Last synced: 14 Mar 2025

https://github.com/neelanjan-chakraborty/custoclarity

CUSTO CLARITY is a customer segmentation model built in Python. Using clustering on real retail datasets, it identifies 5 customer segments that unlocked strategic retail partnerships. Powered by scikit-learn, pandas, seaborn, and Matplotlib.

clustering-algorithm clustering-algorithms customer-analytics customer-segmentation data-visualization kmeans kmeans-clustering pandas python scikit-learn

Last synced: 12 Aug 2025

https://github.com/cyberoctane29/penguins-data-modeling-and-analysis

This project applies statistical modeling, including single and multiple linear regression, using Python. It covers exploratory data analysis, data cleaning, and modeling with pandas, NumPy, statsmodels, and scikit-learn. Regression analyzes relationships, while clustering identifies patterns. Seaborn visualizations enhance interpretability.

clustering-algorithm clustering-analysis data-analytics eda kmeans-clustering machine-learning mulitple-linear-regression penguins predictive-modeling regression-analysis simple-linear-regression statistical-modeling supervised-learning unsupervised-learning

Last synced: 24 Mar 2025

https://github.com/annaanastasy/clustering-fish-species

A comprehensive project demonstrating the use of various clustering techniques to analyze and group fish data effectively.

clustering-algorithm data-science data-visualization machine-learning-algorithms unsupervised-clustering unsupervised-machine-learning

Last synced: 05 Apr 2025

https://github.com/dopebiscuit/applai-final-project

ApplAI ML workshop '23 final project, it's a customer segmentation project using clustering, deployed using streamlit.

clustering-algorithm final-project machine-learning streamlit

Last synced: 31 Mar 2025

https://github.com/kshitizrohilla/mall-customer-segmentation-using-k-means-clustering-algorithm

This project aims to perform customer segmentation on a Mall customer dataset using the K-Means clustering algorithm. The goal of this project is to cluster the customers based on their purchasing behavior and demographic characteristics.

cluster clustering clustering-algorithm clustering-methods customer-segmentation customer-segmentation-analysis k-means-clustering k-means-clustering-model k-means-clustering-segmentation k-means-clustering-sklearn mall-customer-segmentation

Last synced: 31 Mar 2025

https://github.com/malcolmmielle/deep-auto-encoder-based-clustering

Reimplementation of Deep auto-encoder based clustering by Song et al.

autoencoder clustering clustering-algorithm kmeans-clustering machinelearning tensorflow

Last synced: 16 Mar 2025

https://github.com/randogoth/dtscan

Rust implementation of the DTSCAN Clustering Algorithm

astronomy attractors clustering-algorithm clusters dbscan delaunay dtscan triangulation

Last synced: 08 Apr 2025

https://github.com/hibatillah/laporan-bencana

Identifikasi Tingkat Kerusakan Bencana Dengan Clustering

clustering-algorithm data-science disaster form-validation report

Last synced: 01 Apr 2025

https://github.com/rosacarla/segmentacao-de-clientes-com-kmeans

Projeto desenvolve modelo de clusterização em uma base de clientes com o algoritmo KMeans

clustering-algorithm machine-learning python sklearn

Last synced: 14 Jul 2025

https://github.com/jpgiant/land-classification-using-kmeans

Application of KMeans algorithm on a satellite image and identifying its feasibility in land use classification

clustering-algorithm data-science gdal-python geospatial-analysis jupyter-notebook kmeans-clustering machine-learning-algorithms satellite-image-classification sklearn

Last synced: 11 May 2025

https://github.com/coreyjs/identify_customer_segments

A Jupyter notebook that run PCA and KMeans on population demographic data.

clustering clustering-algorithm identify-segments kmeans notebook pca population udacity udacity-nanodegree

Last synced: 09 Apr 2025

https://github.com/venondev/almostcliquepoly

Implementation of the data reduction rule AlmostClique by Böcker et al.. This repository is part of my bachelor thesis.

clustering clustering-algorithm data-reduction np-hard

Last synced: 18 Oct 2025

https://github.com/eva-kaushik/data-clustering

Clustering Accelerators for hard and soft clustering, including implementations of K-means, K-medoids, hierarchical clustering, fuzzy C-means, and Gaussian mixture models. Demonstrates text clustering using both hard and soft clustering algorithms.

clustering clustering-algorithm data datascience machine-learning-algorithms

Last synced: 09 Apr 2025

https://github.com/lijesh010/customer_analysis

This repository contains a data science project aimed at analyzing customer behavior and classifying them based on their likelihood to accept marketing campaigns. Additionally, the project involves clustering customers into different segments for targeted marketing strategies.

agglomerative-clustering clustering-algorithm customer-analysis customer-segmentation exploratory-data-analysis internship-project jupyter-notebook kmeans-clustering machine-learning-algorithms python visualization

Last synced: 04 Jul 2025

https://github.com/shubham-s151/machine_learning_projects

A collection of machine learning projects covering predictive modeling, classification, clustering, and NLP. Each project includes detailed analysis, feature engineering, and model evaluation. Some projects also have interactive Streamlit apps for real-time insights! 🚀

classification clustering clustering-algorithm deployment jupyter-notebook model predictive-analytics predictive-modeling python regression statistical-analysis

Last synced: 04 Mar 2025

https://github.com/gu18168/traclus

This is an implementation for TraClus algorithm in Rust

clustering-algorithm traclus

Last synced: 25 Mar 2025

https://github.com/traore-07/unsupervised-learning-models

Unsupervised learning is a machine learning technique where the algorithm learns from unlabeled data.

clustering-algorithm machine-learning machine-learning-algorithms unsupervised-learning

Last synced: 12 Jun 2025

https://github.com/cyberoctane29/penguins-data-analysis-and-modeling

This project applies statistical modeling, including single and multiple linear regression, using Python. It covers exploratory data analysis, data cleaning, and modeling with pandas, NumPy, statsmodels, and scikit-learn. Regression analyzes relationships, while clustering identifies patterns. Seaborn visualizations enhance interpretability.

cluster-analysis clustering-algorithm data-analytics eda kmeans-clustering machine-learning multiple-linear-regression penguins predictive-modeling regression-analysis simple-linear-regression statistical-modeling supervised-learning unsupervised-learning

Last synced: 02 Apr 2025

https://github.com/akansharajput280799/covid19-impact-analysis-usa

Data Analysis and Predictive Modeling to study COVID-19 impact across age groups, regions, and seasons in the USA.

classification-algorithm clustering-algorithm data-preprocessing data-visualization descriptive-statistics exploratory-data-analysis matplotlib numpy pandas seaborn

Last synced: 24 Oct 2025

https://github.com/tberchanov/clustering-k-means

Sample program where is implemented clustering by K-means algorithm, and its visualisation.

clustering-algorithm kmeans-clustering matplotlib numpy python

Last synced: 23 Feb 2025

https://github.com/jingjing-jin/Purchase-Behavior-Analysis

Purchase Behavior Analysis for Targeted Customer Segmentation

clustering-algorithm data-mining machine-learning python scikit-learn

Last synced: 02 Apr 2025

https://github.com/michaelb/point-clustering

Regroup points in a nth-dimension space if they are closer than a certain distance

clustering-algorithm dimensions

Last synced: 03 Mar 2025

https://github.com/yashmittalz/customer-segmentation

A machine learning project that segments customers into meaningful groups using advanced clustering techniques, enabling targeted marketing strategies based on both numerical and categorical data.

categorical-analysis clustering clustering-algorithm customer-segmentation machine-learning numerical-analysis segmentation

Last synced: 05 Sep 2025

https://github.com/nirbhaykr87/imagecartoonizer

This Flask app lets users upload images and convert them into cartoonized versions using edge detection and color quantization. The process involves reading the image, detecting edges, reducing colors with k-means clustering, and blending for a cartoon effect. Try it out by running the app and uploading an image!

clustering-algorithm flask python

Last synced: 15 Jul 2025

https://github.com/balajimohan18/loan-clustering-datascience-project

This project uses Machine Learning to Cluster loan together based on their similarities. The project uses a dataeset of loan application which includes information about the Loan amount and Balance. The project then use the clustering algorithm to group the loan together based on the similarities.

clustering-algorithm data-analysis data-science data-visualization eda kmeans-clustering machine-learning sql unsupervised-learning

Last synced: 27 Jul 2025

https://github.com/brandonmanke/k-means-clustering

K-Means clustering & classification algorithm for n-dimensional vectors implemented in C++

classification-algorithm clustering-algorithm cplusplus googletest k-means-clustering machine-learning unsupervised-learning

Last synced: 13 Jun 2025

https://github.com/shriyaak/machinelearning.studyjournal.1

This repository contains my study and practice of key machine learning concepts, including:

association-rule-learning classification-algorithm clustering-algorithm datapreprocessing regression-models

Last synced: 21 Jun 2025

https://github.com/tashwitab/sct_ml_02

This repository provides a Python implementation of K-Means clustering for segmenting retail store customers based on their purchase behavior. The algorithm groups customers into clusters using features such as Annual Income and Spending Score, enabling data-driven decision-making for marketing strategies.

clustering-algorithm kclustering python

Last synced: 20 Feb 2025

https://github.com/khushi130404/k_means

This repository showcases 2D, 3D, and custom K-Means clustering models with visualizations. It includes both Jupyter notebooks and Python scripts for ease of reproducibility.

clustering-algorithm k-means-clustering numpy plotly sklearn

Last synced: 01 Mar 2025

https://github.com/dimamirana/udemy-machine-learning-a-z

This Repository contains all the codes I have implemented while completing the course 'Machine Learning A-Z' on Udemy.

ann classification-model clustering-algorithm cnn-for-visual-recognition linear-regression machine-learning natural-language-processing nltk-python

Last synced: 02 Apr 2025

https://github.com/subhashpolisetti/crispdm-semma-kdd-workflows

This repository demonstrates three data mining methodologies applied to various real-world datasets: CRISP-DM (Weather Analysis), KDD (Social Media Ads Analysis), and SEMMA (Spotify Recommendation System). Each project includes data exploration, preprocessing, modeling, and evaluation steps, along with comprehensive documentation, supporting files,

clustering-algorithm crisp-dm kdd latex-template machine-learning numpy pandas random-forest research-paper semma

Last synced: 19 Jun 2025

https://github.com/ditikrushna/identify-customer-segments

In this project, Bertelsmann partners AZ Direct and Arvato Financial Solutions have provided two datasets one with demographic information about the people of Germany, and one with that same information for customers of a mail-order sales company. I have looked at relationships between demographics features, organized the population into clusters, and saw how prevalent customers are in each of the segments obtained.

clustering clustering-algorithm numpy pca

Last synced: 01 Mar 2025

https://github.com/fanboykun/datamining

Data Mining Calculation in PHP

clustering-algorithm datamining kmeans

Last synced: 01 Jul 2025

https://github.com/youssefwilliam/machine-learning

This is an assignment based on the least squares-based classification algorithm to train, cluster, and createing a model for testing the dataset images.

clustering-algorithm least-squares machine-learning

Last synced: 07 Oct 2025