An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with clustering

A curated list of projects in awesome lists tagged with clustering.

https://github.com/asynkron/protoactor-go

Proto Actor - Ultra fast distributed actors for Go, C# and Java/Kotlin

actor-model actors akka clustering cross-platform distributed-computing distributed-systems go golang grpc protobuf

Last synced: 13 May 2025

https://github.com/AsynkronIT/protoactor-go

Proto Actor - Ultra fast distributed actors for Go, C# and Java/Kotlin

actor-model actors akka clustering cross-platform distributed-computing distributed-systems go golang grpc protobuf

Last synced: 26 Apr 2025

https://github.com/binroot/tensorflow-book

Accompanying source code for Machine Learning with TensorFlow. Refer to the book for step-by-step explanations.

autoencoder book classification clustering convolutional-neural-networks linear-regression logistic-regression machine-learning regression reinforcement-learning tensorflow

Last synced: 14 May 2025

https://github.com/BinRoot/TensorFlow-Book

Accompanying source code for Machine Learning with TensorFlow. Refer to the book for step-by-step explanations.

autoencoder book classification clustering convolutional-neural-networks linear-regression logistic-regression machine-learning regression reinforcement-learning tensorflow

Last synced: 20 Mar 2025

https://github.com/dedupeio/dedupe

🆔 A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.

clustering datamade de-duplicating dedupe dedupe-library entity-resolution python python-library record-linkage

Last synced: 14 May 2025

https://github.com/datamade/dedupe

🆔 A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.

clustering datamade de-duplicating dedupe dedupe-library entity-resolution python python-library record-linkage

Last synced: 22 Feb 2025

https://github.com/leaflet/leaflet.markercluster

Marker Clustering plugin for Leaflet

clustering leaflet leaflet-plugins map mapping

Last synced: 11 May 2025

https://github.com/Leaflet/Leaflet.markercluster

Marker Clustering plugin for Leaflet

clustering leaflet leaflet-plugins map mapping

Last synced: 15 Mar 2025

https://github.com/alibaba/alink

Alink is the Machine Learning algorithm platform based on Flink, developed by the PAI team of Alibaba computing platform.

apriori classification clustering data-mining feature-engineering flink flink-machine-learning flink-ml fm graph-algorithms graph-embedding kafka machine-learning recommender recommender-system regression statistics word2vec xgboost

Last synced: 14 May 2025

https://github.com/alibaba/Alink

Alink is the Machine Learning algorithm platform based on Flink, developed by the PAI team of Alibaba computing platform.

apriori classification clustering data-mining feature-engineering flink flink-machine-learning flink-ml fm graph-algorithms graph-embedding kafka machine-learning recommender recommender-system regression statistics word2vec xgboost

Last synced: 14 Mar 2025

https://github.com/unum-cloud/usearch

Fast Open-Source Search & Clustering engine × for Vectors & 🔜 Strings × in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram 🔍

approximate-nearest-neighbor-search clustering database faiss full-text-search fuzzy-search image-search kann nearest-neighbor-search recommender-system search search-engine semantic-search simd similarity-search text-search vector-search webassembly

Last synced: 29 Mar 2025

https://github.com/dipanjans/practical-machine-learning-with-python

Master the essential skills needed to recognize and solve complex real-world problems with Machine Learning and Deep Learning by leveraging the highly popular Python Machine Learning Eco-system.

classification clustering computer-vision convolutional-neural-networks deep-learning jupyter jupyter-notebook keras machine-learning natural-language-processing nltk notebook pandas prophet python scikit-learn spacy statsmodels tensorflow time-series-analysis

Last synced: 14 May 2025

https://github.com/dipanjanS/practical-machine-learning-with-python

Master the essential skills needed to recognize and solve complex real-world problems with Machine Learning and Deep Learning by leveraging the highly popular Python Machine Learning Eco-system.

classification clustering computer-vision convolutional-neural-networks deep-learning jupyter jupyter-notebook keras machine-learning natural-language-processing nltk notebook pandas prophet python scikit-learn spacy statsmodels tensorflow time-series-analysis

Last synced: 25 Mar 2025

https://github.com/mapbox/supercluster

A very fast geospatial point clustering library for browsers and Node.

algorithm clustering computational-geometry javascript maps

Last synced: 13 May 2025

https://github.com/bitwalker/libcluster

Automatic cluster formation/healing for Elixir applications

clustering elixir erlang-distribution

Last synced: 08 May 2025

https://github.com/sgrondin/bottleneck

Job scheduler and rate limiter, supports Clustering

clustering limiter rate-limiter rate-limiting scheduler throttle throttling

Last synced: 14 May 2025

https://github.com/SGrondin/bottleneck

Job scheduler and rate limiter, supports Clustering

clustering limiter rate-limiter rate-limiting scheduler throttle throttling

Last synced: 21 Mar 2025

https://github.com/asynkron/protoactor-dotnet

Proto Actor - Ultra fast distributed actors for Go, C# and Java/Kotlin

actors akka clustering distributed-computing distributed-systems proto-actor

Last synced: 14 May 2025

https://github.com/AsynkronIT/protoactor-dotnet

Proto Actor - Ultra fast distributed actors for Go, C# and Java/Kotlin

actors akka clustering distributed-computing distributed-systems proto-actor

Last synced: 08 Jan 2025

https://github.com/dipanjans/text-analytics-with-python

Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text Analytics with Python" published by Apress/Springer.

clustering gensim natural-language natural-language-processing nltk pattern python scikit-learn semantic sentiment sentiment-analysis spacy stanford-nlp text-analytics text-classification text-summarization

Last synced: 15 May 2025

https://github.com/dipanjanS/text-analytics-with-python

Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text Analytics with Python" published by Apress/Springer.

clustering gensim natural-language natural-language-processing nltk pattern python scikit-learn semantic sentiment sentiment-analysis spacy stanford-nlp text-analytics text-classification text-summarization

Last synced: 05 May 2025

https://github.com/nomic-ai/nomic

Interact, analyze and structure massive text, image, embedding, audio and video datasets

clustering duplicate-detection embeddings python text topic-modeling unstructured-data

Last synced: 13 May 2025

https://github.com/google/uis-rnn

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

clustering machine-learning speaker-diarization speaker-recognition supervised-clustering supervised-learning uis-rnn

Last synced: 14 May 2025

https://github.com/jokergoo/ComplexHeatmap

Make Complex Heatmaps

clustering complex-heatmaps heatmap

Last synced: 02 May 2025

https://github.com/jokergoo/complexheatmap

Make Complex Heatmaps

clustering complex-heatmaps heatmap

Last synced: 14 May 2025

https://github.com/oracle/tribuo

Tribuo - A Java machine learning library

classification clustering deep-learning java machine-learning ml nlp regression

Last synced: 11 May 2025

https://github.com/efremidze/cluster

Easy Map Annotation Clustering 📍

annotations apple carthage cluster clustering cocoapods ios map mapkit swift

Last synced: 15 May 2025

https://github.com/efremidze/Cluster

Easy Map Annotation Clustering 📍

annotations apple carthage cluster clustering cocoapods ios map mapkit swift

Last synced: 09 Dec 2024

https://github.com/prbonn/depth_clustering

🚕 Fast and robust clustering of point clouds generated with a Velodyne sensor.

catkin clustering depth depth-clustering depth-image fast lidar pcl point-cloud range range-image real-time robotics ros segmentation velodyne velodyne-sensor

Last synced: 16 May 2025

https://github.com/PRBonn/depth_clustering

🚕 Fast and robust clustering of point clouds generated with a Velodyne sensor.

catkin clustering depth depth-clustering depth-image fast lidar pcl point-cloud range range-image real-time robotics ros segmentation velodyne velodyne-sensor

Last synced: 20 Mar 2025

https://github.com/bitwalker/swarm

Easy clustering, registration, and distribution of worker processes for Erlang/Elixir

clustering elixir erlang-distribution process-registry

Last synced: 14 May 2025

https://github.com/unum-cloud/uform

Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️

bert clip clustering contrastive-learning cross-attention huggingface-transformers image-search language-vision llava multi-lingual multimodal neural-network openai openclip pretrained-models pytorch representation-learning semantic-search transformer vector-search

Last synced: 14 May 2025

https://github.com/wannesm/dtaidistance

Time series distances: Dynamic Time Warping (fast DTW implementation in C)

c clustering distance-measure dtw dynamic-time-warping python timeseries

Last synced: 13 May 2025

https://github.com/steineggerlab/foldseek

Foldseek enables fast and sensitive comparisons of large structure sets.

alignments bioinformatics clustering protein-structure

Last synced: 15 May 2025

https://github.com/WenjieDu/PyPOTS

A Python toolkit/library for reality-centric machine/deep learning and data mining on partially-observed time series, including SOTA neural network models for scientific analysis tasks of imputation/classification/clustering/forecasting/anomaly detection/cleaning on incomplete industrial (irregularly-sampled) multivariate TS with NaN missing values

classification clustering data-mining data-science deep-learning forecasting healthcare imputation incomplete industrial interpolation machine-learning missing-values missingness neural-network partially-observed-time-series pytorch science-research time-series time-series-analysis

Last synced: 01 Apr 2025

https://github.com/boazsegev/iodine

iodine - HTTP / WebSockets Server for Ruby with Pub/Sub support

clustering high-performance http message-bus multithreading pubsub ruby server sse web-server websocket

Last synced: 29 Apr 2025

https://github.com/rapidsai/raft

RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing high performance applications.

anns building-blocks clustering cuda distance gpu information-retrieval linear-algebra llm machine-learning nearest-neighbors neighborhood-methods primitives random-sampling solvers sparse statistics vector-search vector-similarity vector-store

Last synced: 14 May 2025

https://github.com/trekhleb/machine-learning-octave

🤖 MatLab/Octave examples of popular machine learning algorithms with code examples and mathematics being explained

clustering linear-regression machine-learning matlab neural-network neural-networks octave prediction regression

Last synced: 04 Apr 2025

https://github.com/chengshiwen/influxdb-cluster

InfluxDB Cluster - Open Source Alternative to InfluxDB Enterprise

clustering high-availability influxdb influxdb-cluster influxdb-enterprise

Last synced: 16 May 2025

https://github.com/smartcorelib/smartcore

A comprehensive library for machine learning and numerical computing. Apply Machine Learning with Rust leveraging first principles.

classification clustering machine-learning machine-learning-algorithms model-selection regression rust rust-lang scientific-computing statistical-learning statistical-models

Last synced: 14 May 2025

https://github.com/beedotkiran/Lidar_For_AD_references

A list of references on lidar point cloud processing for autonomous driving

autonomous-driving clustering lidar-point-cloud obstacle-detection simulator

Last synced: 20 Mar 2025

https://github.com/yomguithereal/talisman

Straightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript.

clustering deduplication fuzzy-matching information-retrieval machine-learning natural-language-processing record-linkage

Last synced: 14 Apr 2025

https://github.com/tomekvenits/react-native-map-clustering

React Native map clustering both for Android and iOS.

clustering map maps mapview markers react react-native

Last synced: 16 May 2025

https://github.com/Yomguithereal/talisman

Straightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript.

clustering deduplication fuzzy-matching information-retrieval machine-learning natural-language-processing record-linkage

Last synced: 15 Mar 2025

https://github.com/waikato/moa

MOA is an open source framework for Big Data stream mining. It includes a collection of machine learning algorithms (classification, regression, clustering, outlier detection, concept drift detection and recommender systems) and tools for evaluation.

clustering data-stream-mining java machine-learning machine-learning-algorithms moa streaming-algorithms

Last synced: 15 May 2025

https://github.com/Waikato/moa

MOA is an open source framework for Big Data stream mining. It includes a collection of machine learning algorithms (classification, regression, clustering, outlier detection, concept drift detection and recommender systems) and tools for evaluation.

clustering data-stream-mining java machine-learning machine-learning-algorithms moa streaming-algorithms

Last synced: 05 Mar 2025

https://github.com/hhblaze/dbreeze

C# .NET NOSQL ( key value, object store embedded TextSearch SemanticSearch Vector layer ) ACID multi-paradigm database management system.

acid android c-sharp clustering database dotnet embedded key net netcore netstandard nosql search search-engine similarity-search text transaction value vector-database xamarin

Last synced: 14 May 2025

https://github.com/logpai/drain3

A robust streaming log template miner based on the Drain algorithm

aiops anomaly-detection clustering drain log log-clustering machine-learning observability template-mining

Last synced: 15 May 2025

https://github.com/wq2012/spectralcluster

Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.

auto-tune clustering constrained-clustering machine-learning python speaker-diarization spectral-clustering unsupervised-clustering unsupervised-learning

Last synced: 16 May 2025

https://github.com/yukimasano/self-label

Self-labelling via simultaneous clustering and representation learning. (ICLR 2020)

clustering iclr2020 representation-learning resnet resnet-v2 self-supervised-learning

Last synced: 08 May 2025

https://github.com/bretfisher/dogvscat

Sample Docker Swarm cluster stack of tools

clustering containers docker elk monitoring prometheus rexray swarm

Last synced: 04 Apr 2025

https://github.com/BretFisher/dogvscat

Sample Docker Swarm cluster stack of tools

clustering containers docker elk monitoring prometheus rexray swarm

Last synced: 03 Apr 2025

https://github.com/hhblaze/DBreeze

C# .NET NOSQL ( key value store embedded ) ACID multi-paradigm database management system.

acid android c-sharp clustering database dotnet embedded key net netcore netstandard nosql search search-engine similarity-search text transaction value vector-database xamarin

Last synced: 14 Mar 2025

https://github.com/nanopack/shaman

Small, lightweight, API-driven DNS server.

clustering developer-tools devops devtools dns dns-server golang nanobox nanopack

Last synced: 03 Apr 2025

https://github.com/terhechte/postsack

Visually cluster your emails by sender, domain, and more to identify waste

clustering email linux macos native rust treemap wasm windows

Last synced: 05 Apr 2025

https://github.com/lucidrains/slot-attention

Implementation of Slot Attention from GoogleAI

artificial-intelligence attention-mechanism clustering deep-learning

Last synced: 12 Apr 2025

https://github.com/matrix-profile-foundation/matrixprofile

A Python 3 library making time series data mining tasks, utilizing matrix profile algorithms, accessible to everyone.

algorithms anomaly-detection clustering data-mining data-science hacktoberfest matrixprofile motif-discovery python python2 python3 segmentation time-series time-series-analysis

Last synced: 16 May 2025