{"id":15288186,"url":"https://github.com/alexandreday/fast_density_clustering","last_synced_at":"2025-04-13T07:35:13.989Z","repository":{"id":73953907,"uuid":"79415532","full_name":"alexandreday/fast_density_clustering","owner":"alexandreday","description":"Fast local density clustering for low-dimensional data","archived":false,"fork":false,"pushed_at":"2018-10-23T15:38:07.000Z","size":58578,"stargazers_count":7,"open_issues_count":3,"forks_count":7,"subscribers_count":4,"default_branch":"master","last_synced_at":"2025-02-05T14:12:21.336Z","etag":null,"topics":["clustering","density-estimation","sklearn"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"bsd-3-clause","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/alexandreday.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE.rst","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2017-01-19T04:29:27.000Z","updated_at":"2024-08-12T19:27:14.000Z","dependencies_parsed_at":null,"dependency_job_id":"24019875-4002-4600-a898-9b282b82aa3f","html_url":"https://github.com/alexandreday/fast_density_clustering","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/alexandreday%2Ffast_density_clustering","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/alexandreday%2Ffast_density_clustering/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/alexandreday%2Ffast_density_clustering/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/alexandreday%2Ffast_density_clustering/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/alexandreday","download_url":"https://codeload.github.com/alexandreday/fast_density_clustering/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":240045016,"owners_count":19739186,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["clustering","density-estimation","sklearn"],"created_at":"2024-09-30T15:44:35.891Z","updated_at":"2025-02-23T02:30:44.443Z","avatar_url":"https://github.com/alexandreday.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Fast density clustering (fdc)\nA Python package for efficiently clustering low-dimensional data. The core of the algorithm is based on using kernel density maps and density graphs. See below for some specific examples for gaussian mixtures and some test benchmarks that shows the performance against standard clustering algorithms that may be easily imported from sklearn for instance. \n\nOur algorithm solves multiscale problems (multiple variances/densities and population sizes) and works for non-convex clusters. It uses cross-validation and is regularized by two main global parameters : a neighborhood\nsize and a noise threshold measure. The later detects spurious cluster centers while the former guarantees that only local information is used to infer cluster centers. Perhaps one of it's main advantage is that it does not require the user to select the number of clusters and is usually fairly robust to variation of the parameters (within reason!). \n\nThe underlying code is based on fast KD-trees for nearest-neighbor searches. For low-dimensional spaces, the algorithm has a O(n log n), where n is the size of the dataset. Is also has a memory complexity of O(n).\n\n# Installing\nI suggest you install the code using ```pip``` from an [Anaconda](https://conda.io/docs/user-guide/tasks/manage-environments.html) Python 3 environment. From that environment:\n```\ngit clone https://github.com/alexandreday/fast_density_clustering.git\ncd fast_density_clustering\npip install .\n```\nThat's it ! You can now import the package ```fdc``` from your Python scripts. Check out the examples\nin the file ```example``` and see if you can run the scripts provided.\n# Examples and comparison with other methods\nCheck out the example for gaussian mixtures (example.py). You should be able to run it directly. It\nshould produce a plot similar to this: ![alt tag](https://github.com/alexandreday/fast_density_clustering/blob/master/example/result.png)\n\nIn another example (example2.py), the algorithm is benchmarked against some sklearn datasets (note that the same parameters are used across all datasets). This is to be compared with other clustering methods easily accesible from [sklearn](http://scikit-learn.org/stable/modules/clustering.html).\n\n![alt tag](https://github.com/alexandreday/fast_density_clustering/blob/master/example/sklearn_datasets.png)\n\n# Citation\nIf you use this code in a scientific publication, I would appreciate citation/reference to this repository. Also, for further references on clustering\nand machine learning check out our machine learning review:\n```\n@article{mehta2018high,\n  title={A high-bias, low-variance introduction to Machine Learning for physicists},\n  author={Mehta, Pankaj and Bukov, Marin and Wang, Ching-Hao and Day, Alexandre GR and Richardson, Clint and Fisher, Charles K and Schwab, David J},\n  journal={arXiv preprint arXiv:1803.08823},\n  year={2018}\n}\n```\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Falexandreday%2Ffast_density_clustering","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Falexandreday%2Ffast_density_clustering","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Falexandreday%2Ffast_density_clustering/lists"}