hdbscan
Clustering based on density with variable density clusters
Clustering based on density with variable density clusters
To install this package, run one of the following:
HDBSCAN - Hierarchical Density-Based Spatial Clustering of Applications with Noise. Performs DBSCAN over varying epsilon values and integrates the result to find a clustering that gives the best stability over epsilon. This allows HDBSCAN to find clusters of varying densities (unlike DBSCAN), and be more robust to parameter selection.
In practice this means that HDBSCAN returns a good clustering straight away with little or no parameter tuning -- and the primary parameter, minimum cluster size, is intuitive and easy to select.
HDBSCAN is ideal for exploratory data analysis; it's a fast and robust algorithm that you can trust to return meaningful clusters (if there are any).
Summary
Clustering based on density with variable density clusters
Last Updated
Aug 1, 2023 at 15:08
License
BSD-3-Clause
Total Downloads
14.2K
Supported Platforms
GitHub Repository
https://github.com/scikit-learn-contrib/hdbscanDocumentation
https://hdbscan.readthedocs.io