CMD + K

r-clustimpute

Community

This k-means algorithm is able to cluster data with missing values and as a by-product completes the data set. The implementation can deal with missing values in multiple variables and is computationally efficient since it iteratively uses the current cluster assignment to define a plausible distribution for missing value imputation. Weights are used to shrink early random draws for missing values (i.e., draws based on the cluster assignments after few iterations) towards the global mean of each feature. This shrinkage slowly fades out after a fixed number of iterations to reflect the increasing credibility of cluster assignments. See the vignette for details.

Installation

To install this package, run one of the following:

Conda
$conda install conda-forge::r-clustimpute

Usage Tracking

0.2.4
0.2.0
0.1.7
0.1.6
0.1.5
5 / 8 versions selected
Downloads (Last 6 months): 0

About

Summary

This k-means algorithm is able to cluster data with missing values and as a by-product completes the data set. The implementation can deal with missing values in multiple variables and is computationally efficient since it iteratively uses the current cluster assignment to define a plausible distribution for missing value imputation. Weights are used to shrink early random draws for missing values (i.e., draws based on the cluster assignments after few iterations) towards the global mean of each feature. This shrinkage slowly fades out after a fixed number of iterations to reflect the increasing credibility of cluster assignments. See the vignette for details.

Last Updated

May 31, 2021 at 10:37

License

GPL-3.0-only

Total Downloads

39.8K

Supported Platforms

noarch