| consensus_cluster {ClustAssess} | R Documentation | 
Consensus Clustering and Proportion of Ambiguously Clustered Pairs
Description
Calculate consensus clustering and proportion of ambiguously clustered pairs (PAC) with hierarchical clustering.
Usage
consensus_cluster(
  x,
  k_min = 3,
  k_max = 100,
  n_reps = 100,
  p_sample = 0.8,
  p_feature = 1,
  p_minkowski = 2,
  dist_method = "euclidean",
  linkage = "complete",
  lower_lim = 0.1,
  upper_lim = 0.9,
  verbose = TRUE
)
Arguments
| x | A samples x features normalized data matrix. | 
| k_min | The minimum number of clusters calculated. | 
| k_max | The maximum number of clusters calculated. | 
| n_reps | The total number of subsamplings and reclusterings of the data; this value needs to be high enough to ensure PAC converges; convergence can be assessed with pac_convergence. | 
| p_sample | The proportion of samples included in each subsample. | 
| p_feature | The proportion of features included in each subsample. | 
| p_minkowski | The power of the Minkowski distance. | 
| dist_method | The distance measure for the distance matrix used in hclust; must be one of "euclidean", "maximum", "manhattan", "canberra", "binary" or "minkowski". | 
| linkage | The linkage method used in hclust; must be one of "ward.D", "ward.D2", "single", "complete", "average", "mcquitty", "median" or "centroid" | 
| lower_lim | The lower limit for determining whether a pair is clustered ambiguously; the lower this value, the higher the PAC. | 
| upper_lim | The upper limit for determining whether a pair is clustered ambiguously; the higher this value, the higher the PAC. | 
| verbose | Logical value used for choosing to display a progress bar or not. | 
Value
A data.frame with PAC values across iterations, as well as parameter values used when calling the method.
References
Monti, S., Tamayo, P., Mesirov, J., & Golub, T. (2003). Consensus clustering: a resampling-based method for class discovery and visualization of gene expression microarray data. Machine learning, 52(1), 91-118. https://doi.org/10.1023/A:1023949509487
Senbabaoglu, Y., Michailidis, G., & Li, J. Z. (2014). Critical limitations of consensus clustering in class discovery. Scientific reports, 4(1), 1-13. https://doi.org/10.1038/srep06207
Examples
pac.res = consensus_cluster(iris[,1:4], k_max=20)
pac_convergence(pac.res, k_plot=c(3,5,7,9))