clust_lev {GSSTDA} | R Documentation |
Get clusters for a particular data level
Description
It performs clustering of the samples belonging to a particular level (to a particular interval of the filter function) with the proposed clustering algorithm and the proposed method to determine the optimal number of clusters.
Usage
clust_lev(
data_i,
distance_type,
clustering_type,
linkage_type,
optimal_clustering_mode,
silhouette_threshold = 0.25,
num_bins_when_clustering,
level_name
)
Arguments
data_i |
Matrix with the columns of the input matrix corresponding to the individuals belonging to the level. |
distance_type |
Type of distance to be used for clustering. Choose between correlation ("correlation") and euclidean ("euclidean"). |
clustering_type |
Type of clustering method. Choose between "hierarchical" and "PAM" (“partition around medoids”) options. |
linkage_type |
Linkage criteria used in hierarchical clustering. Choose between "single" for single-linkage clustering, "complete" for complete-linkage clustering or "average" for average linkage clustering (or UPGMA). Only necessary for hierarchical clustering. The value provided if the type of clustering chosen is hierarchical will be ignored |
optimal_clustering_mode |
Method for selection optimal number of clusters. It is only necessary if the chosen type of algorithm is hierarchical. In this case, choose between "standard" (the method used in the original mapper article) or "silhouette". In the case of the PAM algorithm, the method will always be "silhouette". |
silhouette_threshold |
Minimum value of |
num_bins_when_clustering |
Number of bins to generate the histogram employed by the standard optimal number of cluster finder method. Parameter not necessary if the "optimal_clust_mode" option is "silhouette" or the "clust_type" is "PAM". |
level_name |
Name of the studied level. # ERROR No usado |
Value
Returns a interger vector with the samples included in each cluster for the specific level analyzed. The names of the vector values are the names of the samples and the vector values are the node number to which the individual belongs.