cluster {theftdlc} | R Documentation |
Perform cluster analysis of time series using their feature vectors
Description
Perform cluster analysis of time series using their feature vectors
Usage
cluster(
data,
norm_method = c("zScore", "Sigmoid", "RobustSigmoid", "MinMax"),
unit_int = FALSE,
clust_method = c("kmeans", "hclust", "mclust"),
k = 2,
features = NULL,
na_removal = c("feature", "sample"),
seed = 123,
...
)
Arguments
data |
feature_calculations object containing the raw feature matrix produced by theft::calculate_features
|
norm_method |
character denoting the rescaling/normalising method to apply. Can be one of "zScore" , "Sigmoid" , "RobustSigmoid" , "MinMax" , or "MaxAbs" . Defaults to "zScore"
|
unit_int |
Boolean whether to rescale into unit interval [0,1] after applying normalisation method. Defaults to FALSE
|
clust_method |
character specifying the clustering algorithm to use. Can be one of "kmeans" for k-means clustering, "hclust" for hierarchical clustering, or "mclust" for Gaussian mixture model clustering. Defaults to "kMeans"
|
k |
integer denoting the number of clusters to extract. Defaults to 2
|
features |
character vector denoting the names of time-series features to use in the clustering algorithm. Defaults to NULL for no feature filtering and usage of the entire feature matrix
|
na_removal |
character defining the way to deal with NAs produced during feature calculation. Can be one of "feature" or "sample" . "feature" removes all features that produced any NAs in any sample, keeping the number of samples the same. "sample" omits all samples that produced at least one NA. Defaults to "feature"
|
seed |
integer to fix R's random number generator to ensure reproducibility. Defaults to 123
|
... |
arguments to be passed to stats::kmeans or stats::hclust , or mclust::Mclust depending on selection in clust_method
|
Value
object of class feature_cluster
containing the clustering algorithm and a tidy version of clusters joined to the input dataset ready for further analysis
Author(s)
Trent Henderson
Examples
library(theft)
features <- theft::calculate_features(theft::simData,
group_var = "process",
feature_set = "catch22")
clusts <- cluster(features,
k = 6)
[Package
theftdlc version 0.1.0
Index]