R: Perform a cluster analysis of subjects in a JAR experiment.

cluscata_jar {ClustBlock}

R Documentation

Perform a cluster analysis of subjects in a JAR experiment.

Description

Hierarchical clustering of subjects from a JAR experiment. Each cluster of subjects is associated with a compromise computed by the CATATIS method. The hierarchical clustering is followed by a partitioning algorithm (consolidation).

Usage

cluscata_jar(Data, nprod, nsub, levelsJAR=3, beta=0.1,  Noise_cluster=FALSE,
        Itermax=30, Graph_dend=TRUE, Graph_bar=TRUE, printlevel=FALSE,
        gpmax=min(6, nsub-2), rhoparam=NULL,
        Testonlyoneclust=FALSE, alpha=0.05, nperm=50, Warnings=FALSE)

Arguments

`Data`	data frame where the first column is the Assessors, the second is the products and all other columns the JAR attributes with numbers (1 to 3 or 1 to 5, see levelsJAR)
`nprod`	integer. Number of products.
`nsub`	integer. Number of subjects.
`levelsJAR`	integer. 3 or 5 levels. If 5, the data will be transformed in 3 levels.
`beta`	numerical. Parameter for agreement between JAR and other answers. Between 0 and 0.5.
`Noise_cluster`	logical. Should a noise cluster be computed? Default: FALSE
`Itermax`	numerical. Maximum of iteration for the partitioning algorithm. Default:30
`Graph_dend`	logical. Should the dendrogram be plotted? Default: TRUE
`Graph_bar`	logical. Should the barplot of the difference of the criterion and the barplot of the overall homogeneity at each merging step of the hierarchical algorithm be plotted? Default: TRUE
`printlevel`	logical. Print the number of remaining levels during the hierarchical clustering algorithm? Default: FALSE
`gpmax`	logical. What is maximum number of clusters to consider? Default: min(6, nblo-2)
`rhoparam`	numerical. What is the threshold for the noise cluster? Between 0 and 1, high value can imply lot of blocks set aside. If NULL, automatic threshold is computed.
`Testonlyoneclust`	logical. Test if there is more than one cluster? Default: FALSE
`alpha`	numerical between 0 and 1. What is the threshold to test if there is more than one cluster? Default: 0.05
`nperm`	numerical. How many permutations are required to test if there is more than one cluster? Default: 50
`Warnings`	logical. Display warnings about the fact that none of the subjects in some clusters checked an attribute or product? Default: FALSE

Value

Each partitionK contains a list for each number of clusters of the partition, K=1 to gpmax with:

group: the clustering partition after consolidation. If Noise_cluster=TRUE, some subjects could be in the noise cluster ("K+1")
rho: the threshold for the noise cluster
homogeneity: homogeneity index (
s_with_compromise: similarity coefficient of each subject with its cluster compromise
weights: weight associated with each subject in its cluster
compromise: the compromise of each cluster
CA: list. the correspondance analysis results on each cluster compromise (coordinates, contributions...)
inertia: percentage of total variance explained by each axis of the CA for each cluster
s_all_cluster: the similarity coefficient between each subject and each cluster compromise
criterion: the CLUSCATA criterion error
param: parameters called
type: parameter passed to other functions

There is also at the end of the list:

dend: The CLUSCATA dendrogram
cutree_k: the partition obtained by cutting the dendrogram in K clusters (before consolidation).
overall_homogeneity_ng: percentage of overall homogeneity by number of clusters before consolidation (and after if there is no noise cluster)
diff_crit_ng: variation of criterion when a merging is done before consolidation (and after if there is no noise cluster)
test_one_cluster: decision and pvalue to know if there is more than one cluster
param: parameters called
type: parameter passed to other functions

References

Llobell, F., Vigneau, E. & Qannari, E. M. ((September 14, 2022). Multivariate data analysis and clustering of subjects in a Just about right task. Eurosense, Turku, Finland.

Examples


data(cheese)
res=cluscata_jar(Data=cheese, nprod=8, nsub=72, levelsJAR=5)
#plot(res, ngroups=4, Graph_dend=FALSE)
summary(res, ngroups=4)

[Package ClustBlock version 4.0.0 Index]