disfa {drclust}R Documentation

Disjoint Factor Analysis

Description

Performs disjoint factor analysis, i.e., a Factor Analysis with a simple structure. In fact, each factor is defined by a disjoint subset of variables, resulting thus, in a simplified, easier to interpret loading matrix A and factors. Estimation is carried out via Maximum Likelihood.

Usage

disfa(X, Q, Rndstart, verbose, maxiter, tol, constr, prep, print)

Arguments

X

Units x variables numeric data matrix.

Q

Number of factors.

Rndstart

Number of runs to be performed (Defaults is 20).

verbose

Outputs basic summary statistics for each run (1 = enabled; 0 = disabled, default option).

maxiter

Maximum number of iterations allowed (if convergence is not yet reached. Default is 100).

tol

Tolerance threshold (maximum difference between the values of the objective function of two consecutive iterations such that convergence is assumed. Default is 1e-6).

constr

is a vector of length J = nr. of variables, pre-specifying to which cluster some of the variables must be assigned. Each component of the vector can assume integer values from 1 o Q (See example for more details), or 0 if no constraint on the variable is imposed (i.e., it will be assigned based on the plain algorithm).

prep

Pre-processing of the data. 1 performs the z-score transform (default choice); 2 performs the min-max transform; 0 leaves the data un-pre-processed.

print

Prints summary statistics of the performed method (1 = enabled; 0 = disabled, default option).

Value

returns a list of estimates and some descriptive quantities of the final results.

V

Variables x factors membership matrix (binary and row-stochastic). Each row is a dummy variable indicating to which cluster each variable has been assigned.

A

Variables x components loading matrix.

Psi

Specific variance of each observed variable, not accounted for by the common factors (matrix).

discrepancy

Value of the objective function, to be minimized. Difference between the observed and estimated covariance matrices (scalar).

RMSEA

Adjusted Root Mean Squared Error (scalar).

AIC

Aikake Information Criterion (scalar).

BIC

Bayesian Information Criterion (scalar).

GFI

Goodness of Fit Index (scalar).

Author(s)

Ionel Prunila, Maurizio Vichi

References

Vichi M. (2017) "Disjoint factor analysis with cross-loadings" <doi:10.1007/s11634-016-0263-9>

Examples

# Iris data 
# Loading the numeric variables of iris data
iris <- as.matrix(iris[,-5]) 

# No constraint on variables
out <- disfa(iris, Q = 2)

# Constraint: the first two variables must contribute to the same factor.
outc <- disfa(iris, Q = 2, constr = c(1,1,0,0))


[Package drclust version 0.1 Index]