R: Rank selection

sele_rank {tensorregress}

R Documentation

Rank selection

Description

Estimate the Tucker rank of tensor decomposition based on BIC criterion. The choice of BIC aims to balance between the goodness-of-fit for the data and the degree of freedom in the population model.

Usage

sele_rank(
  tsr,
  X_covar1 = NULL,
  X_covar2 = NULL,
  X_covar3 = NULL,
  rank_range,
  niter = 10,
  cons = "non",
  lambda = 0.1,
  alpha = 1,
  solver = "CG",
  dist,
  initial = c("random", "QR_tucker")
)

Arguments

`tsr`	response tensor with 3 modes
`X_covar1`	side information on first mode
`X_covar2`	side information on second mode
`X_covar3`	side information on third mode
`rank_range`	a matrix containing rank candidates on each row
`niter`	max number of iterations if update does not convergence
`cons`	the constraint method, "non" for without constraint, "vanilla" for global scale down at each iteration, "penalty" for adding log-barrier penalty to object function.
`lambda`	penalty coefficient for "penalty" constraint
`alpha`	max norm constraint on linear predictor
`solver`	solver for solving object function when using "penalty" constraint, see "details"
`dist`	distribution of response tensor, see "details"
`initial`	initialization of the alternating optimiation, "random" for random initialization, "QR_tucker" for deterministic initialization using tucker decomposition

Details

For rank selection, recommend using non-constraint version.

Constraint penalty adds log-barrier regularizer to general object function (negative log-likelihood). The main function uses solver in function "optim" to solve the objective function. The "solver" passes to the argument "method" in function "optim".

dist specifies three distributions of response tensor: binary, poisson and normal distributions.

Value

a list containing the following:

rank a vector with selected rank with minimal BIC

result a matrix containing rank candidate and its loglikelihood and BIC on each row

Examples

seed=24
dist='binary'
data=sim_data(seed, whole_shape = c(20,20,20),
core_shape=c(3,3,3),p=c(5,5,5),dist=dist, dup=5, signal=4)
rank_range = rbind(c(3,3,3),c(3,3,2),c(3,2,2),c(2,2,2),c(3,2,3))
re = sele_rank(data$tsr[[1]],data$X_covar1,data$X_covar2,data$X_covar3,
 rank_range = rank_range,niter=10,cons = 'non',dist = dist,initial = "random")

[Package tensorregress version 5.1 Index]