R: Estimates delta for Iterated Stable Autoencoder

estim_delta {denoiseR}

R Documentation

Estimates delta for Iterated Stable Autoencoder

Description

This function uses cross-validation to estimate delta for the Iterated Stable Autoencoder when considering Binomial noise. delta is the probability of deletion of each cell of the data matrix

Usage

estim_delta(X, delta = seq(0.1, 0.9, length.out = 9), nbsim = 10,
  noise = "Binomial", transformation = c("None", "CA"), pNA = 0.1,
  maxiter = 1000, threshold = 1e-08)

Arguments

`X`	a data frame or a matrix with count
`delta`	vector, a sequence of values for the probability of deletion of each cell of the data matrix
`nbsim`	number of times that pNA values are inserted and predicted in the data
`noise`	noise model assumed for the data. By default and only available "Binomial"
`transformation`	estimates a transformation of the original matrix; currently, only correspondence analysis CA is available
`pNA`	percentage of missing values added in the data set
`maxiter`	integer, maximum number of iterations of the iterative imputation algorithm
`threshold`	for assessing convergence of the iterative imputation algorithm (difference between two successive iterations)

Details

For each value delta, repeated learning cross-validation consists in inserting pNA percentage of missing values in the data set and predicting them with the Iterative Stable Autoencoder. More precisely, the prediction is obtained using the iterative imputation algorithm (imputecount) which alternates steps of imputation of the missing entries and estimation of the low-rank signal. This process is repeated nbsim times for all the deltas. The mean squared error of prediction is kept for each simulation and value of delta. The value of delta leading to the smallest MSEP on average over the simulations is given.

Value

msep, matrix with the MSEP obtained for each simulation and each value of delta

delta, value giving in average the smallest MSEP over the nbsim simulations

Examples

 # A regularized Correspondence Analysis 
 ## Not run: library(FactoMineR)
 perfume <-  read.table("http://factominer.free.fr/docs/perfume.txt",header=TRUE,
 sep="\t",row.names=1)
 rownames(perfume)[4] <- "Cinema"
 
 isa.delt <- estim_delta(perfume, nbsim = 10, transformation = "CA")
 
 isa.ca <- ISA(perfume, delta = isa.delt$delta, noise = "Binomial", transformation = "CA")
 rownames(isa.ca$mu.hat) <- rownames(perfume)
 colnames(isa.ca$mu.hat) <- colnames(perfume)
 res.isa.ca <- CA(isa.ca$mu.hat, graph = FALSE)
 plot(res.isa.ca, title = "Regularized CA", cex = 0.6, selectCol = "contrib 20")
## End(Not run)

[Package denoiseR version 1.0.2 Index]