generateCountData {NBLDA} | R Documentation |
Generate Count Data
Description
This function can be used to generate counts, e.g., RNA-Sequencing data, for both the classification and clustering purposes.
Usage
generateCountData(n, p, K, param, sdsignal = 1, DE = 0.3, allZero.rm = TRUE,
tag.samples = FALSE)
Arguments
n |
number of samples. |
p |
number of variables/features. |
K |
number of classes. |
param |
overdispersion parameter. This parameter is matched with the argument |
sdsignal |
a nonzero numeric value. As |
DE |
a numeric value within the interval [0, 1]. This is the proportion of total number of variables that is significantly different among K classes. The remaining part is assumed to be having no contribution to the discrimination function. |
allZero.rm |
a logical. If TRUE, the columns having all zero cells are dropped. |
tag.samples |
a logical. If TRUE, the row names are automatically generated using a tag for each sample such as "S1", "S2", etc. |
Value
x , xte |
count data matrix for training and test set. |
y , yte |
class labels for training and test set. |
truesf , truesfte |
true size factors for training and test set. See Witten (2011) for more information on estimating size factors. |
Author(s)
Dincer Goksuluk
Examples
set.seed(2128)
counts <- generateCountData(n = 20, p = 10, K = 2, param = 1, sdsignal = 0.5, DE = 0.8,
allZero.rm = FALSE, tag.samples = TRUE)
head(counts$x)