createData {FamilyRank}R Documentation

Simulate Data

Description

Simulate data sets meant to emulate gene expression data in oncology.

Usage

createData(n.case, n.control, mean.upper = 13, mean.lower = 5, 
sd.upper = 1, sd.lower = 1, n.features = 10000, 
subtype1.feats = 1:5, subtype2.feats = 6:10, subtype3.feats = 11:15)

Arguments

n.case

Number of cases to simulate.

n.control

Number of controls to simulate

mean.upper

Mean of upper component of bimodal Gaussian distribution from which features are simulated.

mean.lower

Mean of lower component of bimodal Gaussian distribution from which features are simulated.

sd.upper

Standard deviation of upper component of bimodal Gaussian distribution from which features are simulated.

sd.lower

Standard deviation of lower component of bimodal Gaussian distribution from which features are simulated.

n.features

Number of features to simulate

subtype1.feats

Index of features used to define subtype 1.

subtype2.feats

Index of features used to define subtype 2.

subtype3.feats

Index of features used to define subtype 3.

Details

Simulates case/control data as described in createCase and createControl, and graphical domain knowledge as described in createGraph.

Value

Returns a named list with a simulated feature matrix (x), simulated binary response vector (y), vector of subtype labels (subtype), and simulated domain knowledge graph (graph).

Author(s)

Michelle Saul

References

ADD REFERENCE

See Also

createCase, createControl, createGraph

Examples

## Toy Example
# Simulate data set
# 10 samples
# 20 features
# Features 1 through 15 perfectly define response
# All other features are random noise.
data <- createData(n.case = 5, n.control = 5, mean.upper=13, mean.lower=5,
                   sd.upper=1, sd.lower=1, n.features = 20,
                   subtype1.feats = 1:5, subtype2.feats = 6:10,
                   subtype3.feats = 11:15)
x <- data$x
y <- data$y
graph <- data$graph

[Package FamilyRank version 1.0 Index]