simdata {eclust} | R Documentation
Simulated Data with Environment Dependent Correlations
Description
A dataset containing simulated data for example use of the eclust
package functions. The data were generated using the s_modules
and s_generate_data functions.
Usage
simdata
Format
A matrix with 100 rows and 502 variables:
- Y
continuous response vector
- E
binary environment variable for the ECLUST method: E = 0 for unexposed (n = 50) and E = 1 for exposed (n = 50)
- columns 3:502
gene expression data for 500 genes (one gene per column). Column names are the gene names
Note
Code used to generate this data can be found on the GitHub page for this package. See URL below.
Source
https://raw.githubusercontent.com/sahirbhatnagar/eclust/master/data-raw/simulated-data-processing.R
References
Bhatnagar, SR., Yang, Y., Blanchette, M., Bouchard, L., Khundrakpam, B., Evans, A., Greenwood, CMT. (2016+). An analytic approach for interpretable predictive models in high dimensional data, in the presence of interactions with exposures. Preprint.
Examples
simdata[1:5, 1:10]
table(simdata[,"E"])
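
The layout described under Format can be unpacked into its three parts before modelling. The following is a minimal sketch, not part of the eclust package: split_by_layout is a hypothetical helper name, and it assumes the response and environment columns are named "Y" and "E" as listed under Format, with every remaining column treated as gene expression.

```r
# Hypothetical helper (not provided by eclust): split a matrix laid out like
# simdata into response, environment, and gene expression components.
# Assumes columns named "Y" and "E"; all other columns are treated as genes.
split_by_layout <- function(m) {
  stopifnot(is.matrix(m), all(c("Y", "E") %in% colnames(m)))
  gene_cols <- setdiff(colnames(m), c("Y", "E"))
  list(
    Y = m[, "Y"],                      # continuous response vector
    E = m[, "E"],                      # binary environment variable
    X = m[, gene_cols, drop = FALSE]   # expression matrix, one gene per column
  )
}

# Usage, run only if the eclust package is installed
if (requireNamespace("eclust", quietly = TRUE)) {
  data("simdata", package = "eclust")
  parts <- split_by_layout(simdata)
  print(dim(parts$X))     # subjects by genes
  print(table(parts$E))   # counts of unexposed (0) and exposed (1) subjects
}
```

Selecting the gene columns by name rather than by position keeps the helper valid if the column order changes; drop = FALSE keeps X a matrix even when only one gene column remains.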