singh2002 {sda} | R Documentation |
Prostate Cancer Study of Singh et al. (2002)
Description
Gene expression data (6033 genes for 102 samples) from the microarray study of Singh et al. (2002).
Usage
data(singh2002)
Format
singh2002$x
is a 102 x 6033 matrix containing the expression levels.
The rows contain the samples and the columns the genes.
singh2002$y
is a factor containing the diagnosis for each sample ("cancer" or "healthy").
Details
This data set contains measurements of the gene expression of 6033 genes for 102 observations: 52 prostate cancer patients and 50 healty men.
Source
The data are described in Singh et al. (2001) and are provided in exactly the form as used by Efron (2008).
References
D. Singh et al. 2002. Gene expression correlates of clinical prostate cancer behavior. Cancer Cell 1:203–209.
Efron, B. 2008. Empirical Bayes estimates for large-scale prediction problems. Technical Report, Standford University.
Examples
# load sda library
library("sda")
# load Singh et al (2001) data set
data(singh2002)
dim(singh2002$x) # 102 6033
hist(singh2002$x)
singh2002$y # 2 levels