dystrophy {ipred} | R Documentation |
Detection of muscular dystrophy carriers.
Description
The dystrophy
data frame has 209 rows and 10 columns.
Usage
data(dystrophy)
Format
This data frame contains the following columns:
- OBS
numeric. Observation number.
- HospID
numeric. Hospital ID number.
- AGE
numeric, age in years.
- M
numeric. Month of examination.
- Y
numeric. Year of examination.
- CK
numeric. Serum marker creatine kinase.
- H
numeric. Serum marker hemopexin.
- PK
numeric. Serum marker pyruvate kinase.
- LD
numeric. Serum marker lactate dehydroginase.
- Class
factor with levels,
carrier
andnormal
.
Details
Duchenne Muscular Dystrophy (DMD) is a genetically transmitted disease,
passed from a mother to her children. Affected female offspring usually suffer
no apparent symptoms, male offspring with the disease die at young age.
Although female carriers have no physical symptoms they tend to exhibit
elevated levels of certain serum enzymes or proteins.
The dystrophy dataset contains 209 observations of 75 female DMD carriers and
134 female DMD non-carrier. It includes 6 variables describing age of the
female and the serum parameters serum marker creatine kinase (CK), serum marker
hemopexin (H), serum marker pyruvate kinase (PK) and serum marker lactate
dehydroginase (LD). The serum markers CK and H may be measured rather
inexpensive from frozen serum, PK and LD requires fresh serum.
Source
D.Andrews and A. Herzberg (1985), Data. Berlin: Springer-Verlag.
References
Robert Tibshirani and Geoffry Hinton (1998), Coaching variables for regression and classification. Statistics and Computing 8, 25-33.
Examples
## Not run:
data("dystrophy")
library("rpart")
errorest(Class~CK+H~AGE+PK+LD, data = dystrophy, model = inbagg,
pFUN = list(list(model = lm, predict = mypredict.lm), list(model = rpart)),
ns = 0.75, estimator = "cv")
## End(Not run)