MiInflCount {MiDA} | R Documentation |
Mean microarray probes' feature importance from binary classification
Description
Counts mean of probes' feature importance for multiple models of binary classification built on microarray gene/transcript expression data
Usage
MiInflCount(importance.list)
Arguments
importance.list |
a list of data frames, containing the result of binary classification:
probe IDs in first column and probe's feature importance (relative influence) in the second column
in the order from most important to the least important for classification.
Such list is the |
Details
This function takes the result of binary classification performed with cross-validation and counts mean of each probe's feature importance (relative influence) gained in all fitted models.
Value
a list of 2
data.mean
- data frame of probe names (in alphabetical order), their mean feature importance
and standard deviation.
data.importance
- data frame of probe IDs (in alphabetical order) and their
original feature importance values gained in all cross-validation models.
Author(s)
Elena N. Filatova
See Also
Examples
# get gene expression and specimen data
data("IMexpression");data("IMspecimen")
# sample expression matrix and specimen data for binary classification,
# only "NORM" and "EBV" specimens are left
SampleMatrix<-MiDataSample(IMexpression, IMspecimen$diagnosis,"norm", "ebv")
SampleSpecimen<-MiSpecimenSample(IMspecimen$diagnosis, "norm", "ebv")
#Fitting, low tuning for faster running
ClassRes<-MiBiClassGBODT(SampleMatrix, SampleSpecimen, n.crossval = 3,
ntrees = 10, shrinkage = 1, intdepth = 2)
# List of influence data frames for all 3 models build using cross-validation
# is the 2nd element of BiClassGBODT results
Importances<-MiInflCount(ClassRes[[2]])
Importances[[1]][1:10,] # mean and sd. 0s are for low feature importance
Importances[[2]][1:10,] # original values for n.crossval = 3 models