SIMet {MetabolicSurv} | R Documentation |
Sequential Increase in Metabolites for the PCA or PLS classifier
Description
The Function fits cox proportional hazard model and does classification by sequentially increasing the metabolites using either PCA or PLS based on the topK metabolites specified.
Usage
SIMet(
TopK = 15,
Survival,
Mdata,
Censor,
Reduce = TRUE,
Select = 50,
Prognostic = NULL,
Plot = FALSE,
DimMethod = c("PLS", "PCA"),
...
)
Arguments
TopK |
Top K metabolites (15 by default) to be used in the sequential analysis. |
Survival |
A vector of survival time with length equals to number of subjects |
Mdata |
A large or small metabolic profile matrix. A matrix with metabolic profiles where the number of rows should be equal to the number of metabolites and number of columns should be equal to number of patients. |
Censor |
A vector of censoring indicator |
Reduce |
A boolean parameter indicating if the metabolic profile matrix should be reduced, default is TRUE and larger metabolic profile matrix is reduced by supervised pca approach and first pca is extracted from the reduced matrix to be used in the classifier. |
Select |
Number of metabolites to be selected from supervised PCA. This is valid only if the argument Reduce=TRUE |
Prognostic |
A dataframe containing possible prognostic(s) factor and/or treatment effect to be used in the model. |
Plot |
A boolean parameter indicating if Plot should be shown. Default is FALSE |
DimMethod |
Dimension reduction method which can either be PLS or PCA. |
... |
Additinal arguments for plotting and only valid if Plot=TRUE |
Details
This function sequentially increase the number of top K metabolites to be
used in the PCA or PLS methods in order to obtain the risk score.
This function internally calls MSpecificCoxPh
to rank the metabolites based on HR.
Therefore metabolites can be ordered based on increasing order of the HR
for low risk group. Thereafter, the function takes few top K
(15 is the default) to be used in the sequential analysis.
Value
A list containing a data frame with estimated HR along with 95% CI at each TopK value for the sequential analysis.
Result |
The hazard ratio statistics (HR, Lower confidence interval and upper confidence interval) of the lower riskgroup based for each sequential metabolite analysis |
TopKplot |
A graphical representation of the Result containing the hazard ratio statistics |
Author(s)
Olajumoke Evangelina Owokotomo, olajumoke.owokotomo@uhasselt.be
Ziv Shkedy
References
Vinzi VE, Chin WW, Henseler J, Wang H (2010). Handbook of Partial Least Squares: Concepts, Methods and Applications, 1st edition. Springer Publishing Company, Incorporated.
Bair E, Hastie T, Debashis P, Tibshirani R (2006). “Prediction by supervised principal components.” American Statistics Association,, 101(473), 119–137.
See Also
coxph
, EstimateHR
, MSpecificCoxPh
, SurvPcaClass
, SurvPcaClass
Examples
## FIRSTLY SIMULATING A METABOLIC SURVIVAL DATA
Data = MSData(nPatients = 100, nMet = 150, Prop = 0.5)
## USING THE FUNCTION
Example1 = SIMet(TopK = 10, Survival=Data$Survival,
Mdata=t(Data$Mdata), Censor=Data$Censor, Reduce = TRUE,
Select = 50,Prognostic = Data$Prognostic, Plot = TRUE, DimMethod ="PLS")
## FOR THE HR STATISTICS
Example1$Result
## FOR THE GRAPHICAL OUTPUT
Example1$TopKplot