mlhighFrail {highMLR}R Documentation

mlhighFrail

Description

This function extracts features based on minimum log-Loss function using Cox proportional hazard model as learner method on a high dimensional survival data. For those genes, we obtain frailty variances using CoxPH.

Usage

mlhighFrail(
  cols,
  idSurv,
  idEvent,
  idFrail,
  dist = "gaussian",
  per = 20,
  fold = 3,
  data
)

Arguments

cols

A numeric vector of column numbers indicating the features for which the log Loss functions are to be computed

idSurv

The name of the survival time variable

idEvent

The name of the survival event variable

idFrail

The name of the frailty variable

dist

The name of the frailty distribution. Options are "gamma", "gaussian" or "t", default is "gaussian"

per

Percentage of features to be selected, default value 20

fold

An integer denoting number of folds in cross validation, default value 3

data

A data frame that contains the survival and covariate information for the subjects

Details

Performs CoxPH frailty on high doimensional survival data

Using the Cox proportional hazard model on the given survival data, this function selects the most significant feature based on minimum logarithmic loss function. The logarithmic loss function is defined as,

L(f,t)=-log(f(t))

After selcting the most significant features, a Cox proportional hazard frailty model is fitted on the selected features. The CoxPH frailty model is defined as,

\lambda(t)=\lambda 0(t)\nu exp{X'\beta}

where \nu is called the frailty component. The variance of the frailty term is considered as the heterogeneity among the subjects or patients. The distribution of frailty component is considered as either Gaussian, Gamma or t distribution.

Value

A dataframe containing desired number of features with corresponding frailty variances.

Author(s)

Atanu Bhattacharjee, Gajendra K. Vishwakarma & Souvik Banerjee

References

Sonabend, R., Kiraly, F. J., Bender, A., Bernd Bischl B. and Lang M. mlr3proba: An R Package for Machine Learning in Survival Analysis, 2021, Bioinformatics, <https://doi.org/10.1093/bioinformatics/btab039>

See Also

mlhighHet, mlhighCox

Examples

## Not run: 
data(hnscc)
mlhighFrail(cols=c(10:20), idSurv="OS", idEvent="Death", idFrail="ID", dist="gaussian",
per=20, fold = 3, data=hnscc)

## End(Not run)

[Package highMLR version 0.1.1 Index]