R: Assess surrogacy for two survival endpoints based on...

SurvSurv {Surrogate}

R Documentation

Assess surrogacy for two survival endpoints based on information theory and a two-stage approach

Description

The function SurvSurv implements the information-theoretic approach to estimate individual-level surrogacy (i.e., R^2_{h.ind}) and the two-stage approach to estimate trial-level surrogacy (R^2_{trial}, R^2_{ht}) when both endpoints are time-to-event variables (Alonso & Molenberghs, 2008). See the Details section below.

Usage

SurvSurv(Dataset, Surr, SurrCens, True, TrueCens, Treat,
Trial.ID, Weighted=TRUE, Alpha=.05)

Arguments

`Dataset`	A `data.frame` that should consist of one line per patient. Each line contains (at least) a surrogate value and censoring indicator, a true endpoint value and censoring indicator, a treatment indicator, and a trial ID.
`Surr`	The name of the variable in `Dataset` that contains the surrogate endpoint values.
`SurrCens`	The name of the variable in `Dataset` that contains the censoring indicator for the surrogate endpoint values (1 = event, 0 = censored).
`True`	The name of the variable in `Dataset` that contains the true endpoint values.
`TrueCens`	The name of the variable in `Dataset` that contains the censoring indicator for the true endpoint values (1 = event, 0 = censored).
`Treat`	The name of the variable in `Dataset` that contains the treatment indicators.
`Trial.ID`	The name of the variable in `Dataset` that contains the trial ID to which the patient belongs.
`Weighted`	Logical. If `TRUE`, then a weighted regression analysis is conducted at stage 2 of the two-stage approach. If `FALSE`, then an unweighted regression analysis is conducted at stage 2 of the two-stage approach. See the Details section below. Default `TRUE`.
`Alpha`	The `\alpha`-level that is used to determine the confidence intervals around `R^2_{trial}` and `R_{trial}`. Default `0.05`.

Details

Individual-level surrogacy

Alonso & Molenbergs (2008) proposed to redefine the surrogate endpoint S as a time-dependent covariate S(t), taking value 0 until the surrogate endpoint occurs and 1 thereafter. Furthermore, these author considered the models

\lambda [t \mid x_{ij}, \beta] = K_{ij}(t) \lambda_{0i}(t) exp(\beta x_{ij}),

\lambda [t \mid x_{ij}, s_{ij}, \beta, \phi] = K_{ij}(t) \lambda_{0i}(t) exp(\beta x_{ij} + \phi S_{ij}),

where K_{ij}(t) is the risk function for patient j in trial i, x_{ij} is a p-dimensional vector of (possibly) time-dependent covariates, \beta is a p-dimensional vector of unknown coefficients, \lambda_{0i}(t) is a trial-specific baseline hazard function, S_{ij} is a time-dependent covariate version of the surrogate endpoint, and \phi its associated effect.

The mutual information between S and T is estimated as I(T,S)=\frac{1}{n}G^2, where n is the number of patients and G^2 is the log likelihood test comparing the previous two models. Individual-level surrogacy can then be estimated as

R^2_{h.ind} = 1 - exp \left(-\frac{1}{n}G^2 \right).

O'Quigley and Flandre (2006) pointed out that the previous estimator depends upon the censoring mechanism, even when the censoring mechanism is non-informative. For low levels of censoring this may not be an issue of much concern but for high levels it could lead to biased results. To properly cope with the censoring mechanism in time-to-event outcomes, these authors proposed to estimate the mutual information as {I}(T,S)=\frac{1}{k}G^2, where k is the total number of events experienced. Individual-level surrogacy is then estimated as

R^2_{h.ind} = 1 - exp \left(-\frac{1}{k}G^2 \right).

Trial-level surrogacy

A two-stage approach is used to estimate trial-level surrogacy, following a procedure proposed by Buyse et al. (2011). In stage 1, the following trial-specific Cox proportional hazard models are fitted:

S_{ij}(t)=S_{i0}(t) exp(\alpha_{i}Z_{ij}),

T_{ij}(t)=T_{i0}(t) exp(\beta_{i}Z_{ij}),

where S_{i0}(t) and T_{i0}(t) are the trial-specific baseline hazard functions, Z_{ij} is the treatment indicator for subject j in trial i, and \alpha_{i}, \beta_{i} are the trial-specific treatment effects on S and T, respectively.

Next, the second stage of the analysis is conducted:

\widehat{\beta_{i}}=\lambda_{0}+\lambda_{1}\widehat{\alpha_{i}}+\varepsilon_{i},

where the parameter estimates for \beta_i and \alpha_i are based on the full model that was fitted in stage 1.

When the argument Weighted=FALSE is used in the function call, the model that is fitted in stage 2 is an unweighted linear regression model. When a weighted model is requested (using the argument Weighted=TRUE in the function call), the information that is obtained in stage 1 is weighted according to the number of patients in a trial.

The classical coefficient of determination of the fitted stage 2 model provides an estimate of R^2_{trial}.

Value

An object of class SurvSurv with components,

`Results.Stage.1`	The results of stage 1 of the two-stage model fitting approach: a `data.frame` that contains the trial-specific log hazard ratio estimates of the treatment effects for the surrogate and the true endpoints.
`Results.Stage.2`	An object of class `lm` (linear model) that contains the parameter estimates of the regression model that is fitted in stage 2 of the analysis.
`R2.ht`	A `data.frame` that contains the trial-level coefficient of determination (`R^2_{ht}`), its standard error and confidence interval.
`R2.hind`	A `data.frame` that contains the individual-level coefficient of determination (`R^2_{hind}`), its standard error and confidence interval.
`R2h.ind.QF`	A `data.frame` that contains the individual-level coefficient of determination using the correction proposed by O'Quigley and Flandre (2006), its standard error and confidence interval.
`R2.hInd.By.Trial.QF`	A `data.frame` that contains individual-level surrogacy estimates using the correction proposed by O'Quigley and Flandre (2006), (cluster-based estimates) and their confidence interval for each of the trials seperately.

Author(s)

Wim Van der Elst, Ariel Alonso, & Geert Molenberghs

References

Alonso, A. A., & Molenberghs, G. (2008). Evaluating time-to-cancer recurrence as a surrogate marker for survival from an information theory perspective. Statistical Methods in Medical Research, 17, 497-504.

Buyse, M., Michiels, S., Squifflet, P., Lucchesi, K. J., Hellstrand, K., Brune, M. L., Castaigne, S., Rowe, J. M. (2011). Leukemia-free survival as a surrogate end point for overall survival in the evaluation of maintenance therapy for patients with acute myeloid leukemia in complete remission. Haematologica, 96, 1106-1112.

O'Quigly, J., & Flandre, P. (2006). Quantification of the Prentice criteria for surrogate endpoints. Biometrics, 62, 297-300.

Examples

# Open Ovarian dataset
data(Ovarian)

# Conduct analysis
Fit <- SurvSurv(Dataset = Ovarian, Surr = Pfs, SurrCens = PfsInd,
True = Surv, TrueCens = SurvInd, Treat = Treat, 
Trial.ID = Center)

# Examine results 
plot(Fit)
summary(Fit)

[Package Surrogate version 3.3.0 Index]