R.s.surv.estimate {Rsurrogate}R Documentation

Calculates the proportion of treatment effect explained by the surrogate marker information measured at a specified time and primary outcome information up to that specified time

Description

This function calculates the proportion of treatment effect on the primary outcome explained by the surrogate marker information measured at t0t_0 and primary outcome information up to t0t_0. The user can also request a variance estimate, estimated using perturbating-resampling, and a 95% confidence interval. If a confidence interval is requested three versions are provided: a normal approximation based interval, a quantile based interval and Fieller's confidence interval, all using perturbation-resampling. The user can also request an estimate of the incremental value of surrogate marker information.

Usage

R.s.surv.estimate(xone, xzero, deltaone, deltazero, sone, szero, t, 
weight.perturb = NULL, landmark, extrapolate = FALSE, transform = FALSE, 
conf.int = FALSE, var = FALSE, incremental.value = FALSE, approx = T)

Arguments

xone

numeric vector, the observed event times in the treatment group, X = min(T,C) where T is the time of the primary outcome and C is the censoring time.

xzero

numeric vector, the observed event times in the control group, X = min(T,C) where T is the time of the primary outcome and C is the censoring time.

deltaone

numeric vector, the event indicators for the treatment group, D = I(T<C) where T is the time of the primary outcome and C is the censoring time.

deltazero

numeric vector, the event indicators for the control group, D = I(T<C) where T is the time of the primary outcome and C is the censoring time.

sone

numeric vector; surrogate marker measurement at t0t_0 for treated observations, assumed to be continuous. If X1i<t0X_{1i}<t_0, then the surrogate marker measurement should be NA.

szero

numeric vector; surrogate marker measurement at t0t_0 for control observations, assumed to be continuous. If X1i<t0X_{1i}<t_0, then the surrogate marker measurement should be NA.

t

the time of interest.

weight.perturb

weights used for perturbation resampling.

landmark

the landmark time t0t_0 or time of surrogate marker measurement.

extrapolate

TRUE or FALSE; indicates whether the user wants to use extrapolation.

transform

TRUE or FALSE; indicates whether the user wants to use a transformation for the surrogate marker.

conf.int

TRUE or FALSE; indicates whether a 95% confidence interval for delta is requested, default is FALSE.

var

TRUE or FALSE; indicates whether a variance estimate is requested, default is FALSE.

incremental.value

TRUE or FALSE; indicates whether the user would like to see the incremental value of the surrogate marker information, default is FALSE.

approx

TRUE or FALSE indicating whether an approximation should be used when calculating the probability of censoring; most relevant in settings where the survival time of interest for the primary outcome is greater than the last observed event but before the last censored case, default is TRUE.

Details

Let GG be the binary treatment indicator with G=1G=1 for treatment and G=0G=0 for control and we assume throughout that subjects are randomly assigned to a treatment group at baseline. Let T(1)T^{(1)} and T(0)T^{(0)} denote the time of the primary outcome of interest, death for example, under the treatment and under the control, respectively. Let S(1)S^{(1)} and S(0)S^{(0)} denote the surrogate marker measured at time t0t_0 under the treatment and the control, respectively.

The residual treatment effect is defined as

ΔS(t,t0)=P(T(0)>t0){ψ1(ts,t0)dF0(st0)P(T(0)>tT(0)>t0)} \Delta_S(t,t_0) = P(T^{(0)} > t_0) \left \{\int \psi_1(t | s, t_0) dF_0(s | t_0)-P(T^{(0)}> t| T^{(0)}> t_0) \right \}

where F0(t0)F_0(\cdot | t_0) is the cumulative distribution function of S(0)S^{(0)} conditional on T(0)>t0T^{(0)}> t_0 and ψ1(ts,t0)=P(T(1)>tS(1)=s,T(1)>t0)\psi_1(t | s,t_0) = P(T^{(1)}> t | S^{(1)}=s, T ^{(1)} > t_0). The proportion of treatment effect explained by the surrogate marker information measured at t0t_0 and primary outcome information up to t0t_0, which we denote by RS(t,t0)R_S(t,t_0), can be expressed using a contrast between ΔS(t,t0)\Delta_S(t,t_0) and Δ(t)\Delta(t):

RS(t,t0)={Δ(t)ΔS(t,t0)}/Δ(t)=1ΔS(t,t0)/Δ(t).R_S(t,t_0)=\{\Delta(t)-\Delta_S(t,t_0)\}/\Delta(t)=1-\Delta_S(t,t_0)/\Delta(t).

The definition and estimation of Δ(t)\Delta(t) is described in the delta.surv.estimate documentation.

Due to censoring, our data consist of n1n_1 observations {(X1i,δ1i,S1i),i=1,...,n1}\{(X_{1i}, \delta_{1i}, S_{1i}), i=1,...,n_1\} from the treatment group G=1G=1 and n0n_0 observations {(X0i,δ0i,S0i),i=1,...,n0}\{(X_{0i}, \delta_{0i}, S_{0i}), i=1,...,n_0\} from the control group G=0G=0 where Xgi=min(Tgi,Cgi)X_{gi} = \min(T_{gi}, C_{ gi}), δgi=I(Tgi<Cgi)\delta_{gi} = I(T_{gi} < C_{gi}), CgiC_{gi} denotes the censoring time, and SgiS_{gi} denotes the surrogate marker information measured at time t0t_0, for g=1,0g= 1,0, for individual ii. Note that if Xgi<t0X_{gi} < t_0, then SgiS_{gi} should be NA (not available).

To estimate ΔS(t,t0)\Delta_S(t,t_0), we use a nonparametric kernel Nelson-Aalen estimator to estimate ψ1(ts,t0)\psi_1(t | s,t_0) as ψ^1(ts,t0)=exp{Λ^1(ts,t0)}\hat{\psi}_1(t| s,t_0) = \exp\{-\hat{\Lambda}_1(t| s,t_0) \}, where

Λ^1(ts,t0)=t0ti=1n1I(X1i>t0)Kh{γ(S1i)γ(s)}dN1i(z)i=1n1I(X1i>t0)Kh{γ(S1i)γ(s)}Y1i(z),\hat{\Lambda}_1(t| s,t_0) = \int_{t_0}^t \frac{\sum_{i=1}^{n_1} I(X_{1i}>t_0) K_h\{\gamma(S_{1i}) - \gamma(s)\}dN_{1i}(z)}{\sum_{i=1}^{n_1} I(X_{1i}>t_0) K_h\{\gamma(S_{1i}) - \gamma(s)\} Y_{1i}(z)},

is a consistent estimate of Λ1(ts,t0)=log[ψ1(ts,t0)],Y1i(t)=I(X1it),N1i(t)=I(X1it)δi,K()\Lambda_1(t| s,t_0 ) = -\log [\psi_1(t| s,t_0)], Y_{1i}(t) = I(X_{1i} \geq t), N_{1i}(t) = I(X_{1i} \leq t) \delta_i, K(\cdot) is a smooth symmetric density function, Kh(x)=K(x/h)/h,γ()K_h(x) = K(x/h)/h, \gamma(\cdot) is a given monotone transformation function, and hh is a specified bandwidth. To obtain an appropriate hh we first use bw.nrd to obtain hopth_{opt}; and then we let h=hoptn1c0h = h_{opt}n_1^{-c_0} with c0=0.11c_0 = 0.11.

Since F0(st0)=P(S0isX0i>t0)F_0(s | t_0) = P(S_{0i} \le s | X_{0i} > t_0), we empirically estimate F0(st0)F_0(s | t_0) using all subjects with X0i>t0X_{0i} > t_0 as

F^0(st0)=i=1n0I(S0is,X0i>t0)i=1n0I(X0i>t0). \hat{F}_0(s | t_0) = \frac{\sum_{i=1}^{n_0} I(S_{0i} \le s, X_{0i} > t_0)}{\sum_{i=1}^{n_0} I(X_{0i} > t_0)}.

Subsequently, we construct an estimator for ΔS(t,t0)\Delta_{S}(t,t_0) as

Δ^S(t,t0)=n01i=1n0[ψ^1(tS0i,t0)I(X0i>t0)W^0C(t0)I(X0i>t)W^0C(t)]\hat{\Delta}_S(t,t_0) = n_0^{-1} \sum_{i=1}^{n_0} \left[\hat{\psi}_1(t| S_{0i},t_0) \frac{I(X_{0i} > t_0)}{\hat{W}^C_0(t_0)} - \frac{I(X_{0i} > t)}{\hat{W}^C_0(t)}\right]

where W^gC()\hat{W}^C_g(\cdot) is the Kaplan-Meier estimator of survival for censoring for g=1,0.g=1,0. Finally, we estimate RS(t,t0)R_S(t,t_0) as R^S(t,t0)=1Δ^S(t,t0)/Δ^(t).\hat{R}_S(t,t_0) =1- \hat{\Delta}_S(t,t_0)/\hat{\Delta}(t).

Variance estimation and confidence interval construction are performed using perturbation-resampling. Specifically, let {V(b)=(V11(b),...V1n1(b),V01(b),...V0n0(b))T,b=1,....,D}\left \{ V^{(b)} = (V_{11}^{(b)}, ...V_{1n_1}^{(b)}, V_{01}^{(b)}, ...V_{0n_0}^{(b)})^T, b=1,....,D \right \} be n×Dn \times D independent copies of a positive random variables VV from a known distribution with unit mean and unit variance. Let

Δ^(b)(t)=i=1n1V1i(b)I(X1i>t)i=1n1V1i(b)W^1C(b)(t)i=1n0V0i(b)I(X0i>t)i=1n0V0i(b)W^0C(b)(t).\hat{\Delta}^{(b)}(t) = \frac{ \sum_{i=1}^{n_1} V_{1i}^{(b)} I(X_{1i}>t)}{ \sum_{i=1}^{n_1} V_{1i}^{(b)} \hat{W}_1^{C(b)}(t)} -\frac{ \sum_{i=1}^{n_0} V_{0i}^{(b)} I(X_{0i}>t)}{ \sum_{i=1}^{n_0} V_{0i}^{(b)} \hat{W}_0^{C(b)}(t)}.

In this package, we use weights generated from an Exponential(1) distribution and use D=500D=500. The variance of Δ^(t)\hat{\Delta}(t) is obtained as the empirical variance of {Δ^(t)(b),b=1,...,D}\{\hat{\Delta}(t)^{(b)}, b = 1,...,D\}. Variance estimates for Δ^S(t,t0)\hat{\Delta}_S(t,t_0) and R^S(t,t0)\hat{R}_S(t,t_0) are calculated similarly. We construct two versions of the 95%95\% confidence interval for each estimate: one based on a normal approximation confidence interval using the estimated variance and another taking the 2.5th and 97.5th empirical percentile of the perturbed quantities. In addition, we use Fieller's method to obtain a third confidence interval for RS(t,t0)R_S(t,t_0) as

{1r:(Δ^S(t,t0)rΔ^(t))2σ^112rσ^12+r2σ^22cα},\left\{1-r: \frac{(\hat{\Delta}_S(t,t_0)-r\hat{\Delta}(t))^2}{\hat{\sigma}_{11}-2r\hat\sigma_{12}+r^2\hat\sigma_{22}} \le c_{\alpha}\right\},

where Σ^=(σ^ij)1i,j2\hat{\Sigma}=(\hat\sigma_{ij})_{1\le i,j\le 2} and cαc_\alpha is the (1α)(1-\alpha)th percentile of

{{Δ^S(b)(t)(1R^S(t,t0))Δ^(t)(b)}2σ^112(1R^S(t,t0))σ^12+(1R^S(t,t0))2σ^22,b=1,,C}\left\{\frac{\{\hat{\Delta}^{(b)}_S(t)-(1-\hat R_S(t,t_0))\hat{\Delta}(t)^{(b)}\}^2}{\hat{\sigma}_{11}-2(1-\hat R_S(t,t_0))\hat\sigma_{12}+(1-\hat R_S(t,t_0))^2\hat\sigma_{22}}, b=1, \cdots, C\right\}

where α=0.05\alpha=0.05.

Since the definition of RS(t,t0)R_S(t,t_0) considers the surrogate information as a combination of both SS information and TT information up to t0t_0, a logical inquiry would be how to assess the incremental value of the SS information in terms of the proportion of treatment effect explained, when added to TT information up to t0t_0. The proportion of treatment effect explained by TT information up to t0t_0 only is denoted as RT(t,t0)R_T(t,t_0) and is described in the documentation for R.t.surv.estimate. The incremental value of SS information is defined as:

IVS(t,t0)=RS(t,t0)RT(t,t0)=ΔT(t,t0)ΔS(t,t0)Δ(t). IV_S(t,t_0) = R_S(t,t_0) - R_T(t,t_0) = \frac{\Delta_T(t,t_0) - \Delta_S(t,t_0)}{\Delta (t)}.

For estimation of RT(t0)R_T(t_0), see documentation for R.t.surv.estimate. The quantity IVS(t,t0)IV_S(t,t_0) is then estimated by IV^S(t,t0)=R^S(t,t0)R^T(t,t0)\hat{IV}_S(t,t_0) = \hat{R}_S(t,t_0) - \hat{R}_T(t,t_0). Perturbation-resampling is used for variance estimation and confidence interval construction for this quantity, similar to the other quantities in this package.

Note that if the observed supports for S are not the same, then Λ^1(ts,t0)\hat{\Lambda}_1(t| s,t_0) for S0i=sS_{0i} = s outside the support of S1iS_{1i} may return NA (depending on the bandwidth). If extrapolation = TRUE, then the Λ^1(ts,t0)\hat{\Lambda}_1(t| s,t_0) values for these surrogate values are set to the closest non-NA value. If transform = TRUE, then S1iS_{1i} and S0iS_{0i} are transformed such that the new transformed values, S1itrS^{tr}_{1i} and S0itrS^{tr}_{0i} are defined as: Sgitr=F([Sgiμ]/σ)S^{tr}_{gi} = F([S_{gi} - \mu]/\sigma) for g=0,1g=0,1 where F()F(\cdot) is the cumulative distribution function for a standard normal random variable, and μ\mu and σ\sigma are the sample mean and standard deviation, respectively, of {(S1i,S0i)T,is.t.Xgi>t0}\{(S_{1i}, S_{0i})^T, i \quad s.t. X_{gi} > t_0\}.

Value

A list is returned:

delta

the estimate, Δ^(t)\hat{\Delta}(t), described in delta.estimate documentation.

delta.s

the estimate, Δ^S(t,t0)\hat{\Delta}_S(t,t_0), described above.

R.s

the estimate, R^S(t,t0)\hat{R}_S(t,t_0), described above.

delta.var

the variance estimate of Δ^(t)\hat{\Delta}(t); if var = TRUE or conf.int = TRUE.

delta.s.var

the variance estimate of Δ^S(t,t0)\hat{\Delta}_S(t,t_0); if var = TRUE or conf.int = TRUE.

R.s.var

the variance estimate of R^S(t,t0)\hat{R}_S(t,t_0); if var = TRUE or conf.int = TRUE.

conf.int.normal.delta

a vector of size 2; the 95% confidence interval for Δ^(t)\hat{\Delta}(t) based on a normal approximation; if conf.int = TRUE.

conf.int.quantile.delta

a vector of size 2; the 95% confidence interval for Δ^(t)\hat{\Delta}(t) based on sample quantiles of the perturbed values, described above; if conf.int = TRUE.

conf.int.normal.delta.s

a vector of size 2; the 95% confidence interval for Δ^S(t,t0)\hat{\Delta}_S(t,t_0) based on a normal approximation; if conf.int = TRUE.

conf.int.quantile.delta.s

a vector of size 2; the 95% confidence interval for Δ^S(t,t0)\hat{\Delta}_S(t,t_0) based on sample quantiles of the perturbed values, described above; if conf.int = TRUE.

conf.int.normal.R.s

a vector of size 2; the 95% confidence interval for R^S(t,t0)\hat{R}_S(t,t_0) based on a normal approximation; if conf.int = TRUE.

conf.int.quantile.R.s

a vector of size 2; the 95% confidence interval for R^S(t,t0)\hat{R}_S(t,t_0) based on sample quantiles of the perturbed values, described above; if conf.int = TRUE.

conf.int.fieller.R.s

a vector of size 2; the 95% confidence interval for R^S(t,t0)\hat{R}_S(t,t_0) based on Fieller's approach, described above; if conf.int = TRUE.

delta.t

the estimate, Δ^T(t,t0)\hat{\Delta}_T(t,t_0), described above; if incremental.vaue = TRUE.

R.t

the estimate, R^T(t,t0)\hat{R}_T(t,t_0), described above; if incremental.vaue = TRUE.

incremental.value

the estimate, IV^S(t,t0)\hat{IV}_S(t,t_0), described above; if incremental.vaue = TRUE.

delta.t.var

the variance estimate of Δ^T(t,t0)\hat{\Delta}_T(t,t_0); if var = TRUE or conf.int = TRUE and incremental.vaue = TRUE.

R.t.var

the variance estimate of R^T(t,t0)\hat{R}_T(t,t_0); if var = TRUE or conf.int = TRUE and incremental.vaue = TRUE.

incremental.value.var

the variance estimate of IV^S(t,t0)\hat{IV}_S(t,t_0); if var = TRUE or conf.int = TRUE and incremental.vaue = TRUE.

conf.int.normal.delta.t

a vector of size 2; the 95% confidence interval for Δ^T(t,t0)\hat{\Delta}_T(t,t_0) based on a normal approximation; if conf.int = TRUE and incremental.vaue = TRUE.

conf.int.quantile.delta.t

a vector of size 2; the 95% confidence interval for Δ^T(t,t0)\hat{\Delta}_T(t,t_0) based on sample quantiles of the perturbed values, described above; if conf.int = TRUE and incremental.vaue = TRUE.

conf.int.normal.R.t

a vector of size 2; the 95% confidence interval for R^T(t,t0)\hat{R}_T(t,t_0) based on a normal approximation; if conf.int = TRUE and incremental.vaue = TRUE.

conf.int.quantile.R.t

a vector of size 2; the 95% confidence interval for R^T(t,t0)\hat{R}_T(t,t_0) based on sample quantiles of the perturbed values, described above; if conf.int = TRUE and incremental.vaue = TRUE.

conf.int.fieller.R.t

a vector of size 2; the 95% confidence interval for R^T(t,t0)\hat{R}_T(t,t_0) based on Fieller's approach, described above; if conf.int = TRUE and incremental.vaue = TRUE.

conf.int.normal.iv

a vector of size 2; the 95% confidence interval for IV^S(t,t0)\hat{IV}_S(t,t_0) based on a normal approximation; if conf.int = TRUE and incremental.vaue = TRUE.

conf.int.quantile.iv

a vector of size 2; the 95% confidence interval for IV^S(t,t0)\hat{IV}_S(t,t_0) based on sample quantiles of the perturbed values, described above; if conf.int = TRUE and incremental.vaue = TRUE.

Note

If the treatment effect is not significant, the user will receive the following message: "Warning: it looks like the treatment effect is not significant; may be difficult to interpret the residual treatment effect in this setting". If the observed support of the surrogate marker for the control group is outside the observed support of the surrogate marker for the treatment group, the user will receive the following message: "Warning: observed supports do not appear equal, may need to consider a transformation or extrapolation".

Author(s)

Layla Parast

References

Parast, L., Cai, T., & Tian, L. (2017). Evaluating surrogate marker information using censored data. Statistics in Medicine, 36(11), 1767-1782.

Examples

data(d_example_surv)
names(d_example_surv)

[Package Rsurrogate version 3.2 Index]