ICA.BinBin {Surrogate}R Documentation

Assess surrogacy in the causal-inference single-trial setting in the binary-binary case

Description

The function ICA.BinBin quantifies surrogacy in the single-trial causal-inference framework (individual causal association and causal concordance) when both the surrogate and the true endpoints are binary outcomes. See Details below.

Usage

ICA.BinBin(pi1_1_, pi1_0_, pi_1_1, pi_1_0, pi0_1_, pi_0_1,
Monotonicity=c("General"), Sum_Pi_f = seq(from=0.01, to=0.99, by=.01),
M=10000, Volume.Perc=0, Seed=sample(1:100000, size=1))

Arguments

pi1_1_

A scalar or vector that contains values for P(T=1,S=1Z=0)P(T=1,S=1|Z=0), i.e., the probability that S=T=1S=T=1 when under treatment Z=0Z=0. A vector is specified to account for uncertainty, i.e., rather than keeping P(T=1,S=1Z=0)P(T=1,S=1|Z=0) fixed at one estimated value, a distribution can be specified (see examples below) from which a value is drawn in each run.

pi1_0_

A scalar or vector that contains values for P(T=1,S=0Z=0)P(T=1,S=0|Z=0).

pi_1_1

A scalar or vector that contains values for P(T=1,S=1Z=1)P(T=1,S=1|Z=1).

pi_1_0

A scalar or vector that contains values for P(T=1,S=0Z=1)P(T=1,S=0|Z=1).

pi0_1_

A scalar or vector that contains values for P(T=0,S=1Z=0)P(T=0,S=1|Z=0).

pi_0_1

A scalar or vector that contains values for P(T=0,S=1Z=1)P(T=0,S=1|Z=1).

Monotonicity

Specifies which assumptions regarding monotonicity should be made: Monotonicity=c("General"), Monotonicity=c("No"), Monotonicity=c("True.Endp"), Monotonicity=c("Surr.Endp"), or Monotonicity=c("Surr.True.Endp"). See Details below. Default Monotonicity=c("General").

Sum_Pi_f

A scalar or vector that specifies the grid of values G=g1,g2,...,gkG={g_{1},\: g_{2},\:...,\: g_{k}} to be considered when the sensitivity analysis is conducted. See Details below. Default Sum_Pi_f = seq(from=0.01, to=0.99, by=.01).

M

The number of runs that are conducted for a given value of Sum_Pi_f. This argument is not used when Volume.Perc=0. Default M=10000.

Volume.Perc

Note that the marginals that are observable in the data set a number of restrictions on the unidentified correlations. For example, under montonicity for SS and TT, it holds that π0111<=min(π01,π11)\pi_{0111}<=min(\pi_{0\cdot1\cdot}, \pi_{\cdot1\cdot1}) and π1100<=min(π10,π10)\pi_{1100}<=min(\pi_{1\cdot0\cdot}, \pi_{\cdot1\cdot0}). For example, when min(π01,π11)=0.10min(\pi_{0\cdot1\cdot}, \pi_{\cdot1\cdot1})=0.10 and min(π10,π10)=0.08min(\pi_{1\cdot0\cdot}, \pi_{\cdot1\cdot0})=0.08, then all valid π0111<=0.10\pi_{0111}<=0.10 and all valid π1100<=0.08\pi_{1100}<=0.08. The argument Volume.Perc specifies the fraction of the 'volume' of the paramater space that is explored. This volume is computed based on the grids G={0, 0.01, ..., maximum possible value for the counterfactual probability at hand}. E.g., in the previous example, the 'volume' of the parameter space would be 119=9911*9=99, and when e.g., the argument Volume.Perc=1 is used a total of 9999 runs will be conducted for each given value of Sum_Pi_f. Notice that when monotonicity is not assumed, relatively high values of Volume.Perc will lead to a large number of runs and consequently a long analysis time.

Seed

The seed to be used to generate πr\pi_r. Default Seed=sample(1:100000, size=1).

Details

In the continuous normal setting, surroagacy can be assessed by studying the association between the individual causal effects on SS and TT (see ICA.ContCont). In that setting, the Pearson correlation is the obvious measure of association.

When SS and TT are binary endpoints, multiple alternatives exist. Alonso et al. (2014) proposed the individual causal association (ICA; RH2R_{H}^{2}), which captures the association between the individual causal effects of the treatment on SS (ΔS\Delta_S) and TT (ΔT\Delta_T) using information-theoretic principles.

The function ICA.BinBin computes RH2R_{H}^{2} based on plausible values of the potential outcomes. Denote by Y=(T0,T1,S0,S1)\bold{Y}'=(T_0,T_1,S_0,S_1) the vector of potential outcomes. The vector Y\bold{Y} can take 16 values and the set of parameters πijpq=P(T0=i,T1=j,S0=p,S1=q)\pi_{ijpq}=P(T_0=i,T_1=j,S_0=p,S_1=q) (with i,j,p,q=0/1i,j,p,q=0/1) fully characterizes its distribution.

However, the parameters in πijpq\pi_{ijpq} are not all functionally independent, e.g., 1=π1=\pi_{\cdot\cdot\cdot\cdot}. When no assumptions regarding monotonicity are made, the data impose a total of 77 restrictions, and thus only 99 proabilities in πijpq\pi_{ijpq} are allowed to vary freely (for details, see Alonso et al., 2014). Based on the data and assuming SUTVA, the marginal probabilites π11\pi_{1 \cdot 1 \cdot}, π10\pi_{1 \cdot 0 \cdot}, π11\pi_{\cdot 1 \cdot 1}, π10\pi_{\cdot 1 \cdot 0}, π01\pi_{0 \cdot 1 \cdot}, and π01\pi_{\cdot 0 \cdot 1} can be computed (by hand or using the function MarginalProbs). Define the vector

b=(1,π11,π10,π11,π10,π01,π01)\bold{b}'=(1, \pi_{1 \cdot 1 \cdot}, \pi_{1 \cdot 0 \cdot}, \pi_{\cdot 1 \cdot 1}, \pi_{\cdot 1 \cdot 0}, \pi_{0 \cdot 1 \cdot}, \pi_{\cdot 0 \cdot 1})

and A\bold{A} is a contrast matrix such that the identified restrictions can be written as a system of linear equation

Aπ=b.\bold{A \pi} = \bold{b}.

The matrix A\bold{A} has rank 77 and can be partitioned as A=(ArAf)\bold{A=(A_r | A_f)}, and similarly the vector π\bold{\pi} can be partitioned as π=(πrπf)\bold{\pi^{'}=(\pi_r^{'} | \pi_f^{'})} (where ff refers to the submatrix/vector given by the 99 last columns/components of A/π\bold{A/\pi}). Using these partitions the previous system of linear equations can be rewritten as

Arπr+Afπf=b.\bold{A_r \pi_r + A_f \pi_f = b}.

The following algorithm is used to generate plausible distributions for Y\bold{Y}. First, select a value of the specified grid of values (specified using Sum_Pi_f in the function call). For k=1k=1 to MM (specified using M in the function call), generate a vector πf\pi_f that contains 99 components that are uniformly sampled from hyperplane subject to the restriction that the sum of the generated components equals Sum_Pi_f (the function RandVec, which uses the randfixedsum algorithm written by Roger Stafford, is used to obtain these components). Next, πr=Ar1(bAfπf)\bold{\pi_r=A_r^{-1}(b - A_f \pi_f)} is computed and the πr\pi_r vectors where all components are in the [0;1][0;\:1] range are retained. This procedure is repeated for each of the Sum_Pi_f values. Based on these results, RH2R_H^2 is estimated. The obtained values can be used to conduct a sensitivity analysis during the validation exercise.

The previous developments hold when no monotonicity is assumed. When monotonicity for SS, TT, or for SS and TT is assumed, some of the probabilities of π\pi are zero. For example, when montonicity is assumed for TT, then P(T0<=T1)=1P(T_0 <= T_1)=1, or equivantly, π1000=π1010=π1001=π1011=0\pi_{1000}=\pi_{1010}=\pi_{1001}=\pi_{1011}=0. When monotonicity is assumed, the procedure described above is modified accordingly (for details, see Alonso et al., 2014). When a general analysis is requested (using Monotonicity=c("General") in the function call), all settings are considered (no monotonicity, monotonicity for SS alone, for TT alone, and for both for SS and TT.)

To account for the uncertainty in the estimation of the marginal probabilities, a vector of values can be specified from which a random draw is made in each run (see Examples below).

Value

An object of class ICA.BinBin with components,

Pi.Vectors

An object of class data.frame that contains the valid π\pi vectors.

R2_H

The vector of the RH2R_H^2 values.

Theta_T

The vector of odds ratios for TT.

Theta_S

The vector of odds ratios for SS.

H_Delta_T

The vector of the entropies of ΔT\Delta_T.

Monotonicity

The assumption regarding monotonicity that was made.

Volume.No

The 'volume' of the parameter space when monotonicity is not assumed. Is only provided when the argument Volume.PercVolume.Perc is used (i.e., when it is not equal to 00.

Volume.T

The 'volume' of the parameter space when monotonicity for TT is assumed. Is only provided when the argument Volume.PercVolume.Perc is used.

Volume.S

The 'volume' of the parameter space when monotonicity for SS is assumed. Is only provided when the argument Volume.PercVolume.Perc is used.

Volume.ST

The 'volume' of the parameter space when monotonicity for SS and TT is assumed. Is only provided when the argument Volume.PercVolume.Perc is used.

Author(s)

Wim Van der Elst, Paul Meyvisch, Ariel Alonso & Geert Molenberghs

References

Alonso, A., Van der Elst, W., & Molenberghs, G. (2015). Validation of surrogate endpoints: the binary-binary setting from a causal inference perspective.

See Also

ICA.ContCont, MICA.ContCont

Examples

## Not run: # Time consuming code part
# Compute R2_H given the marginals specified as the pi's, making no
# assumptions regarding monotonicity (general case)
ICA <- ICA.BinBin(pi1_1_=0.2619048, pi1_0_=0.2857143, pi_1_1=0.6372549,
pi_1_0=0.07843137, pi0_1_=0.1349206, pi_0_1=0.127451, Seed=1,
Monotonicity=c("General"), Sum_Pi_f = seq(from=0.01, to=.99, by=.01), M=10000)

# obtain plot of the results
plot(ICA, R2_H=TRUE)

# Example 2 where the uncertainty in the estimation
# of the marginals is taken into account
ICA_BINBIN2 <- ICA.BinBin(pi1_1_=runif(10000, 0.2573, 0.4252),
pi1_0_=runif(10000, 0.1769, 0.3310),
pi_1_1=runif(10000, 0.5947, 0.7779),
pi_1_0=runif(10000, 0.0322, 0.1442),
pi0_1_=runif(10000, 0.0617, 0.1764),
pi_0_1=runif(10000, 0.0254, 0.1315),
Monotonicity=c("General"),
Sum_Pi_f = seq(from=0.01, to=0.99, by=.01),
M=50000, Seed=1)

# Plot results
plot(ICA_BINBIN2)

## End(Not run)

[Package Surrogate version 3.3.0 Index]