late.regu.cv {RCAL}R Documentation

Model-assisted inference for local average treatment effects (LATEs) with instrumental variables based on cross validation

Description

This function implements model-assisted inference for LATEs with instrumental variables, using regularized calibrated estimation based on cross validation.

Usage

late.regu.cv(fold, nrho = NULL, rho.seq = NULL, y, tr, iv, fx, gx, hx,
  arm = 2, d1 = NULL, d2 = NULL, ploss = "cal", yloss = "gaus",
  off = NULL, ...)

Arguments

fold

A vector of length 3 giving the fold numbers for cross validation in instrument propensity score estimation, treatment and outcome regressions respectively.

nrho

A vector of length 3 giving the numbers of tuning parameters searched in cross validation.

rho.seq

A list of three vectors giving the tuning parameters in instrument propensity score estimation (first vector), treatment (second vector) and outcome (third vector) regressions.

y

An n x 1 vector of observed outcomes.

tr

An n x 1 vector of treatment indicators (=1 if treated or 0 if untreated).

iv

An n x 1 vector of instruments (0 or 1).

fx

An n x p matix of covariates, used in the instrument propensity score model.

gx

An n x q_1 matix of covariates, used in the treatment regression models. In theory, gx should be a subvector of gx, hence p\leq q_1.

hx

An n x q_2 matix of covariates, used in the outcome regression models. In theory, hx should be a subvector of hx, hence p\leq q_2.

arm

An integer 0, 1 or 2 indicating whether \theta_0, \theta_1 or both are computed; see Details for late.aipw.

d1

Degree of truncated polynomials of fitted values from treatment regression to be included as regressors in the outcome regression (NULL: no adjustment, 0: piecewise constant, 1: piecewise linear etc.).

d2

Number of knots of fitted values from treatment regression to be included as regressors in the outcome regression, with knots specified as the i/(d2+1)-quantiles for i=1,...,d2.

ploss

A loss function used in instrument propensity score estimation (either "ml" for likelihood estimation or "cal" for calibrated estimation).

yloss

A loss function used in outcome regression (either "gaus" for continuous outcomes or "ml" for binary outcomes).

off

A 2 x 1 vector of offset values (e.g., the true values in simulations) used to calculate the z-statistics from augmented IPW estimation.

...

Additional arguments to glm.regu.cv.

Details

For ploss="cal", regularized calibrated estimation of the instrument propensity score (IPS) and regularized weighted likelihood estimation of the treatment and outcome regression models are performed. The method leads to model-assisted inference for LATE, in which condidence intervals are valid with high-dimensional data if the IPS model is correctly specified, but the treatment and outcome regression models may be misspecified (Sun and Tan 2020). For ploss="ml", regularized maximum likelihood estimation is used (Chernozhukov et al. 2018). In this case, standard errors are only shown to be valid if the IPS, treatment and outcome models are all correctly specified.

Value

ips

A list containing the results from fitting the instrument propensity score models by glm.regu.cv.

mfp

An n x 2 matrix of fitted instrument propensity scores for iv=0 (first column) and iv=1 (second column).

tps

A list containing the results from fitting the treatment regression models by glm.regu.cv.

mft

An n x 2 matrix of fitted treatment regression models for iv=0 (first column) and iv=1 (second column).

or

A list containing the results from fitting the outcome regression models by glm.regu.cv.

mfo

An n x 4 matrix of fitted outcome regression models for for iv=0, tr=0 (first column), iv=0, tr=1 (second column), iv=1, tr=0 (third column) and iv=1, tr=1 (fourth column). Two columns are set to NA if arm=0 or 1.

est

A list containing the results from augmented IPW estimation by late.aipw.

References

Chernozhukov, V., Chetverikov, D., Demirer, M., Duflo, E., Hansen, C., Newey, W. and Robins, J.M. (2018) Double/debiased machine learning for treatment and structural parameters, The Econometrics Journal, 21, C1–C68.

Sun, B. and Tan, Z. (2020) High-dimensional model-assisted inference for local average treatment effects with instrumental variables, arXiv:2009.09286.

Examples


data(simu.iv.data)
n <- dim(simu.iv.data)[1]
p <- dim(simu.iv.data)[2]-3

y <- simu.iv.data[,1]
tr <- simu.iv.data[,2]
iv <- simu.iv.data[,3]
x <- simu.iv.data[,3+1:p]
x <- scale(x)

late.cv.rcal <- late.regu.cv(fold=5*c(1,1,1), nrho=(1+10)*c(1,1,1), rho.seq=NULL, 
                 y, tr, iv, fx=x, gx=x, hx=x, arm=2, d1=1, d2=3, ploss="cal", yloss="gaus")

matrix(unlist(late.cv.rcal$est), ncol=2, byrow=TRUE, 
dimnames=list(c("ipw", "or", "est", "var", "ze", 
 "late.est", "late.var", "late.ze"), c("theta1", "theta0")))



[Package RCAL version 2.0 Index]