late.regu.cv {RCAL} | R Documentation |
Model-assisted inference for local average treatment effects (LATEs) with instrumental variables based on cross validation
Description
This function implements model-assisted inference for LATEs with instrumental variables, using regularized calibrated estimation based on cross validation.
Usage
late.regu.cv(fold, nrho = NULL, rho.seq = NULL, y, tr, iv, fx, gx, hx,
arm = 2, d1 = NULL, d2 = NULL, ploss = "cal", yloss = "gaus",
off = NULL, ...)
Arguments
fold |
A vector of length 3 giving the fold numbers for cross validation in instrument propensity score estimation, treatment and outcome regressions respectively. |
nrho |
A vector of length 3 giving the numbers of tuning parameters searched in cross validation. |
rho.seq |
A list of three vectors giving the tuning parameters in instrument propensity score estimation (first vector), treatment (second vector) and outcome (third vector) regressions. |
y |
An |
tr |
An |
iv |
An |
fx |
An |
gx |
An |
hx |
An |
arm |
An integer 0, 1 or 2 indicating whether |
d1 |
Degree of truncated polynomials of fitted values from treatment regression to be included as regressors in the outcome regression (NULL: no adjustment, 0: piecewise constant, 1: piecewise linear etc.). |
d2 |
Number of knots of fitted values from treatment regression to be included as regressors in the outcome regression, with knots specified as the |
ploss |
A loss function used in instrument propensity score estimation (either "ml" for likelihood estimation or "cal" for calibrated estimation). |
yloss |
A loss function used in outcome regression (either "gaus" for continuous outcomes or "ml" for binary outcomes). |
off |
A |
... |
Additional arguments to |
Details
For ploss
="cal", regularized calibrated estimation of the instrument propensity score (IPS) and regularized weighted likelihood estimation of the treatment and outcome regression models are performed. The method leads to model-assisted inference for LATE, in which condidence intervals are valid
with high-dimensional data if the IPS model is correctly specified, but the treatment and outcome regression models may be misspecified (Sun and Tan 2020). For ploss
="ml", regularized maximum likelihood estimation is used (Chernozhukov et al. 2018). In this case, standard errors are only shown to be valid if the IPS, treatment and outcome models are all correctly specified.
Value
ips |
A list containing the results from fitting the instrument propensity score models by |
mfp |
An |
tps |
A list containing the results from fitting the treatment regression models by |
mft |
An |
or |
A list containing the results from fitting the outcome regression models by |
mfo |
An |
est |
A list containing the results from augmented IPW estimation by |
References
Chernozhukov, V., Chetverikov, D., Demirer, M., Duflo, E., Hansen, C., Newey, W. and Robins, J.M. (2018) Double/debiased machine learning for treatment and structural parameters, The Econometrics Journal, 21, C1–C68.
Sun, B. and Tan, Z. (2020) High-dimensional model-assisted inference for local average treatment effects with instrumental variables, arXiv:2009.09286.
Examples
data(simu.iv.data)
n <- dim(simu.iv.data)[1]
p <- dim(simu.iv.data)[2]-3
y <- simu.iv.data[,1]
tr <- simu.iv.data[,2]
iv <- simu.iv.data[,3]
x <- simu.iv.data[,3+1:p]
x <- scale(x)
late.cv.rcal <- late.regu.cv(fold=5*c(1,1,1), nrho=(1+10)*c(1,1,1), rho.seq=NULL,
y, tr, iv, fx=x, gx=x, hx=x, arm=2, d1=1, d2=3, ploss="cal", yloss="gaus")
matrix(unlist(late.cv.rcal$est), ncol=2, byrow=TRUE,
dimnames=list(c("ipw", "or", "est", "var", "ze",
"late.est", "late.var", "late.ze"), c("theta1", "theta0")))