R: Function performing the test of point identification on a...

point_ident_test {RegCombin}

R Documentation

Function performing the test of point identification on a validation sample.

Description

Function performing the test of point identification on a validation sample.

Usage

point_ident_test(
  validation,
  Ldata = NULL,
  Rdata = NULL,
  out_var,
  nc_var,
  c_var = NULL,
  alpha = 0.05,
  constraint = NULL,
  nc_sign = NULL,
  c_sign = NULL,
  weights_validation = NULL,
  weights_x = NULL,
  weights_y = NULL,
  nbCores = 1,
  grid = 10,
  eps_default = 0.5,
  R2bound = NULL,
  unchanged = FALSE,
  ties = FALSE
)

Arguments

`validation`	dataset containing the joint distribution (Y,Xnc,Xc) where Y is the outcome, Xnc are the non commonly observed regressors, Xc are potential common regressors.
`Ldata`	dataset containing (Y,Xc) where Y is the outcome, Xc are potential common regressors. Default is NULL
`Rdata`	dataset containing (Xnc,Xc) where Xnc are the non commonly observed regressors, Xc are potential common regressors. Default is NULL.
`out_var`	label of the outcome variable Y.
`nc_var`	label of the non commonly observed regressors Xnc.
`c_var`	label of the commonly observed regressors Xc.
`alpha`	the level of the confidence intervals. Default is 0.05.
`constraint`	a vector indicating the different constraints in a vector of the size of X_c indicating the type of constraints, if any on f(X_c) : "concave", "concave", "nondecreasing", "nonincreasing", "nondecreasing_convex", "nondecreasing_concave", "nonincreasing_convex", "nonincreasing_concave", or NULL for none. Default is NULL, no contraints at all.
`nc_sign`	if sign restrictions on the non-commonly observed regressors Xnc: -1 for a minus sign, 1 for a plus sign, 0 otherwise. Default is NULL, i.e. no constraints.
`c_sign`	if sign restrictions on the commonly observed regressors: -1 for a minus sign, 1 for a plus sign, 0 otherwise. Default is NULL, i.e. no constraints.
`weights_validation`	the sampling weights for the full dataset (Y, Xnc,Xc). Default is NULL.
`weights_x`	the sampling weights for the dataset (Xnc,Xc). Default is NULL.
`weights_y`	the sampling weights for the dataset (Y,Xc). Default is NULL.
`nbCores`	number of cores for the parallel computation. Default is 1.
`grid`	the number of points for the grid search on epsilon. Default is 30. If NULL, then epsilon is taken fixed equal to eps_default.
`eps_default`	If grid =NULL, then epsilon is taken equal to eps_default.
`R2bound`	the lower bound on the R2 of the long regression if any. Default is NULL.
`unchanged`	Boolean indicating if the categories based on Xc must be kept unchanged (TRUE). Otherwise (FALSE), a thresholding approach is taken imposing that each value appears more than 10 times in both datasets and 0.01 per cent is the pooled one. Default is FALSE.
`ties`	Boolean indicating if there are ties in the dataset. Default is FALSE.

Value

a list containing, in order: - S: the point estimation used the statistic for the test

- S_ci: the CI on the upper bound

- stat: the statistic of the test

- the critical value at level alpha

- the p_value of the test

- the fit with the OLS on this sample

- n the sample size

- epsilon, the choice of epsilon we made

- r2long the r2 on the long regression

-r2short the r2 on the short regression

Examples


### Simulating joint distribution according to this DGP
n=200
Xnc = rnorm(n,0,1.5)
epsilon = rnorm(n,0,1)

## true value
beta0 =1
Y = Xnc*beta0 + epsilon
out_var = "Y"
nc_var = "Xnc"

# create the datasets
validation<- as.data.frame(cbind(Y,Xnc))
colnames(validation) <- c(out_var,nc_var)


############# Estimation #############
test = point_ident_test (validation, Ldata=NULL,Rdata=NULL,out_var,nc_var)

[Package RegCombin version 0.4.1 Index]