RATEOmnibusTest {EpiForsk}R Documentation

RATE based omnibus test of heterogeneity

Description

Provides the P-value for a formal test of heterogeneity based on the RATE statistic by Yadlowsky et al.

Usage

RATEOmnibusTest(
  forest,
  level = 0.95,
  target = c("AUTOC", "QINI"),
  q = seq(0.1, 1, 0.1),
  R = 500,
  num.threads = 1,
  seed = NULL,
  honesty = TRUE,
  stabilize.splits = TRUE,
  ...
)

Arguments

forest

An object of class causal_forest, as returned by causal_forest(), with binary treatment.

level

numeric, level of RATE confidence interval.

target

character, see rank_average_treatment_effect.

q

numeric, see rank_average_treatment_effect.

R

integer, see rank_average_treatment_effect

num.threads

passed to causal_forest. Number of threads used in training. Default value is 1.

seed

numeric, either length 1, in which case the same seed is used for both new forests, or length 2, to train each forest with a different seed. Default is NULL, in which case two seeds are randomly sampled.

honesty

Boolean, TRUE if forest was trained using honesty. Otherwise FALSE. Argument controls if honesty is used to train the new forests on the random half-samples, so misspecification will lead to invalid results. Default is TRUE, the default in causal_forest.

stabilize.splits

Boolean, TRUE if forest was trained taking treatment into account when determining the imbalance of a split. Otherwise FALSE. Argument controls if treatment is taken into account when determining the imbalance of a split during training of the new forests on the random half-samples, so misspecification will lead to invalid results. Default is TRUE, the default in causal_forest.

...

additional arguments for causal_forest. By default, the arguments used by forest will be used to train new forests on the random half-samples. Arguments provided through ... will override these. Note that sample.weights and clusters are passed to both causal_forest and rank_average_treatment_effect.fit.

Details

RATE evaluates the ability of a provided prioritization rule to prioritize treatment to subjects with a large benefit. In order to test for heterogeneity, we want estimated CATE's to define the prioritization rule. However, to obtain valid inference the prioritization scores must be constructed independently of the evaluating forest training data. To accomplice this, we split the data and train separate forests on each part. Then we estimate double robust scores on the observations used to train each forest, and obtain prioritization scores by predicting CATE's with each forest on the samples not used for training.

Value

A list of class rank_average_treatment_effect with elements

Author(s)

KIJA

References

Yadlowsky S, Fleming S, Shah N, Brunskill E, Wager S. Evaluating Treatment Prioritization Rules via Rank-Weighted Average Treatment Effects. 2021. http://arxiv.org/abs/2111.07966.

Examples


n <- 800
p <- 3
X <- matrix(rnorm(n * p), n, p)
W <- rbinom(n, 1, 0.5)
event_prob <- 1 / (1 + exp(2 * (pmax(2 * X[, 1], 0) * W - X[, 2])))
Y <- rbinom(n, 1, event_prob)
clusters <- sample(1:4, n, replace = TRUE)
cf <- grf::causal_forest(X, Y, W, clusters = clusters)
rate <- RATEOmnibusTest(cf, target = "QINI")
rate



[Package EpiForsk version 0.1.1 Index]