df_uc_omc_sel {multibias}R Documentation

Simulated data with uncontrolled confounding, outcome misclassification, and selection bias

Description

Data containing three sources of bias, three known confounders, and 100,000 observations. This data is obtained by sampling with replacement with probability = S from df_uc_omc_sel_source then removing the columns Y, U, and S. The resulting data corresponds to what a researcher would see in the real-world: a misclassified outcome, Ystar; missing data on a confounder U; and missing data for those not selected into the study (S=0). As seen in df_uc_omc_sel_source, the true, unbiased exposure-outcome odds ratio = 2.

Usage

df_uc_omc_sel

Format

A dataframe with 100,000 rows and 5 columns:

X

exposure, 1 = present and 0 = absent

Ystar

misclassified outcome, 1 = present and 0 = absent

C1

1st confounder, 1 = present and 0 = absent

C2

2nd confounder, 1 = present and 0 = absent

C3

3rd confounder, 1 = present and 0 = absent


[Package multibias version 1.5.1 Index]