df_sel {multibias}    R Documentation
Simulated data with selection bias
Description
Data containing one source of bias, three known confounders, and 100,000
observations. These data are obtained by sampling with replacement from
df_sel_source, with sampling probability = S, and then removing the S column.
The resulting data correspond to what a researcher would see in the real
world: data are missing for those not selected into the study (S = 0).
As seen in df_sel_source, the true, unbiased exposure-outcome odds ratio = 2.
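The sampling step described above can be sketched in base R. This is only an illustration on a small toy data frame: the name toy_source, the sample size, and every coefficient in the data-generating code are assumptions made for the example, not the values used to build df_sel_source.

```r
set.seed(1)
n <- 10000

# Hypothetical source data. The coefficients are illustrative only and are
# NOT the ones used to generate df_sel_source.
C1 <- rbinom(n, 1, 0.5)
C2 <- rbinom(n, 1, 0.2)
C3 <- rbinom(n, 1, 0.8)
X  <- rbinom(n, 1, plogis(-2 + 0.5 * C1 + 0.5 * C2 + 0.5 * C3))
Y  <- rbinom(n, 1, plogis(-2 + log(2) * X + 0.5 * C1 + 0.5 * C2 + 0.5 * C3))
S  <- rbinom(n, 1, plogis(-1 + 1.5 * X + 1.5 * Y))  # 1 = selected, 0 = not
toy_source <- data.frame(X, Y, C1, C2, C3, S)

# Sample row indices with replacement, weighting each row by S.
# Rows with S = 0 have zero weight and can never be drawn.
idx <- sample(seq_len(n), size = n, replace = TRUE, prob = toy_source$S)

# Keep the sampled rows and drop the S column, as in the description.
toy_sel <- toy_source[idx, c("X", "Y", "C1", "C2", "C3")]
rownames(toy_sel) <- NULL
```

Because S is used as the sampling weight, the resulting toy_sel has the same number of rows as toy_source but contains only individuals with S = 1.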
Usage
df_sel
Format
A data frame with 100,000 rows and 5 columns:
- X: exposure, 1 = present and 0 = absent
- Y: outcome, 1 = present and 0 = absent
- C1: 1st confounder, 1 = present and 0 = absent
- C2: 2nd confounder, 1 = present and 0 = absent
- C3: 3rd confounder, 1 = present and 0 = absent
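As a usage sketch, the columns listed above can be passed to a confounder-adjusted logistic regression that ignores selection. The helper naive_or is a hypothetical name introduced here for illustration and is not part of multibias. The example assumes the multibias package is installed and is skipped otherwise.

```r
# Hypothetical helper (not part of multibias): exposure-outcome odds ratio
# from a logistic regression adjusted for C1, C2, and C3, with no
# adjustment for selection.
naive_or <- function(data) {
  fit <- glm(Y ~ X + C1 + C2 + C3,
             family = binomial(link = "logit"),
             data = data)
  exp(coef(fit)[["X"]])
}

# Run on df_sel only if the package is available.
if (requireNamespace("multibias", quietly = TRUE)) {
  df <- multibias::df_sel
  str(df)               # inspect the 5 binary columns
  print(naive_or(df))   # odds ratio estimate that ignores selection
}
```

Since this model does not account for selection, its estimate would be expected to differ from the true odds ratio of 2 stated in the description.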
[Package multibias version 1.5.1 Index]