Example2 {rsides} | R Documentation |
Subgroup search in a clinical trial with a binary endpoint
Description
Consider a clinical trial that was conducted to evaluate the efficacy and safety of an experimental treatment compared to placebo using a binary primary endpoint. The endpoint assumes the values of 0 and 1 (1 corresponds to the desirable outcome). The trial's data set (see the binary data set)) includes five biomarkers (four numeric biomarkers and one nominal biomarker) as well as several covariates that can be included in the primary analysis model. The trial's sponsor is interested in identifying a set of promising subgroups with enhanced treatment effect. Subgroup search will be performed using the basic SIDES procedure and two SIDEScreen procedures (Fixed and Adaptive SIDEScreen procedures).
The endpoint parameters will be defined as follows. The name of the outcome variable is specified using outcome_variable = "outcome"
and, since the endpoint is binary, the type
parameter is set to "binary"
. The desirable outcome for the endpoint is 1, which means that the direction
parameter is set to 1 (a higher value of the endpoint indicates beneficial effect).
Two sets of endpoint parameters will be defined to implement a simple evaluation of the treatment effect based on the Z-test for proportions as well as a more advanced analysis based on a logistic regression model with an adjustment for important prognostic covariates:
Analysis strategy 1: The
analysis_method
parameter is set to"Z-test for proportions"
.Analysis strategy 2: The
analysis_method
parameter is set to"Logistic regression"
. The covariates to be included in the logistic model need to be defined using thecont_covariates
andclass_covariates
arguments. To adjust for two continuous covariates (cont1, cont2) and two class/categorical covariate (class1, class2),cont_covariates
is set to"cont1, cont2"
andclass_covariates
is set to"class1, class2"
.
The data set parameters will be specified as follows:
The data set's name (
data_set
) isbinary
.The treatment variable's name (
treatment_variable_name
) is"treatment"
.The value of the treatment variable that defines the control arm
(treatment_variable_control_value
) is"0"
.The list of candidate biomarkers to be included in the subgroup search (
biomarker_names
) isc("biomarker1", "biomarker2", "biomarker3", "biomarker4", "biomarker5")
.The list of biomarker types (
biomarker_types
) isc(rep("numeric", 4), "nominal")
.
The following algorithm parameters will be used in the subgroup search procedures:
Search depth (
depth
) is 2, which means that patient subgroups will be defined in terms of one or two biomarkers. Note that this is the default value of this parameter and thus it does not need to be explicitly defined.Search width (
width
) is 2, i.e., only up to two best child subgroups will be retained for each parent group. This is the default value of this parameter and thus it does not need to be explicitly defined.Child-to-parent ratio (
gamma
) equals 1 for each search level. This is the default value of this parameter and thus it does not need to be explicitly defined.Minimum total number of patients in a promising subgroup (
min_subgroup_size
) is 60.Minimum number of unique values for continuous biomarkers for applying a percentile transformation (
nperc
) is 20, i.e., a percentile transformation will not be applied to a continuous biomarker if there are few than 20 unique values. This is the default value of this parameter and thus it does not need to be explicitly defined.Number of permutations to compute multiplicity-adjusted treatment effect p-values within promising subgroups (
n_perms_mult_adjust
) is 10.Number of processor cores to be used in parallel calculations (
ncores
) is 1.
The subgroup search will be performed using the basic SIDES procedure as well as two SIDEScreen procedures by calling the SubgroupSearch function and a comprehensive summary of subgroup search results will be generated by calling the GenerateReport function.
See Also
Examples
##############################################################################
# Primary endpoint parameters
# Analysis strategy 1: Analysis of the binary endpoint without
# accounting for any covariates
endpoint_parameters = list(outcome_variable = "outcome",
type = "binary",
label = "Outcome",
analysis_method = "Z-test for proportions",
direction = 1)
# Analysis strategy 2: Analysis of the continuous endpoint using an ANCOVA
# model that accounts for two continuous covariates (cont1, cont2) and
# two class/categorical covariates (class1, class2)
endpoint_parameters = list(outcome_variable = "outcome",
type = "binary",
label = "Outcome",
analysis_method = "Logistic regression",
cont_covariates = "cont1, cont2",
class_covariates = "class1, class2",
direction = 1)
##############################################################################
# Data set parameters
# Set of candidate biomarkers
biomarker_names = c("biomarker1", "biomarker2",
"biomarker3", "biomarker4",
"biomarker5")
# Biomarker type
biomarker_types = c(rep("numeric", 4), "nominal")
# Data set parameters
data_set_parameters = list(data_set = binary,
treatment_variable_name = "treatment",
treatment_variable_control_value = "0",
biomarker_names = biomarker_names,
biomarker_types = biomarker_types)
##############################################################################
# Algorithm parameters for the basic SIDES procedure
# Algorithm
subgroup_search_algorithm = "SIDES procedure"
# Number of permutations to compute multiplicity-adjusted treatment
# effect p-values within promising subgroups
n_perms_mult_adjust = 10
# Number of processor cores (use less or equal number of CPU cores on the current host)
ncores = 1
# Default values for the search depth (2), search width (2),
# maximum number of unique values for continuous biomarkers (20)
# Algorithm parameters
algorithm_parameters = list(
n_perms_mult_adjust = n_perms_mult_adjust,
min_subgroup_size = 60,
subgroup_search_algorithm = subgroup_search_algorithm,
ncores = ncores,
random_seed = 3011)
# Perform subgroup search
# List of all parameters
parameters = list(endpoint_parameters = endpoint_parameters,
data_set_parameters = data_set_parameters,
algorithm_parameters = algorithm_parameters)
results = SubgroupSearch(parameters)
# Simple summary of subgroup search results
results
# Generate a detailed Word-based report with a summary of subgroup search results
report_information = GenerateReport(results,
report_title = "Subgroup search report",
report_filename = tempfile(
"Binary endpoint (SIDES).docx",
fileext=".docx"
)
)
##############################################################################
# Algorithm parameters for the Fixed SIDEScreen procedure
# Algorithm
subgroup_search_algorithm = "Fixed SIDEScreen procedure"
# Number of permutations to compute multiplicity-adjusted treatment
# effect p-values within promising subgroups
n_perms_mult_adjust = 10
# Number of processor cores (use less or equal number of CPU cores on the current host)
ncores = 1
# Number of biomarkers selected for the second stage in the Fixed SIDEScreen algorithm
n_top_biomarkers = 3
# Default values for the search depth (2), search width (2),
# maximum number of unique values for continuous biomarkers (20)
# Algorithm parameters
algorithm_parameters = list(
n_perms_mult_adjust = n_perms_mult_adjust,
min_subgroup_size = 60,
subgroup_search_algorithm = subgroup_search_algorithm,
ncores = ncores,
n_top_biomarkers = n_top_biomarkers,
random_seed = 3011)
# Perform subgroup search
# List of all parameters
parameters = list(endpoint_parameters = endpoint_parameters,
data_set_parameters = data_set_parameters,
algorithm_parameters = algorithm_parameters)
results = SubgroupSearch(parameters)
# Simple summary of subgroup search results
results
# Generate a detailed Word-based report with a summary of subgroup search results
report_information = GenerateReport(results,
report_title = "Subgroup search report",
report_filename = tempfile(
"Binary endpoint (Fixed SIDEScreen).docx",
fileext=".docx"
)
)
##############################################################################
# Algorithm parameters for the Adaptive SIDEScreen procedure
# Algorithm
subgroup_search_algorithm = "Adaptive SIDEScreen procedure"
# Number of permutations to compute multiplicity-adjusted treatment
# effect p-values within promising subgroups
n_perms_mult_adjust = 10
# Number of processor cores (use less or equal number
# of CPU cores on the current host)
ncores = 1
# Multiplier for selecting biomarkers for the second stage
# in the Adaptive SIDEScreen algorithm
multiplier = 1
# Number of permutations for computing the null distribution
# of the maximum VI score in the Adaptive SIDEScreen algorithm
n_perms_vi_score = 100
# Default values for the search depth (2), search width (2),
# maximum number of unique values for continuous biomarkers (20)
# Algorithm parameters
algorithm_parameters = list(
n_perms_mult_adjust = n_perms_mult_adjust,
min_subgroup_size = 60,
subgroup_search_algorithm = subgroup_search_algorithm,
ncores = ncores,
multiplier = multiplier,
n_perms_vi_score = n_perms_vi_score,
random_seed = 3011)
# Perform subgroup search
# List of all parameters
parameters = list(endpoint_parameters = endpoint_parameters,
data_set_parameters = data_set_parameters,
algorithm_parameters = algorithm_parameters)
results = SubgroupSearch(parameters)
# Simple summary of subgroup search results
results
# Generate a detailed Word-based report with a summary of subgroup search results
GenerateReport(results,
report_title = "Subgroup search report",
report_filename = tempfile(
"Binary endpoint (Adaptive SIDEScreen).docx",
fileext=".docx"
)
)