R: Fit weighted cumulative exposure models

WCE {WCE}

R Documentation

Fit weighted cumulative exposure models

Description

WCE implements a flexible method for modeling cumulative effects of time-varying exposures, weighted according to their relative proximity in time, and represented by time-dependent covariates. The current implementation estimates the weight function in the Cox proportional hazards model. The function that assigns weights to doses taken in the past is estimated using cubic regression splines.
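
As an illustration of the idea only (not the package's internal code), the weighted cumulative exposure at time u can be thought of as a sum of past doses, each multiplied by a weight that depends on how long ago it was taken. The sketch below assumes a hand-chosen, hypothetical weight function `w_toy` and an invented dose vector; in `WCE` the weights are not fixed in advance but estimated from the data with cubic regression splines.

```r
# Toy doses for one individual over 10 time units (invented values)
dose <- c(0, 1, 1, 0, 2, 2, 0, 0, 1, 0)

# Hypothetical weight function: decays linearly over a 5-unit window.
# lag = 1 is the most recent time unit, lag = cutoff the most remote one.
cutoff <- 5
w_toy <- function(lag) (cutoff - lag + 1) / cutoff

# Weighted cumulative exposure at time u: sum of the doses inside the
# window, each multiplied by the weight attached to its lag.
wce_at <- function(u, dose, cutoff, w) {
  lags <- seq_len(min(u, cutoff))   # window is truncated at the start of follow-up
  sum(w(lags) * dose[u - lags + 1])
}

wce <- sapply(seq_along(dose), wce_at, dose = dose, cutoff = cutoff, w = w_toy)
wce
```

Whether the current time unit is included in the window, and how the window is truncated early in follow-up, are assumptions of this sketch.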

Usage

WCE(
  data,
  analysis = "Cox",
  nknots,
  cutoff,
  constrained = FALSE,
  aic = FALSE,
  MatchedSet = NULL,
  id,
  event,
  start,
  stop,
  expos,
  covariates = NULL,
  controls = NULL,
  ...
)

Arguments

`data`	A data frame in an interval (long) format, in which each line corresponds to one and only one time unit for a given individual.
`analysis`	Character string. One of 'Cox' (Cox proportional hazards model), 'NCC' (conditional logistic regression for nested case-control data) or 'CC' (logistic regression for case-control data). Currently only 'Cox' is implemented, calling the `coxph` function of the survival package.
`nknots`	A scalar or a vector. Corresponds to the number(s) of interior knots for the cubic splines to estimate the weight function. For example, if `nknots` is set to 2, then a model with two interior knots is fitted. If `nknots` is set to 1:3 or alternatively c(1,2,3) then three models with 1, 2, and 3 interior knots, respectively, are fitted.
`cutoff`	Integer. Time window over which the WCE model is estimated. Corresponds to the length of the estimated weight function.
`constrained`	Controls whether the weight function should be constrained to smoothly go to zero. Set to FALSE for unconstrained models, to 'Right' or 'R' to constrain the weight function to smoothly go to zero for exposures remote in time, and to 'Left' or 'L' to constrain the weight function to start at zero for the current values.
`aic`	Logical. If TRUE, the AIC is used to select the best-fitting model among those estimated for the different numbers of interior knots requested with `nknots`. If FALSE, the BIC is used instead. Defaults to FALSE (BIC). Note that the BIC implemented in `WCE` is the version suggested by Volinsky and Raftery in Biometrics (2000), which corresponds to BIC = -2 * log(PL) + p * log(d), where PL is the model's partial likelihood, p is the number of estimated parameters and d is the number of uncensored events. See Sylvestre and Abrahamowicz (2009) for more details.
`MatchedSet`	Argument required for 'NCC' analysis only. Corresponds to the variable in `data` that specifies the matched sets for the conditional logistic regression. Currently not implemented.
`id`	Name of the variable in `data` corresponding to the identification of subjects.
`event`	Name of the variable in `data` corresponding to event indicator. Must be coded 1 = event and 0 = no event.
`start`	Name of the variable in `data` corresponding to the starting time of the interval. Corresponds to the `time` argument of the `Surv` function in the survival package.
`stop`	Name of the variable in `data` corresponding to the ending time of the interval. Corresponds to the `time2` argument of the `Surv` function in the survival package.
`expos`	Name of the variable in `data` corresponding to the exposure variable.
`covariates`	Optional. Character vector giving the name(s) of the variable(s) in `data` to be included in the model as covariate(s). Defaults to NULL, which corresponds to fitting model(s) without covariates.
`controls`	List corresponding to the control parameters to be passed to the `coxph` function. See `coxph.control` for more details.
`...`	Optional; other parameters to be passed through to `WCE`.
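
To make the model-selection criterion controlled by `aic` concrete, the following sketch computes the AIC and the events-based BIC from log partial likelihoods, using the sign convention in which smaller values are better (consistent with the `which.min` selection in the Examples). The helper `info_criterion` is hypothetical and not part of the package, and all numbers are invented for illustration.

```r
# Hypothetical helper: information criterion from a log partial likelihood.
#   logPL : log partial likelihood of the fitted model
#   p     : number of estimated parameters
#   d     : number of uncensored events
info_criterion <- function(logPL, p, d, aic = FALSE) {
  penalty <- if (aic) 2 else log(d)   # AIC penalises by 2, this BIC by log(d)
  -2 * logPL + p * penalty
}

# Invented values for three candidate models (1, 2 and 3 interior knots)
logPL <- c(-2010.4, -2008.9, -2008.7)
p     <- c(5, 6, 7)
d     <- 383

bic <- info_criterion(logPL, p, d)
aic <- info_criterion(logPL, p, d, aic = TRUE)

# Index of the preferred model under each criterion
which.min(bic)
which.min(aic)
```

Because log(d) exceeds 2 whenever there are more than about seven events, the BIC penalises additional knots more heavily than the AIC and will tend to select simpler weight functions.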

Details

The current implementation of the WCE function does not allow missing values in the `id`, `event`, `start`, `stop` and `expos` variables. Intervals in `data` determined by `start` and `stop` are assumed to be open on the left and closed on the right, (start, stop]. Intervals for a given individual (`id`) must not overlap and must cover the entire follow-up for that individual. The `start` and `stop` values for a given interval must not be equal. Delayed entry is not implemented in this version of the WCE function, so all individuals must start their follow-up at the same `start` value. The interior knots are placed at quantiles of the exposure variable distribution.
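
The requirements above can be checked before fitting. The following is a minimal sketch in base R, assuming the column names used in the Examples section (`Id`, `Event`, `Start`, `Stop`, `dose`); `check_wce_data` is a hypothetical helper, not a function of the package, and the toy data are invented.

```r
# Toy data in interval format: one row per individual per time unit
toy <- data.frame(
  Id    = rep(1:2, each = 3),
  Start = rep(0:2, times = 2),
  Stop  = rep(1:3, times = 2),
  Event = c(0, 0, 1, 0, 0, 0),
  dose  = c(1, 0, 2, 0, 0, 1)
)

# Hypothetical helper returning one logical flag per requirement
check_wce_data <- function(d) {
  d <- d[order(d$Id, d$Start), ]
  by_id <- split(d, d$Id)
  c(
    # no missing values in the key variables
    no_missing   = !any(is.na(d[, c("Id", "Event", "Start", "Stop", "dose")])),
    # intervals must have non-zero length (here: Stop strictly after Start)
    positive_len = all(d$Stop > d$Start),
    # each interval begins where the previous one ended (no gaps, no overlaps)
    contiguous   = all(vapply(by_id, function(x) {
      all(x$Start[-1] == x$Stop[-nrow(x)])
    }, logical(1))),
    # no delayed entry: every individual starts follow-up at the same time
    common_start = length(unique(tapply(d$Start, d$Id, min))) == 1
  )
}

check_wce_data(toy)
```

Any flag returned as FALSE points to a requirement that the data do not meet.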

Value

A list of elements:

`knotsmat`	List of vectors of knots used for the spline modelling of the weight function(s).

`WCEmat`	Matrix of the estimated weight function(s). Each row corresponds to one estimated weight function, that is, one row per value requested in `nknots`. The number of columns corresponds to the value of the argument `cutoff`.

`loglik`	Partial likelihood for each estimated model.

`est`	List of vectors of estimated coefficients for the artificial time-dependent variables used to fit the WCE model(s).

`vcovmat`	List of variance-covariance matrices estimated for each model.

`SED`	List of vectors of estimated standard errors of the estimated coefficients of the artificial time-dependent variables used to fit each WCE model.

`beta.hat.covariates`	List of vectors of estimated coefficients for the covariates.

`se.covariates`	List of vectors of standard errors of the estimated coefficients for the covariates.

`covariates`	Names of the covariates used in the estimation.

`constrained`	Indicator of whether the model(s) was(were) unconstrained, right-constrained or left-constrained.

`nevents`	Number of events.

`aic`	Logical value corresponding to the `aic` argument.

`info.criterion`	Value of the AIC or BIC for each model estimated.

`analysis`	Value of the `analysis` argument.

`...`	Optional, additional argument(s).

Note

The print method for a WCE object returns the estimated WCE function(s), the number of events, the partial likelihoods, the AIC or BIC values, the matrix of coefficient estimates for the covariates (if any) and the matrix of standard error estimates for the covariates (if any).

References

Sylvestre, M. P., & Abrahamowicz, M. (2009). Flexible modeling of the cumulative effects of time-dependent exposures on the hazard. Statistics in Medicine, 28(27), 3437-3453.

Examples


wce <- WCE(drugdata, "Cox", 1, 90, constrained = "R", id = "Id", event = "Event",
           start = "Start", stop = "Stop", expos = "dose", covariates = c("age", "sex"))
## Not run: 
 # Confidence intervals for HR, as well as pointwise confidence bands
 # for the estimated weight function can be obtained via bootstrap.

 # Set the number of bootstrap resamples
 # (set to 5 for demonstration purposes; should be much higher in practice)
 B <- 5

 # Obtain the list of ID for sampling
 ID <- unique(drugdata$Id)

 # Prepare a matrix and a vector to store the estimated weight function
 # and HR of the best-fitting model for each bootstrap resample
 boot.WCE <- matrix(NA, ncol = 90, nrow=B)
 boot.HR <- rep(NA, B)

 # Sample IDs with replacement
 for (i in 1:B){
   ID.resamp <- sort(sample(ID, replace = TRUE))
   datab <- drugdata[drugdata$Id %in% ID.resamp,] # select obs. but duplicated Id are ignored

   # deal with duplicated Id and assign them new Id
   step <- 1
   repeat {
     # select duplicated Id in ID.resamp
     ID.resamp <- ID.resamp[duplicated(ID.resamp)]
     if (length(ID.resamp)==0) break # stop when no more duplicated Id to deal with
     # select obs. but remaining duplicated Id are ignored
     subset.dup <- drugdata[drugdata$Id %in% ID.resamp,]
     # assign new Id to duplicates
     subset.dup$Id <- subset.dup$Id + step * 10^ceiling(log10(max(drugdata$Id)))
     # 10^ceiling(log10(max(drugdata$Id))) is the smallest power of 10
     # at or above the maximum Id in the original data
     datab <- rbind(datab, subset.dup)
     step <- step+1
   }

   mod <- WCE(data = datab, analysis = "Cox", nknots = 1:3, cutoff = 90,
   constrained = "R", aic = FALSE, MatchedSet = NULL, id = "Id",
   event = "Event", start = "Start", stop = "Stop", expos = "dose",
   covariates = c("sex", "age"))

   # return best WCE estimates and corresponding HR
   best <- which.min(mod$info.criterion)
   boot.WCE[i,] <- mod$WCEmat[best,]
   boot.HR[i] <- HR.WCE(mod, rep(1, 90), rep(0, 90))
 }

 # Summarize bootstrap results using the percentile method
 # (5th and 95th percentiles, i.e. 90% intervals)
 apply(boot.WCE, 2, quantile, probs = c(0.05, 0.95))
 quantile(boot.HR, probs = c(0.05, 0.95))

## End(Not run)

[Package WCE version 1.0.3 Index]