R: Compute the regression of an array on another along one...

Regression {s2dv}

R Documentation

Compute the regression of an array on another along one dimension.

Description

Compute the regression of the array 'datay' on the array 'datax' along the 'reg_dim' dimension by least square fitting (default) or self-defined model. The function provides the slope of the regression, the intercept, and the associated p-value and confidence interval. The filtered datay from the regression onto datax is also provided.
The p-value relies on the F distribution, and the confidence interval relies on the student-T distribution.

Usage

Regression(
  datay,
  datax,
  reg_dim = "sdate",
  formula = y ~ x,
  pval = TRUE,
  conf = TRUE,
  sign = FALSE,
  alpha = 0.05,
  na.action = na.omit,
  ncores = NULL
)

Arguments

`datay`	An numeric array as predictand including the dimension along which the regression is computed.
`datax`	An numeric array as predictor. The dimension should be identical as parameter 'datay'.
`reg_dim`	A character string indicating the dimension along which to compute the regression. The default value is 'sdate'.
`formula`	An object of class "formula" (see function `link[stats]{lm}`).
`pval`	A logical value indicating whether to retrieve the p-value or not. The default value is TRUE.
`conf`	A logical value indicating whether to retrieve the confidence intervals or not. The default value is TRUE.
`sign`	A logical value indicating whether to compute or not the statistical significance of the test The default value is FALSE.
`alpha`	A numeric of the significance level to be used in the statistical significance test. The default value is 0.05.
`na.action`	A function or an integer. A function (e.g., na.omit, na.exclude, na.fail, na.pass) indicates what should happen when the data contain NAs. A numeric indicates the maximum number of NA position (it counts as long as one of datay and datax is NA) allowed for compute regression. The default value is na.omit-
`ncores`	An integer indicating the number of cores to use for parallel computation. Default value is NULL.

Value

A list containing:

`$regression`	A numeric array with same dimensions as parameter 'datay' and 'datax' except the 'reg_dim' dimension, which is replaced by a 'stats' dimension containing the regression coefficients from the lowest order (i.e., intercept) to the highest degree. The length of the 'stats' dimension should be `polydeg + 1`.
`$conf.lower`	A numeric array with same dimensions as parameter 'daty' and 'datax' except the 'reg_dim' dimension, which is replaced by a 'stats' dimension containing the lower value of the `siglev`% confidence interval for all the regression coefficients with the same order as $regression. The length of 'stats' dimension should be `polydeg + 1`. Only present if `conf = TRUE`.
`$conf.upper`	A numeric array with same dimensions as parameter 'daty' and 'datax' except the 'reg_dim' dimension, which is replaced by a 'stats' dimension containing the upper value of the `siglev`% confidence interval for all the regression coefficients with the same order as $regression. The length of 'stats' dimension should be `polydeg + 1`. Only present if `conf = TRUE`.
`$p.val`	A numeric array with same dimensions as parameter 'daty' and 'datax' except the 'reg_dim' dimension, The array contains the p-value.
`sign`	A logical array of the statistical significance of the regression with the same dimensions as $regression. Only present if `sign = TRUE`.
`$filtered`	A numeric array with the same dimension as paramter 'datay' and 'datax', the filtered datay from the regression onto datax along the 'reg_dim' dimension.

Examples

# Load sample data as in Load() example:
example(Load)
datay <- sampleData$mod[, 1, , ]
names(dim(datay)) <- c('sdate', 'ftime')
datax <- sampleData$obs[, 1, , ]
names(dim(datax)) <- c('sdate', 'ftime')
res1 <- Regression(datay, datax, formula = y~poly(x, 2, raw = TRUE))
res2 <- Regression(datay, datax, alpha = 0.1)

[Package s2dv version 2.0.0 Index]