R: Sample Size for Simultaneous Nonparametric Prediction...

predIntNparSimultaneousN {EnvStats}

R Documentation

Sample Size for Simultaneous Nonparametric Prediction Interval for Continuous Distribution

Description

Compute the sample size necessary for a nonparametric simultaneous prediction interval to achieve a specified confidence level based on one of three possible rules: k-of-m, California, or Modified California. Observations are assumed to come from from a continuous distribution.

Usage

  predIntNparSimultaneousN(n.median = 1, k = 1, m = 2, r = 1, rule = "k.of.m", 
    lpl.rank = ifelse(pi.type == "upper", 0, 1), 
    n.plus.one.minus.upl.rank = ifelse(pi.type == "lower", 0, 1), pi.type = "upper", 
    conf.level = 0.95, n.max = 5000, integrate.args.list = NULL, maxiter = 1000)

Arguments

`n.median`	vector of positive odd integers specifying the sample size associated with the future medians. The default value is `n.median=1` (i.e., individual observations). Note that all future medians must be based on the same sample size.
`k`	for the `k`-of-`m` rule (`rule="k.of.m"`), a vector of positive integers specifying the minimum number of observations (or medians) out of `m` observations (or medians) (all obtained on one future sampling “occassion”) the prediction interval should contain. The default value is `k=1`. This argument is ignored when the argument `rule` is not equal to `"k.of.m"`.
`m`	vector of positive integers specifying the maximum number of future observations (or medians) on one future sampling “occasion”. The default value is `m=2`, except when `rule="Modified.CA"`, in which case this argument is ignored and `m` is automatically set equal to `4`.
`r`	vector of positive integers specifying the number of future sampling “occasions”. The default value is `r=1`.
`rule`	character string specifying which rule to use. The possible values are `"k.of.m"` (`k`-of-`m` rule; the default), `"CA"` (California rule), and `"Modified.CA"` (modified California rule).
`lpl.rank`	vector of positive integers indicating the rank of the order statistic to use for the lower bound of the prediction interval. When `pi.type="lower"`, the default value is `lpl.rank=1` (implying the minimum value of `x` is used as the lower bound of the prediction interval). When `pi.type="upper"`, the argument `lpl.rank` is set equal to `0`.
`n.plus.one.minus.upl.rank`	vector of positive integers related to the rank of the order statistic to use for the upper bound of the prediction interval. A value of `n.plus.one.minus.upl.rank=1` (the default) means use the first largest value, and in general a value of `n.plus.one.minus.upl.rank=i` means use the `i`'th largest value. If `pi.type="lower"`, this argument is set equal to `0`.
`pi.type`	character string indicating what kind of prediction interval to compute. The possible values are `"two.sided"` (the default), `"lower"`, and `"upper"`.
`conf.level`	numeric vector of values between 0 and 1 indicating the confidence level associated with the prediction interval. The default value is `conf=0.95`.
`n.max`	numeric scalar indicating the maximum sample size to consider. This argument is used in the search algorithm to determine the required sample size. The default value is `n.max=5000`.
`integrate.args.list`	list of arguments to supply to the `integrate` function. The default value is `NULL`.
`maxiter`	positive integer indicating the maximum number of iterations to use in the `uniroot` search algorithm. The default value is `maxiter=1000`.

Details

If the arguments k, m, r, lpl.rank, and n.plus.one.minus.upl.rank are not all the same length, they are replicated to be the same length as the length of the longest argument.

The function predIntNparSimultaneousN computes the required sample size n by solving Equation (8), (9), or (10) in the help file for predIntNparSimultaneous for n, depending on the value of the argument rule.

Note that when rule="k.of.m" and r=1, this is equivalent to a standard nonparametric prediction interval and you can use the function predIntNparN instead.

Value

vector of positive integers indicating the required sample size(s) for the specified nonparametric simultaneous prediction interval(s).

Note

See the help file for predIntNparSimultaneous.

Author(s)

Steven P. Millard (EnvStats@ProbStatInfo.com)

References