toleranceBound {TailRank}R Documentation

Upper tolerance bounds on normal quantiles

Description

The function toleranceBound computes theoretical upper tolerance bounds on the quantiles of the standard normal distribution. These can be used to produce reliable data-driven estimates of the quantiles in any normal distribution.

Usage

toleranceBound(psi, gamma, N)

Arguments

psi

A real number between 0 and 1 giving the desired quantile

gamma

A real number between 0 and 1 giving the desired tolerance bound

N

An integer giving the number of observations used to estimate the quantile

Details

Suppose that we collect NN observations from a normal distribution with unknown mean and variance, and wish to estimate the 9595th percentile of the distribution. A simple point estimate is given by τ=Xˉ+1.68s\tau = \bar{X} + 1.68s. However, only the mean of the distribution is less than this value 95%95\% of the time. When N=40N=40, for example, almost half of the time (43.5%43.5\%), fewer than 95%95\% of the observed values will be less than τ\tau. This problem is addressed by constructing a statistical tolerance interval (more precisely, a one-sided tolerance bound) that contains a given fraction, ψ\psi, of the population with a given confidence level, γ\gamma [Hahn and Meeker, 1991]. With enough samples, one can obtain distribution-free tolerance bounds [op.\ cit., Chapter 5]. For instance, one can use bootstrap or jackknife methods to estimate these bounds empirically.

Here, however, we assume that the measurements are normally distributed. We let Xˉ\bar{X} denote the sample mean and let ss denote the sample standard deviation. The upper tolerance bound that, 100γ%100 \gamma\% of the time, exceeds 100ψ%100 \psi\% of GG values from a normal distribution is approximated by XU=Xˉ+kγ,ψsX_U = \bar{X} + k_{\gamma,\psi}s, where

kγ,ψ=zψ+zψ2aba, k_{\gamma, \psi} = {z_{\psi} + \sqrt{z_{\psi}^2 - ab} \over a},

a=1z1γ22N2, a = 1-{z_{1-\gamma}^2\over 2N-2},

b=zψ2z1γ2N, b = z_{\psi}^2 - {z_{1-\gamma}^2\over N},

and, for any π\pi, zπz_\pi is the critical value of the normal distribution that is exceeded with probability π\pi [Natrella, 1963].

Value

Returns the value of kγ,ψk_{\gamma, \psi} with the property that the ψ\psith quantile will be less than the estimate XU=Xˉ+kγ,ψsX_U = \bar{X} + k_{\gamma,\psi}s (based on NN data points) at least 100γ%100 \gamma\% of the time.

Note

Lower tolerance bounds on quantiles with psi less than one-half can be obtained as XU=Xˉkγ,1ψsX_U = \bar{X} - k_{\gamma,1-\psi}s,

Author(s)

Kevin R. Coombes <krc@silicovore.com>

References

Natrella, M.G. (1963) Experimental Statistics. NBS Handbook 91, National Bureau of Standards, Washington DC.

Hahn, G.J. and Meeker, W.Q. (1991) Statistical Intervals: A Guide for Practitioners. John Wiley and Sons, Inc., New York.

Examples

N <- 50
x <- rnorm(N)
tolerance <- 0.90
quant <- 0.95
tolerance.factor <- toleranceBound(quant, tolerance, N)

# upper 90% tolerance bound for 95th percentile
tau <- mean(x) + sd(x)*tolerance.factor

# lower 90% tolerance bound for 5th percentile
rho <- mean(x) - sd(x)*tolerance.factor

# behavior of the tolerance bound as N increases
nn <- 10:100
plot(nn, toleranceBound(quant, tolerance, nn))

# behavior of the bound as the tolerance varies
xx <- seq(0.5, 0.99, by=0.01)
plot(xx, toleranceBound(quant, xx, N))

[Package TailRank version 3.2.2 Index]