hr05CriticalValue {CerioliOutlierDetection}R Documentation

Hardin and Rocke (2005) Critical Value for Testing MCD-based Mahalanobis Distances

Description

Hardin and Rocke (2005) provide an approximate F distribution for testing whether Mahalanobis distances calculated using the MCD dispersion estimate are unusually large, and hence, indicative of outliers in the data.

Usage

hr05CriticalValue(em, p.dim, signif.alpha)

Arguments

em

(Numeric) Degrees of freedom for Wishart distribution approximation to the MCD scatter matrix.

p.dim

(Integer) Dimension of the data, i.e., number of variables.

signif.alpha

(Numeric) Significance level for testing the null hypothesis

Details

Hardin and Rocke (2005) derived an F distributional approximation for the Mahalanobis distances of the observations that were excluded from the MCD calculation; see equation 3.2 on page 938 of the paper.

It is assumed here that the MCD covariance estimate used in the Mahalanobis distance calculation was adjusted by the consistency factor, so it is not included in the calculation here. (If one needs the consistency factor it is returned by the function ch99AsymptoticDF in this package or by the function .MCDcons in the robustbase package.)

Value

The appropriate cutoff value (from the F distributional approximation) for testing whether a Mahalanobis distance is unusually large at the specified significance level.

Note

It can happen that one of the F distribution paramaters, m - p + 1, is non-positive, in which case qf will return NaN. hr05CriticalValue will issue a warning in this case, and return NA.

Author(s)

Written and maintained by Christopher G. Green <christopher.g.green@gmail.com>

References

J. Hardin and D. M. Rocke. The distribution of robust distances. Journal of Computational and Graphical Statistics, 14:928-946, 2005. doi:10.1198/106186005X77685

See Also

hr05AdjustedDF, hr05CutoffMvnormal

Examples

hr05CriticalValue( hr05AdjustedDF( 1000, 20 ), 20, 0.05 ) 

[Package CerioliOutlierDetection version 1.1.13 Index]