| gpdEst {RecordLinkage} | R Documentation | 
Estimate Threshold from Pareto Distribution
Description
Fits a generalized Pareto distribution to the record pair weights and uses a quantile of the fitted model as the classification threshold.
Usage
gpdEst(Wdata, thresh = -Inf, quantil = 0.95)
Arguments
| Wdata | A numeric vector representing weights of record pairs. | 
| thresh | Threshold for exceedances. Only weights that exceed this value are used for the fit. | 
| quantil | A real number between 0 and 1. The desired quantile. | 
Details
A generalized Pareto distribution (GPD) is fitted to the weights that
exceed thresh. The estimated parameters shape and scale are then used
to calculate the classification threshold by the formula
\mathit{thresh} + \frac{\mathit{scale}}{\mathit{shape}} \left( \left( \frac{n}{k} (1 - \mathit{quantil}) \right)^{-\mathit{shape}} - 1 \right)
where n is the total number of weights and k the number of
exceedances.
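
The formula can be checked numerically with a short base R sketch. The helper name gpd_quantile_threshold and the parameter values are illustrative assumptions and not part of RecordLinkage; in gpdEst, shape and scale come from the GPD fit rather than being supplied by hand.

```r
# Hypothetical helper (not part of RecordLinkage): evaluates the quantile
# formula from the Details section for already-estimated GPD parameters.
gpd_quantile_threshold <- function(thresh, scale, shape, n, k, quantil = 0.95) {
  # Basic sanity checks; the formula as written divides by shape,
  # so shape == 0 is excluded in this sketch.
  stopifnot(k > 0, k <= n, quantil > 0, quantil < 1, shape != 0)
  thresh + scale / shape * ((n / k * (1 - quantil))^(-shape) - 1)
}

# Made-up values: 100 weights in total, 20 of them exceeding thresh = 0.
result <- gpd_quantile_threshold(thresh = 0, scale = 1, shape = 0.5,
                                 n = 100, k = 20, quantil = 0.95)
print(result)
```

With these values, n / k * (1 - quantil) is 0.25 and 0.25^(-0.5) is 2, so the threshold evaluates to 0 + (1 / 0.5) * (2 - 1) = 2, up to floating-point rounding.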
Value
A real number representing the resulting classification threshold. It is assured that the threshold lies in a reasonable range.
Author(s)
Murat Sariyar
See Also
getParetoThreshold for the user-level function.