R: Multidimensional Hill Tail Index Estimation

MultiHTailIndex {ExtremeRisks}

R Documentation

Multidimensional Hill Tail Index Estimation

Description

Computes point estimates and (1-\alpha)100\% confidence regions estimate of d-dimensional tail indices based on the Hill's estimator.

Usage

MultiHTailIndex(data, k, var=FALSE, varType="asym-Dep", bias=FALSE,
                alpha=0.05, plot=FALSE)

Arguments

`data`	A matrix of `(n \times d)` observations.
`k`	An integer specifying the value of the intermediate sequence `k_n`. See Details.
`var`	If `var=TRUE` then an estimate of the variance-covariance matrix of the tail indices estimators is computed.
`varType`	A string specifying the asymptotic variance to compute. By default `varType="asym-Dep"` specifies the variance estimator for `d` dependent marginal variables. See Details.
`bias`	A logical value. By default `biast=FALSE` specifies that no bias correction is computed. See Details.
`alpha`	A real in `(0,1)` specifying the confidence level `(1-\alpha)100\%` of the approximate confidence interval for the tail index.
`plot`	A logical value. By default `plot=FALSE` specifies that no graphical representation of the estimates is provided. See Details.

Details

For a dataset data of (n \times d) observations, where d is the number of variables and n is the sample size, the tail index \gamma of the d marginal distributions is estimated by applying the Hill estimator. Together with a point estimate a (1-\alpha)100\% confidence region is computed. The data are regarded as d-dimensional temporal independent observations coming from dependent variables.

k or k_n is the value of the so-called intermediate sequence k_n, n=1,2,\ldots. Its represents a sequence of positive integers such that k_n \to \infty and k_n/n \to 0 as n \to \infty. Practically, the value k_n specifies the number of k+1 larger order statistics to be used to estimate each marginal tail index \gamma_j for j=1,\ldots,d.
If var=TRUE then an estimate of the asymptotic variance-covariance matrix of the multivariate Hill estimator is computed. With independent observations the asymptotic variance-covariance matrix is estimated by the matrix \hat{\Sigma}^{LAWS}_{j,\ell}(\gamma,R)(1,1), see bottom formula in page 14 of Padoan and Stupfler (2020). This is achieved through varType="asym-Dep" which means d dependent marginal variables. When varType="asym-Ind" d marginal variables are regarded as independent and the returned variance-covariance matrix \hat{\Sigma}^{LAWS}_{j,\ell}(\gamma,R)(1,1) is a diagonal matrix with only variance terms.
If bias=TRUE then an estimate of the bias term of the Hill estimator is computed implementing using formula (4.2) in de Haan et al. (2016). In this case the asymptotic variance is not estimated using the formula in Haan et al. (2016) Theorem 4.1 but instead for simplicity the formula at the bottom of page 14 in Padoan and Stupfler (2020) is still used.
Given a small value \alpha\in (0,1) then an estimate of an asymptotic confidence region for \gamma_j, for j=1,\ldots,d, with approximate nominal confidence level (1-\alpha)100\%, is computed. The confidence intervals are computed exploiting the asymptotic normality of multivariate Hill estimator appropriately normalized (the logarithmic scale is not used), see Padoan and Stupfler (2020) for details.
If plot=TRUE then a graphical representation of the estimates is not provided.

Value

A list with elements:

gammaHat: an estimate of the d tail indices \gamma_j, for j=1,\ldots,d;
VarCovGHat: an estimate of the asymptotic variance-covariance matrix of the multivariate Hill estimator;
biasTerm: an estimate of bias term of the multivariate Hill estimator;
EstConReg: an estimate of the (1-\alpha)100\% confidence region.

Author(s)

Simone Padoan, simone.padoan@unibocconi.it, http://mypage.unibocconi.it/simonepadoan/; Gilles Stupfler, gilles.stupfler@ensai.fr, http://ensai.fr/en/equipe/stupfler-gilles/

References

Padoan A.S. and Stupfler, G. (2020). Joint inference on extreme expectiles for multivariate heavy-tailed distributions. arXiv e-prints arXiv:2007.08944, https://arxiv.org/abs/2007.08944.

de Haan, L., Mercadier, C. and Zhou, C. (2016). Adapting extreme value statistics to financial time series: dealing with bias and serial dependence. Finance and Stochastics, 20, 321-354.

de Haan, L. and Ferreira, A. (2006). Extreme Value Theory: An Introduction. Springer-Verlag, New York.

Examples

# Tail index estimation based on the multivariate Hill estimator obtained with
# n observations simulated from a d-dimensional random vector with a multivariate
# distribution with equal Frechet margins and a Clayton copula.
library(plot3D)
library(copula)
library(evd)

# distributional setting
copula <- "Clayton"
dist <- "Frechet"

# parameter setting
dep <- 3
dim <- 3
scale <- rep(1, dim)
shape <- rep(3, dim)
par <- list(dep=dep, scale=scale, shape=shape, dim=dim)

# Number of larger order statistics
k <- 150

# sample size
ndata <- 1000

# Simulates a sample from a multivariate distribution with equal Frechet
# marginals distributions and a Clayton copula
data <- rmdata(ndata, dist, copula, par)
scatter3D(data[,1], data[,2], data[,3])

# tail indices estimation
est <- MultiHTailIndex(data, k, TRUE)
est$gammaHat
est$VarCovGHat
# run the following command to see the graphical representation

 est <- MultiHTailIndex(data, k, TRUE, plot=TRUE)

[Package ExtremeRisks version 0.0.4 Index]