predExpectiles {ExtremeRisks} | R Documentation |
Computes a point and interval estimate of the expectile at the extreme level (Expectile Prediction).
predExpectiles(data, tau, tau1, method="LAWS", tailest="Hill", var=FALSE, varType="asym-Dep", bias=FALSE, bigBlock=NULL, smallBlock=NULL, k=NULL, alpha_n=NULL, alpha=0.05)
data |
A vector of (1 x n) observations. |
tau |
A real in (0,1) specifying the intermediate level τ_n. See Details. |
tau1 |
A real in (0,1) specifying the extreme level τ'_n. See Details. |
method |
A string specifying the method used to estimate the expecile. By default |
tailest |
A string specifying the tail index estimator. By default |
var |
If |
varType |
A string specifying the type of asymptotic variance to compute. By default |
bias |
A logical value. By default |
bigBlock |
An interger specifying the size of the big-block used to estimaste the asymptotic variance. See Details. |
smallBlock |
An interger specifying the size of the small-block used to estimaste the asymptotic variance. See Details. |
k |
An integer specifying the value of the intermediate sequence k_n. See Details. |
alpha_n |
A real in (0,1) specifying the quantile's extreme level to be use in order to estimate the expectile's extreme level. |
alpha |
A real in (0,1) specifying the confidence level (1-α)100\% of the approximate confidence interval for the expecile at the intermedite level. |
For a dataset data
of sample size n, an estimate of the τ'_n-th expectile is computed. The estimation of the expectile at the extreme level tau1
(τ'_n) is meant to be a prediction beyond the observed sample. Two estimators are available: the so-called Least Asymmetrically Weighted Squares (LAWS) based estimator and the Quantile-Based (QB) estimator. The definition of both estimators depends on the estimation of the tail index γ. Here, γ is estimated using the Hill estimation (see HTailIndex for details) or in alternative using the the expectile based estimator (see EBTailIndex).
The observations can be either independent or temporal dependent. See Section 3.2 in Padoan and Stupfler (2020) for details.
The so-called intermediate level tau
or τ_n is a sequence of positive reals such that τ_n -> 1 as n -> ∞. Practically, τ_n in (0,1) is the ratio between N (Numerator) and D (Denominator). Where N is the empirical mean distance of the τ_n-th expectile from the observations smaller than it, and D is the empirical mean distance of τ_n-th expectile from all the observations.
The so-called extreme level tau1
or τ'_n is a sequence of positive reals such that τ'_n -> 1 as n -> ∞.
The value (1-tau'_n) in (0,1) is meant to be a small tail probability such that (1-τ'_n)=1/n or (1-τ'_n) < 1/n. It is also assumed that n(1-τ'_n) -> C as n -> ∞, where C is a positive finite constant. Typically, C in (0,1) so it is expected that there are no observations in a data sample that are greater than the expectile at the extreme level τ_n'.
When method='LAWS'
, then the τ'_n-th expectile is estimated using the LAWS based estimator. When method='QB'
, the expectile is instead estimated using the QB esimtator. The definition of both estimators depend on the estimation of the tail index γ. When tailest='Hill'
then γ is estimated using the Hill estimator (see HTailIndex). When tailest='ExpBased'
, then γ is estimated using the expectile based estimator (see EBTailIndex). See Section 3.2 in Padoan and Stupfler (2020) for details.
If var=TRUE
then an esitmate of the asymptotic variance of the tau'_n-th expectile is computed. Notice that the estimation of the asymptotic variance is only available when γ is estimated using the Hill estimator (see HTailIndex). With independent observations the asymptotic variance is estimated by \hat{γ}^2, see the remark below Theorem 3.5 in Padoan and Stupfler (2020). This is achieved through varType="asym-Ind"
. With serial dependent observations the asymptotic variance is estimated by the formula in Throrem 3.5 of Padoan and Stupfler (2020). This is achieved through varType="asym-Dep"
. See Section 3.2 in Padoan and Stupfler (2020) for details. In this latter case the computation of the serial dependence is based on the "big blocks seperated by small blocks" techinque which is a standard tools in time series, see e.g. Leadbetter et al. (1986).
The size of the big and small blocks are specified by the parameters bigBlock
and smallBlock
, respectively.
If bias=TRUE
then γ is estimated using formula (4.2) of Haan et al. (2016). This is used by the LAWS and QB estimators. Furthermore, the τ'_n–th quantile is estimated using the formula in page 330 of de Haan et al. (2016). This provides a bias corrected version of the Weissman estimator. This is used by the QB estimator.
However, in this case the asymptotic variance is not estimated using the formula in Haan et al. (2016) Theorem 4.2. Instead, for simplicity the asymptotic variance is estimated by the formula in Corollary 3.8, with serial dependent observations, and \hat{γ}^2 with independent observation (see e.g. de Drees 2000, for the details).
k
or k_n is the value of the so-called intermediate sequence k_n, n=1,2,.... Its represents a sequence of positive integers such that k_n -> ∞ and k_n/n -> 0 as n -> ∞. Practically, when tau=NULL
and method='LAWS'
, then τ_n=1-k_n/n is the intermediate level of the expectile to be stimated. The latter is also used to estimate the tail index when tailest='ExpBased'
. Instead, if tailest='Hill'
, then k_n specifies the number of k
+1 larger order statistics used in the definition of the Hill estimator. Differently, When tau=NULL
and method='QB'
, then τ_n=1-k_n/n is the intermediate level of the quantile to be stimated and of the expectile to be stimated when tailest='ExpBased'
. Instead, when tailest='Hill'
it is the numer of k
+1 larger order statistics used in the definition of the Hill estimator.
If quantile's extreme level is provided by alpha_n
, then expectile's extreme level tau'_n(α_n) is replaced by tau'_n(α_n) which is esitmated using the method described in Section 6 of Padoan and Stupfler (2020). See estExtLevel for details.
Given a small value α\in (0,1) then an estimate of an asymptotic confidence interval for tau'_n-th expectile, with approximate nominal confidence level (1-α)100\%, is computed. The confidence intervals are computed exploiting formula (10) and (11) in Padoan and Stupfler (2020) and (46) in Drees (2003). See Section 5 in Padoan and Stupfler (2020) for details. When biast=TRUE
confidence intervals are computed in the same way but after correcting the tail index estimate by an estimate of the bias term, see formula (4.2) in de Haan et al. (2016) for details.
A list with elements:
EExpcHat
: an estimate of the τ'_n-th expecile;
VarExtHat
: an estimate of the asymptotic variance of the expectile estimator;
CIExpct
: an estimate of the approximate (1-α)100\% confidence interval for τ'_n-th expecile.
Simone Padoan, simone.padoan@unibocconi.it, http://mypage.unibocconi.it/simonepadoan/; Gilles Stupfler, gilles.stupfler@ensai.fr, http://ensai.fr/en/equipe/stupfler-gilles/
Padoan A.S. and Stupfler, G. (2020). Extreme expectile estimation for heavy-tailed time series. arXiv e-prints arXiv:2004.04078, https://arxiv.org/abs/2004.04078.
Daouia, A., Girard, S. and Stupfler, G. (2018). Estimation of tail risk based on extreme expectiles. Journal of the Royal Statistical Society: Series B, 80, 263-292.
de Haan, L., Mercadier, C. and Zhou, C. (2016). Adapting extreme value statistics tonancial time series: dealing with bias and serial dependence. Finance and Stochastics, 20, 321-354.
Drees, H. (2003). Extreme quantile estimation for dependent data, with applications to finance. Bernoulli, 9, 617-657.
Drees, H. (2000). Weighted approximations of tail processes for β-mixing random variables. Annals of Applied Probability, 10, 1274-1301.
Leadbetter, M.R., Lindgren, G. and Rootzen, H. (1989). Extremes and related properties of random sequences and processes. Springer.
HTailIndex, EBTailIndex, estExpectiles, extQuantile
# Extreme expectile estimation at the extreme level tau1 obtained with # 1-dimensional data simulated from an AR(1) with univariate # Student-t distributed innovations tsDist <- "studentT" tsType <- "AR" # parameter setting corr <- 0.8 df <- 3 par <- c(corr, df) # Big- small-blocks setting bigBlock <- 65 smallBlock <- 15 # Intermediate level (or sample tail probability 1-tau) tau <- 0.95 # Extreme level (or tail probability 1-tau1 of unobserved expectile) tau1 <- 0.9995 # sample size ndata <- 2500 # Simulates a sample from an AR(1) model with Student-t innovations data <- rtimeseries(ndata, tsDist, tsType, par) # Extreme expectile estimation expectHat1 <- predExpectiles(data, tau, tau1, var=TRUE, bigBlock=bigBlock, smallBlock=smallBlock) expectHat1$EExpcHat expectHat1$CIExpct # Extreme expectile estimation with bias correction tau <- 0.80 expectHat2 <- predExpectiles(data, tau, tau1, "QB", var=TRUE, bias=TRUE, bigBlock=bigBlock, smallBlock=smallBlock) expectHat2$EExpcHat expectHat2$CIExpct