Generalized extreme value distribution

From Infogalactic: the planetary knowledge core
(Redirected from Extreme value distribution)
Jump to: navigation, search
Notation \textrm{GEV}(\mu,\,\sigma,\,\xi)
Parameters μRlocation,
σ > 0 — scale,
ξRshape.
Support x ∈ [ μσ / ξ, +∞)   when ξ > 0,
x ∈ (−∞, +∞)   when ξ = 0,
x ∈ (−∞, μσ / ξ ]   when ξ < 0.
PDF \frac{1}{\sigma}\,t(x)^{\xi+1}e^{-t(x)},   where t(x) = \begin{cases}\big(1+(\tfrac{x-\mu}{\sigma})\xi\big)^{-1/\xi} & \textrm{if}\ \xi\neq0 \\ e^{-(x-\mu)/\sigma} & \textrm{if}\ \xi=0\end{cases}
CDF e^{-t(x)},\,   for x ∈ support
Mean \begin{cases}\mu + \sigma\frac{\Gamma(1-\xi)-1}{\xi} & \text{if}\ \xi\neq 0,\xi<1,\\ \mu + \sigma\,\gamma & \text{if}\ \xi=0,\\ \infty & \text{if}\ \xi\geq 1,\end{cases} where \gamma is Euler’s constant.
Median \begin{cases}\mu + \sigma \frac{(\ln2)^{-\xi}-1}{\xi} & \text{if}\ \xi\neq0,\\ \mu - \sigma \ln\ln2 & \text{if}\ \xi=0.\end{cases}
Mode \begin{cases}\mu + \sigma \frac{(1+\xi)^{-\xi}-1}{\xi} & \text{if}\ \xi\neq0,\\ \mu & \text{if}\ \xi=0.\end{cases}
Variance \begin{cases}\sigma^2\,(g_2-g_1^2)/\xi^2 & \text{if}\ \xi\neq0,\xi<\frac12,\\ \sigma^2\,\frac{\pi^2}{6} & \text{if}\ \xi=0, \\ \infty & \text{if}\ \xi\geq\frac12,\end{cases}
where gk = Γ(1 − ).
Skewness \begin{cases}\frac{g_3-3g_1g_2+2g_1^3}{(g_2-g_1^2)^{3/2}} & \text{if}\ \xi >0,\\ -\frac{g_3-3g_1g_2+2g_1^3}{(g_2-g_1^2)^{3/2}} & \text{if}\ \xi <0,\\ \frac{12 \sqrt{6} \zeta(3)}{\pi^3} & \text{if}\ \xi=0.\end{cases}
where \zeta(x) is Riemann zeta function
Ex. kurtosis \begin{cases}\frac{g_4-4g_1g_3+6g_2g_1^2-3g_1^4}{(g_2-g_1^2)^{2}}-3  & \text{if}\ \xi\neq0,\xi<\frac{1}{4},\\ \frac{12}{5} & \text{if}\ \xi=0,\\ \infty  & \text{if}\ \xi\ge\frac{1}{4}.\end{cases}
Entropy \log(\sigma)\,+\,\gamma\xi\,+\,(\gamma+1)
MGF [1]
CF [1]

In probability theory and statistics, the generalized extreme value (GEV) distribution is a family of continuous probability distributions developed within extreme value theory to combine the Gumbel, Fréchet and Weibull families also known as type I, II and III extreme value distributions. By the extreme value theorem the GEV distribution is the only possible limit distribution of properly normalized maxima of a sequence of independent and identically distributed random variables. Note that a limit distribution need not exist: this requires regularity conditions on the tail of the distribution. Despite this, the GEV distribution is often used as an approximation to model the maxima of long (finite) sequences of random variables.

In some fields of application the generalized extreme value distribution is known as the Fisher–Tippett distribution, named after Ronald Fisher and L. H. C. Tippett who recognised three function forms outlined below. However usage of this name is sometimes restricted to mean the special case of the Gumbel distribution.

Specification

The generalized extreme value distribution has cumulative distribution function

F(x;\mu,\sigma,\xi) = \exp\left\{-\left[1+\xi\left(\frac{x-\mu}{\sigma}\right)\right]^{-1/\xi}\right\}

for 1+\xi(x-\mu)/\sigma>0, where \mu\in\mathbb R is the location parameter, \sigma>0 the scale parameter and \xi\in\mathbb R the shape parameter. Thus for \xi>0, the expression just given for the cumulative distribution function is valid for x > \mu-\sigma/\xi, while for \xi<0 it is valid for x < \mu+ \sigma/(-\xi). In the first case, at the lower end-point it equals 0; in the second case, at the upper end-point, it equals 1. For \xi = 0 the expression just given for the cumulative distribution function is formally undefined and is replaced by the result obtained by taking the limit as \xi\to 0

F(x;\mu,\sigma,0) = \exp\left\{-\exp \left(-\frac{x-\mu}{\sigma}\right)\right\},

without any restriction on x.

The density function is, consequently,

f(x;\mu,\sigma,\xi) = \frac{1}{\sigma}\left[1+\xi\left(\frac{x-\mu}{\sigma}\right)\right]^{(-1/\xi)-1} \exp\left\{-\left[1+\xi\left(\frac{x-\mu}{\sigma}\right)\right]^{-1/\xi}\right\}

again, for x > \mu-\sigma/\xi in the case \xi>0, and for x < \mu+\sigma/(-\xi) in the case \xi<0. The density is zero outside of the relevant range. In the case \xi=0 the density is positive on the whole real line and equal to

f(x;\mu,\sigma,\xi) = \frac{1}{\sigma}\exp\left[-\left(\frac{x-\mu}{\sigma}\right)\right] \exp\left\{-\exp\left[\left(-\frac{x-\mu}{\sigma}\right)\right]\right\}.

Example of density functions for distributions of the GEV family.

Summary statistics

Some simple statistics of the distribution are:[citation needed]

\operatorname{E}(X) = \mu-\frac{\sigma}{\xi}+\frac{\sigma}{\xi}g_1 ,
\operatorname{Var}(X) = \frac{\sigma^2}{\xi^2}(g_2-g_1^2) ,
\operatorname{Mode}(X) = \mu+\frac{\sigma}{\xi}[(1+\xi)^{-\xi}-1] .

The skewness is for ξ>0

\operatorname{skewness}(X) = \frac{g_3-3g_1g_2+2g_1^3}{(g_2-g_1^2)^{3/2}}

For ξ<0, the sign of the numerator is reversed.

The excess kurtosis is:

\operatorname{kurtosis\ excess}(X) = \frac{g_4-4g_1g_3+6g_2g_1^2-3g_1^4}{(g_2-g_1^2)^{2}}-3 .

where g_k=\Gamma(1-k\xi), k=1,2,3,4, and \Gamma(t) is the gamma function.

Link to Fréchet, Weibull and Gumbel families

Ronald Fisher

The shape parameter \xi governs the tail behaviour of the distribution. The sub-families defined by \xi= 0, \xi>0 and \xi<0 correspond, respectively, to the Gumbel, Fréchet and Weibull families, whose cumulative distribution functions are displayed below.

  • Gumbel or type I extreme value distribution (\xi=0)
 F(x;\mu,\sigma,0)=e^{-e^{-(x-\mu)/\sigma}}\;\;\; \text{for} \;\; x\in\mathbb R.
  • Fréchet or type II extreme value distribution, if \xi=\alpha^{-1}>0 and  y = 1 + \xi (x-\mu)/\sigma
 F(x;\mu,\sigma,\xi)=\begin{cases} e^{-y^{-\alpha}} & y > 0 \\ 0 & y \leq 0. \end{cases}
  • Reversed Weibull or type III extreme value distribution, if \xi=-\alpha^{-1}<0 and  y = - \left( 1 + \xi (x-\mu)/\sigma \right)
 F(x;\mu,\sigma,\xi)=\begin{cases} e^{-(-y)^{\alpha}} & y<0 \\ 1 & y\geq 0 \end{cases}

Remark I: The theory here relates to maxima and the distribution being discussed is an extreme value distribution for maxima. A generalised extreme value distribution for minima can be obtained, for example by substituting (−x) for x in the distribution function, and subtracting from one: this yields a separate family of distributions.

Remark II: The ordinary Weibull distribution arises in reliability applications and is obtained from the distribution here by using the variable  t = \mu - x , which gives a strictly positive support - in contrast to the use in the extreme value theory here. This arises because the Weibull distribution is used in cases that deal with the minimum rather than the maximum. The distribution here has an addition parameter compared to the usual form of the Weibull distribution and, in addition, is reversed so that the distribution has an upper bound rather than a lower bound. Importantly, in applications of the GEV, the upper bound is unknown and so must be estimated while when applying the Weibull distribution the lower bound is known to be zero.

Remark III: Note the differences in the ranges of interest for the three extreme value distributions: Gumbel is unlimited, Fréchet has a lower limit, while the reversed Weibull has an upper limit.

One can link the type I to types II and III the following way: if the cumulative distribution function of some random variable X is of type II, and with the positive numbers as support, i.e. F(x; 0, \sigma, \alpha), then the cumulative distribution function of \ln X is of type I, namely F(x; \ln \sigma, 1/\alpha, 0). Similarly, if the cumulative distribution function of X is of type III, and with the negative numbers as support, i.e. F(x; 0, \sigma, -\alpha), then the cumulative distribution function of \ln (-X) is of type I, namely F(x; -\ln \sigma, 1/\alpha, 0).

Link to logit models (logistic regression)

Multinomial logit models, and certain other types of logistic regression, can be phrased as latent variable models with error variables distributed as Gumbel distributions (type I generalized extreme value distributions). This phrasing is common in the theory of discrete choice models, which include logit models, probit models, and various extensions of them, and derives from the fact that the difference of two type-I GEV-distributed variables follows a logistic distribution, of which the logit function is the quantile function. The type-I GEV distribution thus plays the same role in these logit models as the normal distribution does in the corresponding probit models.

Properties

The cumulative distribution function of the generalized extreme value distribution solves the stability postulate equation.[citation needed] The generalized extreme value distribution is a special case of a max-stable distribution, and is a transformation of a min-stable distribution.

Applications

The GEV distribution is widely used in the treatment of "tail risks" in fields ranging from insurance to finance. In the latter case, it has been considered as a means of assessing various financial risks via metrics such as Value at Risk.[2] However, the resulting shape parameters have been found to lie in the range leading to undefined means and variances, which poses a threat to reliable data analysis.[3]

Related distributions

See also

Lua error in package.lua at line 80: module 'strict' not found.

Notes

  1. 1.0 1.1 Muraleedharan. G, C. Guedes Soares and Cláudia Lucas (2011). "Characteristic and Moment Generating Functions of Generalised Extreme Value Distribution (GEV)". In Linda. L. Wright (Ed.), Sea Level Rise, Coastal Engineering, Shorelines and Tides, Chapter-14, pp. 269–276. Nova Science Publishers. ISBN 978-1-61728-655-1
  2. Moscadelli, Marco. "The modelling of operational risk: experience with the analysis of the data collected by the Basel Committee." Available at SSRN 557214 (2004).
  3. Kjersti Aas, lecture, NTNU, Trondheim, 23 Jan 2008

References

  • Lua error in package.lua at line 80: module 'strict' not found.
  • Lua error in package.lua at line 80: module 'strict' not found.
  • Lua error in package.lua at line 80: module 'strict' not found.
  • Lua error in package.lua at line 80: module 'strict' not found.