Stable distribution

From Infogalactic: the planetary knowledge core
Jump to: navigation, search

Lua error in package.lua at line 80: module 'strict' not found.

<templatestyles src="Module:Hatnote/styles.css"></templatestyles>

Stable
Probability density function
Symmetric stable distributions
Symmetric α-stable distributions with unit scale factor
Skewed centered stable distributions
Skewed centered stable distributions with unit scale factor
Cumulative distribution function
CDF's for symmetric α-stable distributions
CDFs for symmetric α-stable distributions
CDF's for skewed centered Lévy distributions
CDFs for skewed centered stable distributions
Parameters α ∈ (0, 2] — stability parameter

β ∈ [−1, 1] — skewness parameter (note that skewness is undefined)
c ∈ (0, ∞) — scale parameter

μ ∈ (−∞, ∞) — location parameter
Support xR, or x ∈ [μ, +∞) if α < 1 and β = 1, or x ∈ (-∞, μ] if α < 1 and β = −1
PDF not analytically expressible, except for some parameter values
CDF not analytically expressible, except for certain parameter values
Mean μ when α > 1, otherwise undefined
Median μ when β = 0, otherwise not analytically expressible
Mode μ when β = 0, otherwise not analytically expressible
Variance 2c2 when α = 2, otherwise infinite
Skewness 0 when α = 2, otherwise undefined
Ex. kurtosis 0 when α = 2, otherwise undefined
Entropy not analytically expressible, except for certain parameter values
MGF undefined
CF \exp\!\Big[\; it\mu - |c\,t|^\alpha\,(1-i \beta\,\mbox{sgn}(t)\Phi) \;\Big],
where \Phi = \begin{cases} \tan\tfrac{\pi\alpha}{2} & \text{if }\alpha \ne 1 \\ -\tfrac{2}{\pi}\log|t| & \text{if }\alpha = 1 \end{cases}

In probability theory, a distribution is said to be stable (or a random variable is said to be stable) if a linear combination of two independent copies of a random sample has the same distribution, up to location and scale parameters. The stable distribution family is also sometimes referred to as the Lévy alpha-stable distribution, after Paul Lévy, the first mathematician to have studied it.[1][2]

Of the four parameters defining the family, most attention has been focused on the stability parameter, α (see panel). Stable distributions have 0 < α ≤ 2, with the upper bound corresponding to the normal distribution, and α = 1 to the Cauchy distribution. The distributions have undefined variance for α < 2, and undefined mean for α ≤ 1. The importance of stable probability distributions is that they are "attractors" for properly normed sums of independent and identically-distributed (iid) random variables. The normal distribution defines a family of stable distributions. By the classical central limit theorem the properly normed sum of a set of random variables, each with finite variance, will tend towards a normal distribution as the number of variables increases. Without the finite variance assumption the limit may be a stable distribution. Mandelbrot referred to stable distributions that are non-normal as "stable Paretian distributions",[3][4][5] after Vilfredo Pareto. Mandelbrot referred to "positive" stable distributions (meaning maximally skewed in the positive direction) with 1<α<2 as "Pareto-Levy distributions".[1] He also regarded this range as relevant for the description of stock and commodity prices.

q-analogs of all symmetric stable distributions have been defined, and these recover the usual symmetric stable distributions in the limit of q → 1.[6]

Definition

A non-degenerate distribution is a stable distribution if it satisfies the following property:

Let X1 and X2 be independent copies of a random variable X. Then X is said to be stable if for any constants a > 0 and b > 0 the random variable aX1 + bX2 has the same distribution as cX + d for some constants c > 0 and d. The distribution is said to be strictly stable if this holds with d = 0.[7]

Since the normal distribution, the Cauchy distribution, and the Lévy distribution all have the above property, it follows that they are special cases of stable distributions.

Such distributions form a four-parameter family of continuous probability distributions parametrized by location and scale parameters μ and c, respectively, and two shape parameters β and α, roughly corresponding to measures of asymmetry and concentration, respectively (see the figures).

Although the probability density function for a general stable distribution cannot be written analytically, the general characteristic function can be. Any probability distribution is given by the Fourier transform of its characteristic function φ(t) by:

 f(x)=\frac{1}{2\pi}\int_{-\infty}^\infty \varphi(t)e^{-ixt}\,dt

A random variable X is called stable if its characteristic function can be written as[7][8]

 \varphi(t;\alpha,\beta,c,\mu) =  \exp\left[~it\mu\!-\!|c t|^\alpha\,(1\!-\!i \beta\,\textrm{sgn}(t)\Phi)~\right]

where sgn(t) is just the sign of t and Φ is given by

\Phi=\tan(\pi \alpha/2)\,

for all α except α = 1 in which case:

\Phi=-\frac{2}{\pi}\log|t|.\,

μ ∈ R is a shift parameter, β ∈ [−1, 1], called the skewness parameter, is a measure of asymmetry. Notice that in this context the usual skewness is not well defined, as for α < 2 the distribution does not admit 2nd or higher moments, and the usual skewness definition is the 3rd central moment.

In the simplest case β = 0, the characteristic function is just a stretched exponential function; the distribution is symmetric about μ and is referred to as a (Lévy) symmetric alpha-stable distribution, often abbreviated SαS.

When α < 1 and β = 1, the distribution is supported by [μ, ∞).

The parameter |c| > 0 is a scale factor which is a measure of the width of the distribution and α is the exponent or index of the distribution and specifies the asymptotic behavior of the distribution for α < 2.

Parameterizations

The above definition is only one of the parameterizations in use for stable distributions; it is the most common but is not continuous in the parameters. For example, for the case α = 1 we could replace Φ by:[7]

 \Phi'=-\frac{1}{\pi}\ln\left(\frac{t}{|c|}\right)

and μ by

\mu'=\mu+\frac{\beta|c|\ln(|c|)}{\pi}

This parameterization has the advantage that we may define a standard distribution using

 \tau=\frac{t}{|c|}

and

 y=\frac{x-\mu'}{|c|}

The pdf for all α will then have the following standardization property:

f(x;\alpha,\beta,c,\mu')dx=f(y;\alpha,\beta,1,0)dy\,

The distribution

A stable distribution is therefore specified by the above four parameters. It can be shown that any non-degenerate stable distribution has a smooth (infinitely differentiable) density function.[7] If f(x;\alpha, \beta,c,\mu) denotes the density of X and Y is the sum of independent copies of X:

Y = \sum_{i=1}^N k_i (X_i-\mu)\,

then Y has the density s^{-1}f(y/s;\alpha,\beta,c,0) with

s=\left(\sum_{i=1}^N |k_i|^\alpha\right)^{1/\alpha}.\,

The asymptotic behavior is described, for α< 2, by:[7]

f(x)\sim\frac{c^\alpha (1+\mbox{sgn}(x)\beta) \sin(\pi \alpha / 2)\Gamma(\alpha+1)/\pi}{|x|^{1+\alpha}}

where Γ is the Gamma function (except that when α < 1 and β = ±1, the tail vanishes to the left or right, resp., of μ). This "heavy tail" behavior causes the variance of stable distributions to be infinite for all α < 2. This property is illustrated in the log-log plots below.

When α = 2, the distribution is Gaussian (see below), with tails asymptotic to exp(−x2/4c2)/(2c√π).

Properties

Stable distributions are closed under convolution for a fixed value of α. Since convolution is equivalent to multiplication of the Fourier-transformed function, it follows that the product of two stable characteristic functions with the same α will yield another such characteristic function. The product of two stable characteristic functions is given by:

\exp\left[it\mu_1+it\mu_2 - |c_1 t|^\alpha - |c_2 t|^\alpha +i\beta_1|c_1 t|^\alpha\textrm{sgn}(t)\Phi +i\beta_2|c_2 t|^\alpha\,\textrm{sgn}(t)\Phi \right]

Since Φ is not a function of the μ, c or β variables it follows that these parameters for the convolved function are given by:

\mu=\mu_1+\mu_2\,
|c|=(|c_1|^\alpha+|c_2|^\alpha)^{1/\alpha}\,
\beta=\frac{\beta_1|c_1|^\alpha+\beta_2|c_2|^\alpha}{|c|^\alpha}

In each case, it can be shown that the resulting parameters lie within the required intervals for a stable distribution.

A generalized central limit theorem

Another important property of stable distributions is the role that they play in a generalized central limit theorem. The central limit theorem states that the sum of a number of independent and identically distributed (i.i.d.) random variables with finite variances will tend to a normal distribution as the number of variables grows.

A generalization due to Gnedenko and Kolmogorov states that the sum of a number of random variables with a power-law tail (Paretian tail) distribution decreasing as |x|−α−1 where 0 < α < 2 (and therefore having infinite variance) will tend to a stable distribution f(x;\alpha,0,c,0) as the number of summands grows.[9] If α>2 then the sum converges to a stable distribution with stability parameter equal to 2, i.e. a Gaussian distribution.[10]

Special cases

Log-log plot of symmetric centered stable distribution PDF's showing the power law behavior for large x. The power law behavior is evidenced by the straight-line appearance of the PDF for large x, with the slope equal to −(α+1). (The only exception is for α = 2, in black, which is a normal distribution.)
Log-log plot of skewed centered stable distribution PDF's showing the power law behavior for large x. Again the slope of the linear portions is equal to −(α+1)

There is no general analytic solution for the form of p(x). There are, however three special cases which can be expressed in terms of elementary functions as can be seen by inspection of the characteristic function:[7][8][11]

  • For α = 2 the distribution reduces to a Gaussian distribution with variance σ2 = 2c2 and mean μ; the skewness parameter β has no effect.
  • For α = 1 and β = 0 the distribution reduces to a Cauchy distribution with scale parameter c and shift parameter μ.
  • For α =1/2 and β = 1 the distribution reduces to a Lévy distribution with scale parameter c and shift parameter μ.

Note that the above three distributions are also connected, in the following way: A standard Cauchy random variable can be viewed as a mixture of Gaussian random variables (all with mean zero), with the variance being drawn from a standard Lévy distribution. And in fact this is a special case of a more general theorem [12] which allows any symmetric alpha-stable distribution to be viewed in this way (with the alpha parameter of the mixture distribution equal to twice the alpha parameter of the mixing distribution—and the beta parameter of the mixing distribution always equal to one).

A general closed form expression for stable PDF's with rational values of α is available in terms of Meijer G-functions.[13] Fox H-Functions can also be used to express the stable probability density functions. For simple rational numbers, the closed form expression is often in terms of less complicated special functions. Several closed form expressions having rather simple expressions in terms of special functions are available. In the table below, PDF's expressible by elementary functions are indicated by an E and those that are expressible by special functions are indicated by an s.[12]

α
1/3 1/2 2/3 1 4/3 3/2 2
β=0 s s s E s s E
β=1 s E s s s

Some of the special cases are known by particular names:

  • For α = 1 and β = 1, the distribution is a Landau distribution which has a specific usage in physics under this name.
  • For α = 3/2 and β = 0 the distribution reduces to a Holtsmark distribution with scale parameter c and shift parameter μ.

Also, in the limit as c approaches zero or as α approaches zero the distribution will approach a Dirac delta function δ(x−μ).

Series representation

The stable distribution can be restated as the real part of a simpler integral:[14]

f(x;\alpha,\beta,c,\mu)=\frac{1}{\pi}\Re\left[ \int_0^\infty e^{it(x-\mu)}e^{-(ct)^\alpha(1-i\beta\Phi)}\,dt\right].

Expressing the second exponential as a Taylor series, we have:

f(x;\alpha,\beta,c,\mu)=\frac{1}{\pi}\Re\left[ \int_0^\infty e^{it(x-\mu)}\sum_{n=0}^\infty\frac{(-qt^\alpha)^n}{n!}\,dt\right]

where q=c^\alpha(1-i\beta\Phi). Reversing the order of integration and summation, and carrying out the integration yields:

f(x;\alpha,\beta,c,\mu)=\frac{1}{\pi}\Re\left[ \sum_{n=1}^\infty\frac{(-q)^n}{n!}\left(\frac{i}{x-\mu}\right)^{\alpha n+1}\Gamma(\alpha n+1)\right]

which will be valid for x ≠ μ and will converge for appropriate values of the parameters. (Note that the n = 0 term which yields a delta function in x−μ has therefore been dropped.) Expressing the first exponential as a series will yield another series in positive powers of x−μ which is generally less useful.

Simulation of stable variables

Simulating sequences of stable random variables is not straightforward, since there are no analytic expressions for the inverse F^{-1}(x) nor the CDF F(x) itself.[15][16] All standard approaches like the rejection or the inversion methods would require tedious computations. A much more elegant and efficient solution was proposed by Chambers, Mallows and Stuck (CMS),[17] who noticed that a certain integral formula[18] yielded the following algorithm:[19]

  • generate a random variable U uniformly distributed on (-\frac{\pi}{2},\frac{\pi}{2}) and an independent exponential random variable W with mean 1;
  • for \alpha\ne 1 compute:

X = (1+\zeta^2)^\frac{1}{2\alpha}

\frac{\sin\{ \alpha(U+\xi)\}}{\{\cos(U)\}^{1/ \alpha}}

\left[\frac{\cos\{U - \alpha(U+\xi) \}}{W} \right]^\frac{1-\alpha}{\alpha},

  • for \alpha=1 compute:

X = \frac{1}{\xi}\left\{\left(\frac{\pi}{2}+\beta U \right)\tan U-

\beta\log\left(\frac{\frac{\pi}{2} W\cos U}{\frac{\pi}{2}+\beta U}\right)\right\}, where

\xi =\begin{cases}
	 \frac{1}{\alpha} \arctan(-\zeta), & \alpha \ne 1, \\
	 \frac{\pi}{2} , & \alpha=1,
	\end{cases}

and

\zeta = -\beta\tan\frac{\pi\alpha}{2}.

This algorithm yields a random variable X\sim S_\alpha(\beta,1,0). For a detailed proof see.[20]

Given the formulas for simulation of a standard stable random variable, we can easily simulate a stable random variable for all admissible values of the parameters \alpha, c, \beta and \mu using the following property. If X\sim S_\alpha(\beta,1,0) then

Y=\begin{cases}

c X+\mu, & \alpha \ne 1, \\

c X+\frac{2}{\pi}\beta c\log c + \mu, & \alpha=1,

\end{cases}

is S_\alpha(\beta,c,\mu). It is interesting to note that for \alpha=2 (and \beta=0) the CMS method reduces to the well known Box-Muller transform for generating Gaussian random variables.[21] Many other approaches have been proposed in the literature, including application of Bergström and LePage series expansions, see [22] and,[23] respectively. However, the CMS method is regarded as the fastest and the most accurate.

Applications

Stable distributions owe their importance in both theory and practice to the generalization of the central limit theorem to random variables without second (and possibly first) order moments and the accompanying self-similarity of the stable family. It was the seeming departure from normality along with the demand for a self-similar model for financial data (i.e. the shape of the distribution for yearly asset price changes should resemble that of the constituent daily or monthly price changes) that led Benoît Mandelbrot to propose that cotton prices follow an alpha-stable distribution with α equal to 1.7.[24] Lévy distributions are frequently found in analysis of critical behavior and financial data.[8][25]

They are also found in spectroscopy as a general expression for a quasistatically pressure broadened spectral line.[14]

Lévy distribution of solar flare waiting time events (time between flare events) was demonstrated for CGRO BATSE hard x-ray solar flares December 2001. Analysis of the Lévy statistical signature revealed that two different memory signatures were evident; one related to the solar cycle and the second whose origin appears to be associated with a localized or combination of localized solar active region effects.[26]

See also

Notes

  • The STABLE program for Windows is available from John Nolan's stable webpage: http://academic2.american.edu/~jpnolan/stable/stable.html. It calculates the density (pdf), cumulative distribution function (cdf) and quantiles for a general stable distribution, and performs maximum likelihood estimation of stable parameters and some exploratory data analysis techniques for assessing the fit of a data set.
  • Matlab codes for simulation of stable variables and estimation of stable parameters are available from RePEc: https://ideas.repec.org/e/pwe42.html#software

References

  1. 1.0 1.1 B. Mandelbrot, The Pareto-Lévy Law and the Distribution of Income, International Economic Review 1960 http://www.jstor.org/stable/2525289
  2. Paul Lévy, Calcul des probabilités 1925
  3. B.Mandelbrot, Stable Paretian Random Functions and the Multiplicative Variation of Income, Econometrica 1961 http://www.jstor.org/stable/pdfplus/1911802.pdf
  4. B. Mandelbrot, The variation of certain Speculative Prices, The Journal of Business 1963 [1]
  5. Eugene F. Fama, Mandelbrot and the Stable Paretian Hypothesis, The Journal of Business 1963
  6. Lua error in package.lua at line 80: module 'strict' not found.
  7. 7.0 7.1 7.2 7.3 7.4 7.5 Lua error in package.lua at line 80: module 'strict' not found.
  8. 8.0 8.1 8.2 Lua error in package.lua at line 80: module 'strict' not found.
  9. B.V. Gnedenko, A.N. Kolmogorov. Limit distributions for sums of independent random variables, Cambridge, Addison-Wesley 1954 https://books.google.com/books/about/Limit_distributions_for_sums_of_independ.html?id=rYsZAQAAIAAuJ&redir_esc=y
  10. Vladimir V. Uchaikin, Vladimir M. Zolotarev, Chance and Stability: Stable Distributions and their Applications, De Gruyter 1999 https://books.google.com/books/about/Chance_and_Stability.html?id=Y0xiwAmkb_oC&redir_esc=y
  11. Lua error in package.lua at line 80: module 'strict' not found.
  12. 12.0 12.1 Lua error in package.lua at line 80: module 'strict' not found.
  13. Lua error in package.lua at line 80: module 'strict' not found.
  14. 14.0 14.1 Lua error in package.lua at line 80: module 'strict' not found.
  15. Lua error in package.lua at line 80: module 'strict' not found.
  16. Lua error in package.lua at line 80: module 'strict' not found.
  17. Lua error in package.lua at line 80: module 'strict' not found.
  18. Lua error in package.lua at line 80: module 'strict' not found.
  19. Lua error in package.lua at line 80: module 'strict' not found.
  20. Lua error in package.lua at line 80: module 'strict' not found.
  21. Lua error in package.lua at line 80: module 'strict' not found.
  22. Lua error in package.lua at line 80: module 'strict' not found.
  23. Lua error in package.lua at line 80: module 'strict' not found.
  24. Mandelbrot, B., New methods in statistical economics The Journal of Political Economy, 71 #5, 421-440 (1963).
  25. Lua error in package.lua at line 80: module 'strict' not found.
  26. Leddon, D., A statistical Study of Hard X-Ray Solar Flares