Large deviations theory
{{Short description|Branch of probability theory}}
{{Use dmy dates|date=August 2020}}
In probability theory, the theory of large deviations concerns the asymptotic behaviour of remote tails of sequences of probability distributions. While some basic ideas of the theory can be traced to Laplace, the formalization started with insurance mathematics, namely ruin theory with Cramér and Lundberg. A unified formalization of large deviation theory was developed in 1966, in a paper by Varadhan.S.R.S. Varadhan, Asymptotic probability and differential equations, Comm. Pure Appl. Math. 19 (1966),261-286. Large deviations theory formalizes the heuristic ideas of concentration of measures and widely generalizes the notion of convergence of probability measures.
Roughly speaking, large deviations theory concerns itself with the exponential decline of the probability measures of certain kinds of extreme or tail events.
Introductory examples
{{Quote|text=Any large deviation is done in the least unlikely of all the unlikely ways!|title=Large Deviations|author=Frank den Hollander|source=p. 10}}
= An elementary example =
Consider a sequence of independent tosses of a fair coin. The possible outcomes could be heads or tails. Let us denote the possible outcome of the i-th trial by {{nowrap|,}} where we encode head as 1 and tail as 0. Now let denote the mean value after trials, namely
:{{nowrap|.}}
Then lies between 0 and 1. From the law of large numbers it follows that as N grows, the distribution of converges to (the expected value of a single coin toss).
Moreover, by the central limit theorem, it follows that is approximately normally distributed for large {{nowrap|.}} The central limit theorem can provide more detailed information about the behavior of than the law of large numbers. For example, we can approximately find a tail probability of {{nowrap|}} {{ndash}} the probability that is greater than some value {{nowrap|}} {{ndash}} for a fixed value of {{nowrap|.}} However, the approximation by the central limit theorem may not be accurate if is far from and is not sufficiently large. Also, it does not provide information about the convergence of the tail probabilities as {{nowrap|.}} However, the large deviation theory can provide answers for such problems.
Let us make this statement more precise. For a given value {{nowrap|
:{{nowrap|
Note that the function
:{{nowrap|
The probability
= Large deviations for sums of independent random variables =
{{main|Cramér's theorem (large deviations)}}
In the above example of coin-tossing we explicitly assumed that each toss is an
independent trial, and the probability of getting head or tail is always the same.
Let
:{{nowrap|
Here
:{{nowrap|
as before.
Function
The above-mentioned limit means that for large {{nowrap|
:{{nowrap|
which is the basic result of large deviations theory.{{Cite web |date=2 February 2012 |title=Large Deviations |url=https://math.nyu.edu/~varadhan/Spring2012/Chapters1-2.pdf |access-date=11 June 2024 |website=www.math.nyu.edu}}S.R.S. Varadhan, Large Deviations and Applications (SIAM, Philadelphia, 1984)
If we know the probability distribution of {{nowrap|
:{{nowrap|
where
:
is called the cumulant generating function (CGF) and
If
If
= Moderate deviations for sums of independent random variables =
The previous example controlled the probability of the event
{{Math theorem
| math_statement = Let
}}
In particular, the limit case
Formal definition
Given a Polish space
:{{nowrap|
where
Brief history
The first rigorous results concerning large deviations are due to the Swedish mathematician Harald Cramér, who applied them to model the insurance business.Cramér, H. (1944). On a new limit theorem of the theory of probability. Uspekhi Matematicheskikh Nauk, (10), 166-178. From the point
of view of an insurance company, the earning is at a constant rate per month (the monthly premium) but the claims come randomly. For the company to be successful over a certain period of time (preferably many months), the total earning should exceed the total claim. Thus to estimate the premium you have to ask the following question: "What should we choose as the premium
A very incomplete list of mathematicians who have made important advances would include Petrov,Petrov V.V. (1954) Generalization of Cramér's limit theorem. Uspehi Matem. Nauk, v. 9, No 4(62), 195--202.(Russian) Sanov,Sanov I.N. (1957) On the probability of large deviations of random magnitudes. Matem. Sbornik, v. 42 (84), 11--44. S.R.S. Varadhan (who has won the Abel prize for his contribution to the theory), D. Ruelle, O.E. Lanford, Mark Freidlin, Alexander D. Wentzell, Amir Dembo, and Ofer Zeitouni.Dembo, A., & Zeitouni, O. (2009). Large deviations techniques and applications (Vol. 38). Springer Science & Business Media
Applications
Principles of large deviations may be effectively applied to gather information out of a probabilistic model. Thus, theory of large deviations finds its applications in information theory and risk management. In physics, the best known application of large deviations theory arise in thermodynamics and statistical mechanics (in connection with relating entropy with rate function).
= Large deviations and entropy =
{{main|asymptotic equipartition property}}
The rate function is related to the entropy in statistical mechanics. This can be heuristically seen in the following way. In statistical mechanics the entropy of a particular macro-state is related to the number of micro-states which corresponds to this macro-state. In our coin tossing example the mean value
There is a relation between the "rate function" in large deviations theory and the Kullback–Leibler divergence, the connection is established by Sanov's theorem (see Sanov and
Novak,Novak S.Y. (2011) Extreme value methods with applications to finance. Chapman & Hall/CRC Press. {{isbn|978-1-4398-3574-6}}. ch. 14.5).
In a special case, large deviations are closely related to the concept of Gromov–Hausdorff limits.Kotani M., Sunada T. Large deviation and the tangent cone at infinity of a crystal lattice, Math. Z. 254, (2006), 837-870.
See also
- Large deviation principle
- Cramér's large deviation theorem
- Chernoff's inequality
- Sanov's theorem
- Contraction principle (large deviations theory), a result on how large deviations principles "push forward"
- Freidlin–Wentzell theorem, a large deviations principle for Itō diffusions
- Legendre transformation, Ensemble equivalence is based on this transformation.
- Laplace principle, a large deviations principle in Rd
- Laplace's method
- Schilder's theorem, a large deviations principle for Brownian motion
- Varadhan's lemma
- Extreme value theory
- Large deviations of Gaussian random functions
References
{{Reflist}}
Bibliography
- [https://arxiv.org/abs/0804.2330v1 Special invited paper: Large deviations] by S. R. S. Varadhan The Annals of Probability 2008, Vol. 36, No. 2, 397–419 {{doi|10.1214/07-AOP348}}
- [https://arxiv.org/abs/1106.4146 A basic introduction to large deviations: Theory, applications, simulations], Hugo Touchette, arXiv:1106.4146.
- Entropy, Large Deviations and Statistical Mechanics by R.S. Ellis, Springer Publication. {{isbn|3-540-29059-1}}
- Large Deviations for Performance Analysis by Alan Weiss and Adam Shwartz. Chapman and Hall {{isbn|0-412-06311-5}}
- Large Deviations Techniques and Applications by Amir Dembo and Ofer Zeitouni. Springer {{isbn|0-387-98406-2}}
- A course on large deviations with an introduction to Gibbs measures by Firas Rassoul-Agha and Timo Seppäläinen. Grad. Stud. Math., 162. American Mathematical Society {{isbn|978-0-8218-7578-0}}
- Random Perturbations of Dynamical Systems by M.I. Freidlin and A.D. Wentzell. Springer {{isbn|0-387-98362-7}}
- "Large Deviations for Two Dimensional Navier-Stokes Equation with Multiplicative Noise", S. S. Sritharan and P. Sundar, Stochastic Processes and Their Applications, Vol. 116 (2006) 1636–1659.[http://www.nps.edu/Academics/Schools/GSEAS/SRI/R37.pdf]
- "Large Deviations for the Stochastic Shell Model of Turbulence", U. Manna, S. S. Sritharan and P. Sundar, NoDEA Nonlinear Differential Equations Appl. 16 (2009), no. 4, 493–521.[http://www.nps.edu/Academics/Schools/GSEAS/SRI/R41.pdf]
{{DEFAULTSORT:Large Deviations Theory}}