von Mises distribution

{{short description|Probability distribution on the circle}}

{{lowercase}}

{{Probability distribution

|name =von Mises

|type =density

|pdf_image =File:VonMises distribution PDF.png
The support is chosen to be [−{{pi}},{{pi}}] with μ = 0

|cdf_image =File:VonMises distribution CDF.png
The support is chosen to be [−{{pi}},{{pi}}] with μ = 0

|parameters =\mu real
\kappa>0

|support =x\in any interval of length 2π

|pdf =\frac{e^{\kappa\cos(x-\mu)}}{2\pi I_0(\kappa)}

|cdf =(not analytic – see text)

|mean =\mu

|median =\mu

|mode =\mu

|variance =\textrm{var}(x)=1-I_1(\kappa)/I_0(\kappa) (circular)

|skewness =

|kurtosis =

|entropy =-\kappa\frac{I_1(\kappa)}{I_0(\kappa)}+\ln[2\pi I_0(\kappa)] (differential)

|mgf =

|char =\frac{I_{|t|}(\kappa)}{I_0(\kappa)}e^{it\mu}

}}

In probability theory and directional statistics, the von Mises distribution (also known as the circular normal distribution or the Tikhonov distribution) is a continuous probability distribution on the circle. It is a close approximation to the wrapped normal distribution, which is the circular analogue of the normal distribution. A freely diffusing angle \theta on a circle is a wrapped normally distributed random variable with an unwrapped variance that grows linearly in time. On the other hand, the von Mises distribution is the stationary distribution of a drift and diffusion process on the circle in a harmonic potential, i.e. with a preferred orientation.{{cite book |title=The Fokker–Planck Equation |last=Risken |first=H. |year=1989|publisher=Springer |isbn=978-3-540-61530-9 }} The von Mises distribution is the maximum entropy distribution for circular data when the real and imaginary parts of the first circular moment are specified. The von Mises distribution is a special case of the von Mises–Fisher distribution on the N-dimensional sphere.

Definition

The von Mises probability density function for the angle x is given by:{{cite book |title=Directional Statistics |last=Mardia |first=Kantilal |author-link=Kantilal Mardia |author2=Jupp, Peter E. |year=1999|publisher=Wiley |isbn=978-0-471-95333-3 }}

:f(x\mid\mu,\kappa)=\frac{\exp(\kappa\cos(x-\mu))}{2\pi I_0(\kappa)}

where I_0(\kappa) is the modified Bessel function of the first kind of order 0, with this scaling constant chosen so that the distribution integrates to unity: \int_{-\pi}^\pi \exp(\kappa\cos x)\,dx = 2\pi I_0(\kappa).
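
For illustration, the density is straightforward to evaluate numerically. A minimal sketch in Python using SciPy (not part of the cited sources; the parameter values are arbitrary):

<syntaxhighlight lang="python">
import numpy as np
from scipy.integrate import quad
from scipy.special import i0  # modified Bessel function of the first kind, order 0

def vonmises_pdf(x, mu, kappa):
    """von Mises density f(x | mu, kappa) on any interval of length 2*pi."""
    return np.exp(kappa * np.cos(x - mu)) / (2 * np.pi * i0(kappa))

# Sanity check: the density integrates to unity over [-pi, pi].
print(quad(lambda t: vonmises_pdf(t, 0.0, 2.0), -np.pi, np.pi)[0])  # ~1.0
</syntaxhighlight>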

The parameters μ and 1/\kappa are analogous to μ and σ{{i sup|2}} (the mean and variance) in the normal distribution:

  • μ is a measure of location (the distribution is clustered around μ), and
  • \kappa is a measure of concentration (a reciprocal measure of dispersion, so 1/\kappa is analogous to σ{{i sup|2}}).
  • If \kappa is zero, the distribution is uniform, and for small \kappa, it is close to uniform.
  • If \kappa is large, the distribution becomes very concentrated about the angle μ with \kappa being a measure of the concentration. In fact, as \kappa increases, the distribution approaches a normal distribution in x with mean μ and variance 1/\kappa.

The probability density can be expressed as a series of Bessel functions (see Abramowitz and Stegun [http://www.math.sfu.ca/~cbm/aands/page_376.htm §9.6.34]):

: f(x\mid\mu,\kappa) = \frac{1}{2\pi}\left(1+\frac{2}{I_0(\kappa)} \sum_{j=1}^\infty I_j(\kappa) \cos[j(x-\mu)]\right)

where I_j(x) is the modified Bessel function of order j.

The cumulative distribution function is not analytic and is best found by integrating the above series. The indefinite integral of the probability density is:

:\Phi(x\mid\mu,\kappa)=\int f(t\mid\mu,\kappa)\,dt =\frac{1}{2\pi}\left(x + \frac{2}{I_0(\kappa)} \sum_{j=1}^\infty I_j(\kappa) \frac{\sin[j(x-\mu)]}{j}\right).

The cumulative distribution function will be a function of the lower limit of integration x_0:

:F(x\mid\mu,\kappa)=\Phi(x\mid\mu,\kappa)-\Phi(x_0\mid\mu,\kappa).\,
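
As a numerical sketch (Python with SciPy; the truncation depth terms is an arbitrary choice), the CDF can be approximated by truncating this series:

<syntaxhighlight lang="python">
import numpy as np
from scipy.special import i0, iv  # iv(j, kappa) is the modified Bessel function I_j

def vonmises_cdf(x, mu=0.0, kappa=2.0, x0=-np.pi, terms=50):
    """F(x | mu, kappa) with lower integration limit x0, via the truncated series."""
    j = np.arange(1, terms + 1)
    def Phi(t):
        series = np.sum(iv(j, kappa) * np.sin(j * (t - mu)) / j)
        return (t + 2.0 * series / i0(kappa)) / (2 * np.pi)
    return Phi(x) - Phi(x0)

print(vonmises_cdf(np.pi))  # ~1.0 over the full interval [-pi, pi]
print(vonmises_cdf(0.0))    # ~0.5 by symmetry when mu = 0
</syntaxhighlight>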

Moments

{{further|Circular mean}}

The moments of the von Mises distribution are usually calculated as the moments of the complex exponential z = e{{i sup|ix}} rather than the angle x itself. These moments are referred to as circular moments. The variance calculated from these moments is referred to as the circular variance. The one exception to this is that the "mean" usually refers to the argument of the complex mean.

The nth raw moment of z is:

:m_n=\langle z^n\rangle=\int_\Gamma z^n\,f(x|\mu,\kappa)\,dx

:= \frac{I_{|n|}(\kappa)}{I_0(\kappa)}e^{in\mu}

where the integral is over any interval \Gamma of length 2π. In calculating the above integral, we use the fact that z{{i sup|n}} = cos(nx) + i sin(nx) and the Bessel function identity (see Abramowitz and Stegun [http://www.math.sfu.ca/~cbm/aands/page_376.htm §9.6.19]):

:I_n(\kappa)=\frac{1}{\pi}\int_0^\pi e^{\kappa\cos(x)}\cos(nx)\,dx.

The mean of the complex exponential z  is then just

:m_1= \frac{I_1(\kappa)}{I_0(\kappa)}e^{i\mu}

and the circular mean value of the angle x is then taken to be the argument μ. This is the expected or preferred direction of the angular random variables. The circular variance of x is:

:V = 1 - |E[e^{ix}]| = 1 - \frac{I_1(\kappa)}{I_0(\kappa)}.
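
These identities admit a quick numerical check; a sketch in Python with SciPy (the values of \mu, \kappa and n are arbitrary):

<syntaxhighlight lang="python">
import numpy as np
from scipy.integrate import quad
from scipy.special import i0, iv

mu, kappa, n = 0.5, 2.0, 1

def f(x):
    return np.exp(kappa * np.cos(x - mu)) / (2 * np.pi * i0(kappa))

# n-th raw circular moment m_n = <exp(i*n*x)> by direct quadrature.
re = quad(lambda x: np.cos(n * x) * f(x), -np.pi, np.pi)[0]
im = quad(lambda x: np.sin(n * x) * f(x), -np.pi, np.pi)[0]

print(re + 1j * im)                                    # numerical m_n
print(iv(n, kappa) / i0(kappa) * np.exp(1j * n * mu))  # closed form I_n/I_0 * e^{i n mu}
print(1 - iv(1, kappa) / i0(kappa))                    # circular variance
</syntaxhighlight>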

Generation of von Mises variates

A notable advancement in generating Tikhonov (or von Mises) random variates was introduced by de Abreu in 2008.{{Cite journal |last=de Abreu |first=Giuseppe Thadeu Freitas |title=On the Generation of Tikhonov Variates |journal=IEEE Transactions on Communications |volume=56 |issue=7 |year=2008 |pages=1157–1168 |doi=10.1109/TCOMM.2008.060510}} This method, termed the "random mixture" (RM) technique, offers a simple and efficient alternative to traditional approaches such as the accept-reject (AR) algorithm, which often suffer from inefficiency due to sample rejection and computational complexity. The RM method generates Tikhonov variates by randomly selecting samples from a predefined set of Cauchy and Gaussian generators, followed by a straightforward transformation. Specifically, it uses a bank of K distinct generators (e.g., one Cauchy and two Gaussian processes), with mixture probabilities derived from the characteristic functions of the Cauchy, Gaussian, and Tikhonov distributions, all of which are available in closed form.

The technique leverages the circular moment-determinance property of the Tikhonov distribution, where the distribution is uniquely defined by its circular moments. By ensuring that the first N dominant circular moments of the generated variates closely match the theoretical Tikhonov moments, the method achieves high accuracy. The mixture probabilities and parameters (e.g., variance for Gaussian and half-width for Cauchy) can be computed using either least squares (LS) optimization or a simpler Moore–Penrose pseudo-inverse approach, with the latter offering a practical trade-off between complexity and precision. Unlike AR methods, the RM technique consumes only one pair of uniform random numbers per Tikhonov sample, regardless of the concentration parameter \kappa (denoted \alpha in the original paper), and avoids sample rejection or repetitive evaluation of complex functions.{{Cite AV media

| title = On the Generation of Tikhonov Random Variates

| url = https://www.youtube.com/watch?v=7irV9Yng2mM

| publisher = The Wireless Channel

| date = 2025-03-19

| access-date = 2025-03-21

| medium = Video

| website = YouTube

}}
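
The RM algorithm itself is specified in the cited paper. As a practical point of reference, off-the-shelf generators are also widely available; a minimal sketch in Python using scipy.stats.vonmises (which relies on standard rejection-based sampling, not the RM method):

<syntaxhighlight lang="python">
import numpy as np
from scipy.stats import vonmises

mu, kappa = 0.5, 2.0
samples = vonmises.rvs(kappa, loc=mu, size=100_000)

# Check the first circular moment against its theoretical value
# I_1(kappa)/I_0(kappa) * exp(i*mu).
z_bar = np.mean(np.exp(1j * samples))
print(np.angle(z_bar))  # ~mu
print(abs(z_bar))       # ~I_1(kappa)/I_0(kappa), about 0.698 for kappa = 2
</syntaxhighlight>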

Limiting behavior

When \kappa is large, the distribution resembles a normal distribution.{{cite book |last1=Mardia |first1=K. V. |last2=Jupp |first2=P. E. |year=2000 |title=Directional Statistics |series=Wiley Series in Probability and Statistics |location=Chichester |publisher=John Wiley & Sons |isbn=978-0-471-95333-3 |page=36}} More specifically, for large positive real numbers \kappa,

: f(x\mid\mu,\kappa) \approx \frac 1 {\sigma\sqrt{2\pi}} \exp\left[\dfrac{-(x-\mu)^2}{2\sigma^2}\right]

where σ{{i sup|2}} = 1/\kappa and the difference between the left-hand side and the right-hand side of the approximation converges uniformly to zero as \kappa goes to infinity. Also, when \kappa is small, the probability density function resembles a uniform distribution:

:\lim_{\kappa\rightarrow 0}f(x\mid\mu,\kappa)=\mathrm{U}(x)

where the interval for the uniform distribution \mathrm{U}(x) is the chosen interval of length 2\pi (i.e. \mathrm{U}(x) = 1/(2\pi) when x is in the interval and \mathrm{U}(x)=0 when x is not in the interval).
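
A short numerical illustration of the large-\kappa limit (a sketch in Python with SciPy; \kappa = 100 is an arbitrary choice):

<syntaxhighlight lang="python">
import numpy as np
from scipy.special import i0
from scipy.stats import norm

mu, kappa = 0.0, 100.0
x = np.linspace(-0.5, 0.5, 1001)

vm = np.exp(kappa * np.cos(x - mu)) / (2 * np.pi * i0(kappa))
gauss = norm.pdf(x, loc=mu, scale=1.0 / np.sqrt(kappa))  # sigma^2 = 1/kappa

print(np.max(np.abs(vm - gauss)))  # small, and shrinks as kappa grows
</syntaxhighlight>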

Estimation of parameters

A series of N measurements z_n=e^{i\theta_n} drawn from a von Mises distribution may be used to estimate certain parameters of the distribution.{{cite book |title=Statistics of earth science data : their distribution in time, space, and orientation |last=Borradaile |first=G. J. |year=2003 |publisher=Springer |isbn=978-3-662-05223-5 }} The average of the series \overline{z} is defined as

:\overline{z}=\frac{1}{N}\sum_{n=1}^N z_n

and its expectation value will be just the first moment:

:\langle\overline{z}\rangle=\frac{I_1(\kappa)}{I_0(\kappa)}e^{i\mu}.

In other words, \overline{z} is an unbiased estimator of the first moment. If we assume that the mean \mu lies in the interval [-\pi,\pi], then Arg(\overline{z}) will be a (biased) estimator of the mean \mu.

Viewing the z_n as a set of vectors in the complex plane, the \bar{R}^2 statistic is the square of the length of the averaged vector:

:\bar{R}^2=\overline{z}\,\overline{z^*}=\left(\frac{1}{N}\sum_{n=1}^N \cos\theta_n\right)^2+\left(\frac{1}{N}\sum_{n=1}^N \sin\theta_n\right)^2

and its expectation value is {{cite journal |last=Kutil |first=Rade |title=Biased and unbiased estimation of the circular mean resultant length and its variance.|date=August 2012 |url=https://www.researchgate.net/publication/233474043 |journal=Statistics: A Journal of Theoretical and Applied Statistics |volume= 46 |issue=4 |pages=549–561 |doi=10.1080/02331888.2010.543463 |s2cid=7045090 |citeseerx=10.1.1.302.8395 }}

:\langle \bar{R}^2\rangle=\frac{1}{N}+\frac{N-1}{N}\,\frac{I_1(\kappa)^2}{I_0(\kappa)^2}.

In other words, the statistic

:R_e^2=\frac{N}{N-1}\left(\bar{R}^2-\frac{1}{N}\right)

will be an unbiased estimator of \frac{I_1(\kappa)^2}{I_0(\kappa)^2}, and solving the equation R_e=\frac{I_1(\kappa)}{I_0(\kappa)} for \kappa will yield a (biased) estimator of \kappa. In analogy to the linear case, the solution to the equation \bar{R}=\frac{I_1(\kappa)}{I_0(\kappa)} will yield the maximum likelihood estimate of \kappa, and both will be equal in the limit of large N. For an approximate solution for \kappa, refer to the von Mises–Fisher distribution.
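
A minimal estimation sketch in Python with SciPy (synthetic data; the root-finding bracket is an arbitrary but safe choice, and the exponentially scaled Bessel functions i0e, i1e avoid overflow for large \kappa):

<syntaxhighlight lang="python">
import numpy as np
from scipy.optimize import brentq
from scipy.special import i0e, i1e  # exponentially scaled I_0, I_1
from scipy.stats import vonmises

theta = vonmises.rvs(2.0, loc=0.5, size=10_000)  # synthetic sample: mu = 0.5, kappa = 2

z_bar = np.mean(np.exp(1j * theta))
mu_hat = np.angle(z_bar)  # (biased) estimator of the mean direction mu
R_bar = abs(z_bar)        # mean resultant length

# Maximum likelihood: solve I_1(kappa)/I_0(kappa) = R_bar for kappa.
kappa_hat = brentq(lambda k: i1e(k) / i0e(k) - R_bar, 1e-8, 1e4)

print(mu_hat, kappa_hat)  # ~0.5, ~2.0
</syntaxhighlight>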

Distribution of the mean

The distribution of the sample mean \overline{z} = \bar{R}e^{i\overline{\theta}} for the von Mises distribution is given by:{{cite book |title=Topics in Circular Statistics |last=Jammalamadaka |first=S. Rao |author2=Sengupta, A. |year=2001 |publisher=World Scientific Publishing Company |isbn=978-981-02-3778-3 }}

:P(\bar{R},\bar{\theta})\,d\bar{R}\,d\bar{\theta}=\frac{1}{(2\pi I_0(\kappa))^N}\int_\Gamma \prod_{n=1}^N \left( e^{\kappa\cos(\theta_n-\mu)}\, d\theta_n\right) = \frac{e^{\kappa N\bar{R}\cos(\bar{\theta}-\mu)}}{I_0(\kappa)^N}\left(\frac{1}{(2\pi)^N}\int_\Gamma \prod_{n=1}^N d\theta_n\right)

where N is the number of measurements and \Gamma\, consists of intervals of 2\pi in the variables, subject to the constraint that \bar{R} and \bar{\theta} are constant, where \bar{R} is the mean resultant:

:\bar{R}^2=|\bar{z}|^2= \left(\frac{1}{N}\sum_{n=1}^N \cos(\theta_n) \right)^2 + \left(\frac{1}{N}\sum_{n=1}^N \sin(\theta_n) \right)^2

and \overline{\theta} is the mean angle:

:\overline{\theta}=\mathrm{Arg}(\overline{z}).

Note that the product term in parentheses is just the distribution of the mean for a circular uniform distribution.

This means that the distribution of the mean direction \mu of a von Mises distribution VM(\mu, \kappa) is a von Mises distribution VM(\mu, \bar{R}N\kappa) or, equivalently, VM(\mu, R\kappa), where R=N\bar{R} is the length of the resultant vector.
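
This conditional statement can be checked by simulation; a Monte Carlo sketch in Python with SciPy (the sample sizes and the conditioning band on \bar{R} are arbitrary choices):

<syntaxhighlight lang="python">
import numpy as np
from scipy.special import i0
from scipy.stats import vonmises

mu, kappa, N, trials = 0.0, 2.0, 20, 200_000
theta = vonmises.rvs(kappa, loc=mu, size=(trials, N))

z_bar = np.mean(np.exp(1j * theta), axis=1)
theta_bar, R_bar = np.angle(z_bar), np.abs(z_bar)

# Condition on a narrow band around R_bar = 0.7 and compare the histogram of
# theta_bar with the density of VM(mu, N * R_bar * kappa).
R0 = 0.7
sel = np.abs(R_bar - R0) < 0.01
hist, edges = np.histogram(theta_bar[sel], bins=60,
                           range=(-np.pi, np.pi), density=True)
centers = 0.5 * (edges[:-1] + edges[1:])
pred = np.exp(N * R0 * kappa * np.cos(centers - mu)) / (2 * np.pi * i0(N * R0 * kappa))

print(np.max(np.abs(hist - pred)))  # small, up to Monte Carlo noise
</syntaxhighlight>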

Entropy

By definition, the information entropy of the von Mises distribution is

:H = -\int_\Gamma f(\theta;\mu,\kappa)\,\ln(f(\theta;\mu,\kappa))\,d\theta\,

where \Gamma is any interval of length 2\pi. Taking \mu=0 without loss of generality, the logarithm of the density of the von Mises distribution is straightforward:

:\ln(f(\theta;\mu,\kappa))=-\ln(2\pi I_0(\kappa))+ \kappa \cos(\theta)\,

The characteristic function representation for the von Mises distribution is:

:f(\theta;\mu,\kappa) =\frac{1}{2\pi}\left(1+2\sum_{n=1}^\infty\phi_n\cos(n\theta)\right)

where \phi_n= I_{|n|}(\kappa)/I_0(\kappa). Substituting these expressions into the entropy integral, exchanging the order of integration and summation, and using the orthogonality of the cosines, the entropy may be written:

:H = \ln(2\pi I_0(\kappa))-\kappa\phi_1 = \ln(2\pi I_0(\kappa))-\kappa\frac{I_1(\kappa)}{I_0(\kappa)}

For \kappa=0, the von Mises distribution becomes the circular uniform distribution and the entropy attains its maximum value of \ln(2\pi).
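
A numerical check of this closed form (a sketch in Python with SciPy; \kappa = 2 is arbitrary):

<syntaxhighlight lang="python">
import numpy as np
from scipy.integrate import quad
from scipy.special import i0, i1

kappa = 2.0  # arbitrary; mu = 0 without loss of generality

def f(x):
    return np.exp(kappa * np.cos(x)) / (2 * np.pi * i0(kappa))

H_numeric = quad(lambda x: -f(x) * np.log(f(x)), -np.pi, np.pi)[0]
H_closed = np.log(2 * np.pi * i0(kappa)) - kappa * i1(kappa) / i0(kappa)

print(H_numeric, H_closed)  # should agree
print(np.log(2 * np.pi))    # the maximum, attained at kappa = 0
</syntaxhighlight>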

Notice that the von Mises distribution maximizes the entropy when the real and imaginary parts of the first circular moment are specified{{cite book |title=Topics in circular statistics |last=Jammalamadaka |first=S. Rao |author2=SenGupta, A. |year=2001 |publisher=World Scientific |location=New Jersey |isbn=981-02-3778-2 |url=https://books.google.com/books?id=sKqWMGqQXQkC&q=Jammalamadaka+Topics+in+circular |access-date=2011-05-15}} or, equivalently, when the circular mean and circular variance are specified.

See also

References

Works cited

{{ProbDistributions|directional}}

{{DEFAULTSORT:Von Mises Distribution}}

Category:Continuous distributions

Category:Directional statistics

Category:Exponential family distributions