Normal-inverse-Wishart distribution
{{Short description|Multivariate parameter family of continuous probability distributions}}
{{Probability distribution |
name =normal-inverse-Wishart|
type =density|
pdf_image =|
cdf_image =|
notation =|
parameters = location (vector of real)
(real)
inverse scale matrix (pos. def.)
(real)|
support = covariance matrix (pos. def.)|
pdf =|
cdf =|
mean =|
median =|
mode =|
variance =|
skewness =|
kurtosis =|
entropy =|
mgf =|
char =|
}}
In probability theory and statistics, the normal-inverse-Wishart distribution (or Gaussian-inverse-Wishart distribution) is a multivariate four-parameter family of continuous probability distributions. It is the conjugate prior of a multivariate normal distribution with an unknown mean and covariance matrix (the inverse of the precision matrix).Murphy, Kevin P. (2007). "Conjugate Bayesian analysis of the Gaussian distribution." [http://www.cs.ubc.ca/~murphyk/Papers/bayesGauss.pdf]
Definition
Suppose
:
has a multivariate normal distribution with mean and covariance matrix , where
:
has an inverse Wishart distribution. Then
has a normal-inverse-Wishart distribution, denoted as
:
Characterization
=Probability density function=
:
The full version of the PDF is as follows:Simon J.D. Prince(June 2012). [http://www.computervisionmodels.com/ Computer Vision: Models, Learning, and Inference]. Cambridge University Press. 3.8: "Normal inverse Wishart distribution".
\boldsymbol{\mu}_0,\lambda,\boldsymbol{\Psi},\nu )
=\frac{\lambda^{D/2}|\boldsymbol{\Psi}|^{\nu /
2}|\boldsymbol{\Sigma}|^{-\frac{\nu + D + 2}{2}}}{(2
\pi)^{D/2}2^{\frac{\nu
D}{2}}\Gamma_D(\frac{\nu}{2})}\text{exp}\left\{
-\frac{1}{2}Tr(\boldsymbol{\Psi
\Sigma}^{-1})-\frac{\lambda}{2}(\boldsymbol{\mu}-\boldsymbol{\mu}_0)^T\boldsymbol{\Sigma}^{-1}(\boldsymbol{\mu}
- \boldsymbol{\mu}_0) \right\}
Here is the multivariate gamma function and is the Trace of the given matrix.
Properties
=Scaling=
=Marginal distributions=
By construction, the marginal distribution over is an inverse Wishart distribution, and the conditional distribution over given is a multivariate normal distribution. The marginal distribution over is a multivariate t-distribution.
Posterior distribution of the parameters
Suppose the sampling density is a multivariate normal distribution
:
where is an matrix and (of length ) is row of the matrix .
With the mean and covariance matrix of the sampling distribution is unknown, we can place a Normal-Inverse-Wishart prior on the mean and covariance parameters jointly
:
(\boldsymbol\mu,\boldsymbol\Sigma) \sim \mathrm{NIW}(\boldsymbol\mu_0,\lambda,\boldsymbol\Psi,\nu).
The resulting posterior distribution for the mean and covariance matrix will also be a Normal-Inverse-Wishart
:
(\boldsymbol\mu,\boldsymbol\Sigma|y) \sim \mathrm{NIW}(\boldsymbol\mu_n,\lambda_n,\boldsymbol\Psi_n,\nu_n),
where
:
\boldsymbol\mu_n = \frac{\lambda\boldsymbol\mu_0 + n \bar{\boldsymbol y}}{\lambda+n}
:
\lambda_n = \lambda + n
:
\nu_n = \nu + n
:
\boldsymbol\Psi_n = \boldsymbol{\Psi + S} +\frac{\lambda n}{\lambda+n}
(\boldsymbol{\bar{y}-\mu_0})(\boldsymbol{\bar{y}-\mu_0})^T
~~~\mathrm{ with }~~\boldsymbol{S}= \sum_{i=1}^{n} (\boldsymbol{y_i-\bar{y}})(\boldsymbol{y_i-\bar{y}})^T
.
To sample from the joint posterior of , one simply draws samples from , then draw . To draw from the posterior predictive of a new observation, draw , given the already drawn values of and .Gelman, Andrew, et al. Bayesian data analysis. Vol. 2, p.73. Boca Raton, FL, USA: Chapman & Hall/CRC, 2014.
Generating normal-inverse-Wishart random variates
Generation of random variates is straightforward:
- Sample from an inverse Wishart distribution with parameters and
- Sample from a multivariate normal distribution with mean and variance
Related distributions
- The normal-Wishart distribution is essentially the same distribution parameterized by precision rather than variance. If then .
- The normal-inverse-gamma distribution is the one-dimensional equivalent.
- The multivariate normal distribution and inverse Wishart distribution are the component distributions out of which this distribution is made.
Notes
{{reflist}}
References
- Bishop, Christopher M. (2006). Pattern Recognition and Machine Learning. Springer Science+Business Media.
- Murphy, Kevin P. (2007). "Conjugate Bayesian analysis of the Gaussian distribution." [http://www.cs.ubc.ca/~murphyk/Papers/bayesGauss.pdf]
{{ProbDistributions|multivariate}}
Category:Multivariate continuous distributions