Uniformly most powerful test
{{Short description|Theoretically optimal hypothesis test}}
In statistical hypothesis testing, a uniformly most powerful (UMP) test is a hypothesis test which has the greatest power among all possible tests of a given size α. For example, according to the Neyman–Pearson lemma, the likelihood-ratio test is UMP for testing simple (point) hypotheses.
Setting
Let denote a random vector (corresponding to the measurements), taken from a parametrized family of probability density functions or probability mass functions , which depends on the unknown deterministic parameter . The parameter space is partitioned into two disjoint sets and . Let denote the hypothesis that , and let denote the hypothesis that .
The binary test of hypotheses is performed using a test function with a reject region (a subset of measurement space).
:
\begin{cases}
1 & \text{if } x \in R \\
0 & \text{if } x \in R^c
\end{cases}
meaning that is in force if the measurement and that is in force if the measurement .
Note that is a disjoint covering of the measurement space.
Formal definition
A test function is UMP of size if for any other test function satisfying
:
we have
:
The Karlin–Rubin theorem
The Karlin–Rubin theorem can be regarded as an extension of the Neyman–Pearson lemma for composite hypotheses.Casella, G.; Berger, R.L. (2008), Statistical Inference, Brooks/Cole. {{ISBN|0-495-39187-5}} (Theorem 8.3.17) Consider a scalar measurement having a probability density function parameterized by a scalar parameter θ, and define the likelihood ratio .
If is monotone non-decreasing, in , for any pair (meaning that the greater is, the more likely is), then the threshold test:
:
\begin{cases}
1 & \text{if } x > x_0 \\
0 & \text{if } x < x_0
\end{cases}
:where is chosen such that
is the UMP test of size α for testing
Note that exactly the same test is also UMP for testing
Important case: exponential family
Although the Karlin-Rubin theorem may seem weak because of its restriction to scalar parameter and scalar measurement, it turns out that there exist a host of problems for which the theorem holds. In particular, the one-dimensional exponential family of probability density functions or probability mass functions with
:
has a monotone non-decreasing likelihood ratio in the sufficient statistic , provided that is non-decreasing.
Example
Let denote i.i.d. normally distributed -dimensional random vectors with mean and covariance matrix . We then have
:
f_\theta (X) = {} & (2 \pi)^{-MN/2} |R|^{-M/2} \exp \left\{-\frac 1 2 \sum_{n=0}^{M-1} (X_n - \theta m)^T R^{-1}(X_n - \theta m) \right\} \\[4pt]
= {} & (2 \pi)^{-MN/2} |R|^{-M/2} \exp \left\{-\frac 1 2 \sum_{n=0}^{M-1} \left (\theta^2 m^T R^{-1} m \right ) \right\} \\[4pt]
& \exp \left\{-\frac 1 2 \sum_{n=0}^{M-1} X_n^T R^{-1} X_n \right\} \exp \left\{\theta m^T R^{-1} \sum_{n=0}^{M-1}X_n \right\}
\end{align}
which is exactly in the form of the exponential family shown in the previous section, with the sufficient statistic being
:
Thus, we conclude that the test
:
is the UMP test of size for testing vs.
Further discussion
In general, UMP tests do not exist for vector parameters or for two-sided tests (a test in which one hypothesis lies on both sides of the alternative). The reason is that in these situations, the most powerful test of a given size for one possible value of the parameter (e.g. for where ) is different from the most powerful test of the same size for a different value of the parameter (e.g. for where ). As a result, no test is uniformly most powerful in these situations.
{{More footnotes|date=November 2010}}
References
{{reflist}}
Further reading
- {{cite book|last1=Ferguson|first1=T. S.|author-link= Thomas S. Ferguson |title=Mathematical Statistics: A decision theoretic approach|publisher=Academic Press|place=New York|section=Sec. 5.2: Uniformly most powerful tests|year=1967}}
- {{cite book|last1=Mood|first1=A. M.|last2=Graybill|first2=F. A.|last3=Boes|first3=D. C.|title=Introduction to the theory of statistics|edition=3rd|publisher=McGraw-Hill|place=New York|section=Sec. IX.3.2: Uniformly most powerful tests|year=1974}}
- L. L. Scharf, Statistical Signal Processing, Addison-Wesley, 1991, section 4.7.
{{Statistics|inference}}
{{DEFAULTSORT:Uniformly Most Powerful Test}}