CUSUM

{{Infobox control chart

| name = CUSUM chart

| proposer = E. S. Page

| subgroupsize = n = 1

| measurementtype = Cumulative sum of a quality characteristic

| qualitycharacteristictype = Variables data

| distribution = Normal distribution

| sizeofshift = ≤ 1.5σ

| arl =

| varchart =

| varcenter =

| varupperlimit =

| varlowerlimit =

| varstatistic =

| meanchart =

| meancenter = The target value, T, of the quality characteristic

| meanupperlimit = $C_i^+ = \max \lbrack 0, x_i - \left ( T + K \right ) + C_{i - 1}^+\rbrack$

| meanlowerlimit = $C_i^- = \max \lbrack 0, \left ( T - K \right ) - x_i + C_{i - 1}^-\rbrack$

| meanstatistic = $C_i = \sum_{j=1}^i \bar x_j - T$

}}

In statistical quality control, the CUSUM (or cumulative sum control chart) is a sequential analysis technique developed by E. S. Page of the University of Cambridge. It is typically used for monitoring change detection.{{cite journal

|title=The Use of Risk-Adjusted CUSUM and RSPRT Charts for Monitoring in Medical Contexts

|author=Grigg

|journal=Statistical Methods in Medical Research

|volume=12

|pages=147–170

|year=2003

|doi=10.1177/096228020301200205

|pmid=12665208

|last2=Farewell

|first2=VT

|last3=Spiegelhalter

|first3=DJ

|issue=2

|display-authors=etal}}

CUSUM was announced in Biometrika, in 1954, a few years after the publication of Wald's sequential probability ratio test (SPRT).{{cite journal

|first=E. S.

|last=Page

|title=Continuous Inspection Scheme

|journal=Biometrika

|volume=41

|issue=1/2

|date=June 1954

|pages=100–115

|jstor=2333009

|doi=10.1093/biomet/41.1-2.100|hdl=10338.dmlcz/135207

|hdl-access=free

}}

E. S. Page referred to a "quality number" $\theta$ , by which he meant a parameter of the probability distribution; for example, the mean. He devised CUSUM as a method to determine changes in it, and proposed a criterion for deciding when to take corrective action. When the CUSUM method is applied to changes in mean, it can be used for step detection of a time series.

A few years later, George Alfred Barnard developed a visualization method, the V-mask chart, to detect both increases and decreases in $\theta$ .{{cite journal

|first=G.A.

|last=Barnard

|authorlink=George Alfred Barnard

|title=Control charts and stochastic processes

|journal=Journal of the Royal Statistical Society

|volume=B (Methodological)

|issue=21, number 2

|year=1959

|pages=239–71

|jstor=2983801

}}

Method

As its name implies, CUSUM involves the calculation of a cumulative sum (which is what makes it "sequential"). Samples from a process $x_n$ are assigned weights $\omega_n$ , and summed as follows:

: $S_0=0$

: $S_{n+1}=\max(0, S_n+x_{n+1}-\omega_n)$

When the value of S exceeds a certain threshold value, a change in value has been found. The above formula only detects changes in the positive direction. When negative changes need to be found as well, the min operation should be used instead of the max operation, and this time a change has been found when the value of S is below the (negative) value of the threshold value.

Page did not explicitly say that $\omega$ represents the likelihood function, but this is common usage.

This differs from SPRT by always using zero function as the lower "holding barrier" rather than an actual lower "holding barrier". Also, CUSUM does not require the use of the likelihood function.

As a means of assessing CUSUM's performance, Page defined the average run length (A.R.L.) metric; "the expected number of articles sampled before action is taken." He further wrote:

When the quality of the output is satisfactory the A.R.L. is a measure of the expense incurred by the scheme when it gives false alarms, i.e., Type I errors (Neyman & Pearson, 1936{{cite journal

|title=Sufficient statistics and uniformly most powerful tests of statistical hypotheses

|journal=Statistical Research Memoirs

|volume=I

|pages=113–137

}}). On the other hand, for constant poor quality the A.R.L. measures the delay and thus the amount of scrap produced before the rectifying action is taken, i.e., Type II errors.

Example

The following example shows 20 observations $X$ of a process with a mean of 0 and a standard deviation of 0.5.

From the $Z$ column, it can be seen that $X$ never deviates by 3 standard deviations ( $3 \sigma$ ), so simply alerting on a high deviation will not detect a failure, whereas CUSUM shows that the $S_H$ value exceeds 4 at the 17th observation.

Column	Description
$X$	The observations of the process with an expected mean $\bar{x}$ of 0 and an expected standard deviation $\sigma_X$ of 0.5
$Z$	The normalized observations, i.e. centered around the mean and scaled by the standard deviation $Z_n = \frac{X_n - \bar{x}}{\sigma_X}$
$S_H$	The high CUSUM value, detecting a positive anomaly, ${S_H}_{n+1} = \max(0, {S_H}_n + Z_{n+1} - \omega)$
$S_L$	The low CUSUM value, detecting a negative anomaly, ${S_L}_{n+1} = \max(0, {S_L}_n - Z_{n+1} - \omega)$

where $\omega$ is a critical level parameter (tunable, same as threshold T) that's used to adjust the sensitivity of change detection: larger $\omega$ makes CUSUM less sensitive to the change and vice versa.

Comments

Performance

Variants

Cumulative observed-minus-expected plots are a related method.

References

External links

[http://www.itl.nist.gov/div898/handbook/pmc/section3/pmc323.htm "Engineering Statistics Handbook - Cusum Control Charts" ]

Category:Statistical charts and diagrams

Category:Quality control tools

Category:Sequential methods