Separation principle

In control theory, a separation principle, more formally known as a principle of separation of estimation and control, states that under some assumptions the problem of designing an optimal feedback controller for a stochastic system can be solved by designing an optimal observer for the state of the system, which feeds into an optimal deterministic controller for the system. Thus the problem can be broken into two separate parts, which facilitates the design.

The first instance of such a principle is in the setting of deterministic linear systems, namely that if a stable observer and a stable state feedback are designed for a linear time-invariant system (LTI system hereafter), then the combined observer and feedback is stable. The separation principle does not hold in general for nonlinear systems.

Another instance of the separation principle arises in the setting of linear stochastic systems, namely that state estimation (possibly nonlinear) together with an optimal state feedback controller designed to minimize a quadratic cost, is optimal for the stochastic control problem with output measurements. When process and observation noise are Gaussian, the optimal solution separates into a Kalman filter and a linear-quadratic regulator. This is known as linear-quadratic-Gaussian control. More generally, under suitable conditions and when the noise is a martingale (with possible jumps), again a separation principle applies and is known as the separation principle in stochastic control.{{cite book |author=Karl Johan Astrom |title=Introduction to Stochastic Control Theory |publisher=Academic Press |volume=58 |year=1970 |isbn=0-486-44531-3}}{{cite journal |author=Tyrone Duncan and Pravin Varaiya |title=On the solutions of a stochastic control system |journal=SIAM J. Control |volume=9 |issue=3 |pages=354–371 |year=1971|doi=10.1137/0309026 |hdl=1808/16692 |hdl-access=free }}{{cite journal |author=M.H.A. Davis and P. Varaiya |title=Information states for stochastic systems |journal=J. Math. Anal. Applications |volume=37 |pages=384–402 |year=1972|doi=10.1016/0022-247X(72)90281-8 |doi-access=free }}{{cite journal |author=Anders Lindquist|title=On Feedback Control of Linear Stochastic Systems |journal=SIAM Journal on Control |volume=11 |pages=323–343 |year=1973|issue=2 |doi=10.1137/0311025 }}{{cite book |author=A. Bensoussan |title=Stochastic Control of Partially Observable Systems |publisher=Cambridge University Press |year=1992}}{{cite journal |author=Tryphon T. Georgiou and Anders Lindquist |title=The Separation Principle in Stochastic Control, Redux |journal=IEEE Transactions on Automatic Control |volume=58 |issue=10 |pages=2481–2494 |year=2013 |doi=10.1109/TAC.2013.2259207|arxiv=1103.3005 |s2cid=12623187 }}

The separation principle also holds for high gain observers used for state estimation of a class of nonlinear systems{{Cite book|last1=Atassi|first1=A.N.|last2=Khalil|first2=H.K.|title=Proceedings of the 37th IEEE Conference on Decision and Control (Cat. No.98CH36171) |chapter=A separation principle for the control of a class of nonlinear systems |chapter-url=http://dx.doi.org/10.1109/cdc.1998.760800|year=1998|volume=1|pages=855–860|publisher=IEEE|doi=10.1109/cdc.1998.760800|isbn=0-7803-4394-8|s2cid=126270534}} and control of quantum systems.

Proof of separation principle for deterministic LTI systems

Consider a deterministic LTI system:

: $\begin{align}
\dot{x}(t) & = A x(t) + B u(t) \\
y(t) & = C x(t)
\end{align}$

where

: $u(t)$ represents the input signal,

: $y(t)$ represents the output signal, and

: $x(t)$ represents the internal state of the system.

We can design an observer of the form

: $\dot{\hat{x}} = ( A - L C ) \hat{x} + B u + L y \,$

and state feedback

: $u(t) = - K \hat{x} \, .$

Define the error e:

: $e = x - \hat{x} \, .$

Then

: $\dot{e} = (A - L C) e \,$

: $u(t) = - K ( x - e ) \, .$

Now we can write the closed-loop dynamics as

: $\begin{bmatrix}
\dot{x} \\
\dot{e} \\
\end{bmatrix} =
\begin{bmatrix}
A - B K & BK \\
0 & A - L C \\
\end{bmatrix}
\begin{bmatrix}
x \\
e \\
\end{bmatrix}.$

Since this is a triangular matrix, the eigenvalues are just those of A − BK together with those of A − LC.Proof can be found in this math.stackexchange [https://math.stackexchange.com/q/21454]. Thus the stability of the observer and feedback are independent.

References

Brezinski, Claude. Computational Aspects of Linear Control (Numerical Methods and Algorithms). Springer, 2002.

Category:Control theory

Category:Stochastic control