Iterative rational Krylov algorithm

The iterative rational Krylov algorithm (IRKA) is an iterative algorithm useful for model order reduction (MOR) of single-input single-output (SISO) linear time-invariant dynamical systems.{{cite web |title=Iterative Rational Krylov Algorithm |url=https://morwiki.mpi-magdeburg.mpg.de/morwiki/index.php/Iterative_Rational_Krylov_Algorithm |website=MOR Wiki |access-date=3 June 2021}} At each iteration, IRKA performs a Hermite-type interpolation of the original system's transfer function. Each interpolation requires solving r shifted pairs of linear systems, each of size n \times n, where n is the original system order and r is the desired reduced model order (usually r \ll n).

The algorithm was first introduced by Gugercin, Antoulas and Beattie in 2008.{{Citation|title=H_{2} Model Reduction for Large-Scale Linear Dynamical Systems|first1=S.|last1=Gugercin|author2-link=Athanasios C. Antoulas|first2=A.C.|last2=Antoulas|first3=C.|last3=Beattie|journal=SIAM Journal on Matrix Analysis and Applications|volume=30|issue=2|pages=609–638|year=2008}} It is based on a first-order necessary optimality condition, initially investigated by Meier and Luenberger in 1967.{{Citation|title=Approximation of linear constant systems|first1=L.|last1=Meier|first2=D.G.|last2=Luenberger|journal=IEEE Transactions on Automatic Control|volume=12|issue=5|pages=585–588|year=1967}} The first convergence proof of IRKA was given by Flagg, Beattie and Gugercin in 2012,{{Citation|title=Convergence of the Iterative Rational Krylov Algorithm|first1=G.|last1=Flagg|first2=C.|last2=Beattie|first3=S.|last3=Gugercin|journal=Systems & Control Letters|volume=61|issue=6|pages=688–691|year=2012}} for a particular class of systems.

MOR as an optimization problem

Consider a SISO linear time-invariant dynamical system with input v(t) and output y(t):

: \begin{cases} \dot{x}(t) = A x(t) + b v(t)\\ y(t) = c^T x(t) \end{cases} \qquad A \in \mathbb{R}^{n \times n}, \, b,c \in \mathbb{R}^n, \, v(t),y(t) \in \mathbb{R}, \, x(t) \in \mathbb{R}^n.

Applying the Laplace transform with zero initial conditions, we obtain the transfer function G, which is a rational function of s:

:G(s)=c^T (sI-A)^{-1} b, \quad A \in \mathbb{R}^{n \times n}, \, b,c \in \mathbb{R}^n.
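
For illustration, G can be evaluated at a point s by solving one linear system rather than forming the matrix inverse explicitly. A minimal numpy sketch (the helper name is ours, not from the literature):

<syntaxhighlight lang="python">
import numpy as np

def transfer_function(A, b, c, s):
    """Evaluate G(s) = c^T (sI - A)^{-1} b by solving one n-by-n linear system."""
    n = A.shape[0]
    x = np.linalg.solve(s * np.eye(n) - A, b)  # x = (sI - A)^{-1} b
    return c @ x
</syntaxhighlight>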

Assume G is stable. Given r < n, MOR seeks to approximate the transfer function G by a stable rational transfer function G_r of order r:

: G_r(s) = c_r^T (sI_r-A_r)^{-1} b_r, \quad A_r \in \mathbb{R}^{r \times r}, \, b_r, c_r \in \mathbb{R}^r.

A possible approximation criterion is to minimize the absolute error in the H_{2} norm:

:G_{r} \in \underset{ \dim(\hat{G})=r, \, \hat{G} \text{ stable}} {\operatorname{arg \min}} \|G-\hat{G}\|_{H_2}, \quad \|G\|_{H_2}^2:= \frac{1}{2 \pi} \int \limits_{-\infty}^\infty |G(ja)|^2 \, da .
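
The article defines the norm through this frequency-domain integral; in practice, for a stable system it can be computed exactly from the controllability Gramian P, which solves the Lyapunov equation A P + P A^T + b b^T = 0 and satisfies \|G\|_{H_2}^2 = c^T P c. A minimal scipy sketch of that standard identity, which is not part of the article's own development (the helper name is ours):

<syntaxhighlight lang="python">
import numpy as np
from scipy.linalg import solve_continuous_lyapunov

def h2_norm(A, b, c):
    """||G||_{H2} via the controllability Gramian P, where A P + P A^T + b b^T = 0.
    Valid only when A is stable."""
    P = solve_continuous_lyapunov(A, -np.outer(b, b))  # solves A P + P A^T = -b b^T
    return float(np.sqrt(c @ P @ c))
</syntaxhighlight>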

This is known as the H_{2} optimization problem. The problem has been studied extensively and is known to be non-convex, which implies that it is usually difficult to find a global minimizer.

Meier–Luenberger conditions

The following first-order necessary optimality condition for the H_{2} problem is of great importance for the IRKA algorithm.

{{math theorem

| Assume that the H_{2} optimization problem admits a solution G_{r} with simple poles. Denote these poles by \lambda_{1}(A_{r}), \ldots, \lambda_{r}(A_{r}). Then G_{r} must be a Hermite interpolator of G through the reflected poles of G_{r}:

:G_{r}(\sigma_{i}) = G(\sigma_{i}), \quad G_{r}^{\prime}(\sigma_{i}) = G^{\prime}(\sigma_{i}), \quad \sigma_{i} = - \lambda_{i}(A_{r}), \quad \forall \, i=1,\ldots,r .

| note = [Theorem 3.4] [Theorem 1.2]

}}

Note that the poles \lambda_i(A_r) are the eigenvalues of the reduced r \times r matrix A_r.

Hermite interpolation

A Hermite interpolant G_r of the rational function G, through r distinct points \sigma_1, \ldots, \sigma_r \in \mathbb{C}, has components:

: A_r = W_r^* A V_r, \quad b_r = W_r^* b, \quad c_{r}=V_r^* c, \quad A_r \in \mathbb{R}^{r \times r}, \, b_r \in \mathbb{R}^r, \, c_r \in \mathbb{R}^r;

where the matrices V_r = ( v_1 \mid \ldots \mid v_r ) \in \mathbb{C}^{n \times r} and W_r = ( w_1 \mid \ldots \mid w_r ) \in \mathbb{C}^{n \times r} may be found by solving r dual pairs of linear systems, one for each shift [Theorem 1.1]:

:(\sigma_i I-A) v_i=b, \quad (\sigma_i I-A)^* w_i=c, \quad \forall \, i=1,\ldots,r .
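
A minimal numpy sketch of this construction follows. The projection formulas above implicitly assume the bases are normalized so that W_r^* V_r = I_r; the sketch enforces this through an explicit oblique normalization, one standard choice. All names are illustrative:

<syntaxhighlight lang="python">
import numpy as np

def hermite_interpolant(A, b, c, shifts):
    """Reduced model (A_r, b_r, c_r) interpolating G and G' at the given shifts.
    Assumes the shifted systems are solvable and W^* V is invertible."""
    n = A.shape[0]
    I = np.eye(n)
    V = np.column_stack([np.linalg.solve(s * I - A, b) for s in shifts])
    W = np.column_stack([np.linalg.solve((s * I - A).conj().T, c) for s in shifts])
    E = W.conj().T @ V                       # normalization making (effectively) W^* V = I
    A_r = np.linalg.solve(E, W.conj().T @ A @ V)
    b_r = np.linalg.solve(E, W.conj().T @ b)
    c_r = V.T @ c                            # c_r^T = c^T V_r, as in the pseudocode below
    return A_r, b_r, c_r
</syntaxhighlight>

Combined with the transfer_function helper above, one can check numerically that the reduced model matches G and G' at every shift, up to rounding error.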

IRKA algorithm

As can be seen from the previous section, finding a Hermite interpolant G_r of G through r given points is relatively easy. The difficult part is finding the correct interpolation points. IRKA tries to iteratively approximate these "optimal" interpolation points.

For this, it starts from r arbitrary interpolation points (closed under conjugation) and then, at each iteration m, imposes the first-order necessary optimality condition of the H_2 problem:

1. find the Hermite interpolant G_r of G through the current r shift points \sigma_1^m,\ldots,\sigma_r^m.

2. update the shifts by using the poles of the new G_r: \sigma_i^{m+1} = -\lambda_i(A_r), \, \forall \, i=1,\ldots,r .

The iteration is stopped when the relative change in the set of shifts between two successive iterations is less than a given tolerance. This condition may be stated as:

:\frac{ |\sigma_i^{m+1}-\sigma_i^m| }{ |\sigma_i^m| } < \text{tol}, \quad \forall \, i=1,\ldots,r .

As already mentioned, each Hermite interpolation requires solving r shifted pairs of linear systems, each of size n \times n:

: (\sigma_i^m I-A) v_{i} = b, \quad (\sigma_i^m I-A)^* w_i = c, \quad \forall \, i=1,\ldots,r .

Also, updating the shifts requires finding the r poles of the new interpolant G_r. That is, finding the r eigenvalues of the reduced r \times r matrix A_r.

Pseudocode

The following is pseudocode for the IRKA algorithm [Algorithm 4.1].

algorithm IRKA
    input: A, b, c, \text{tol}>0, \sigma_1,\ldots,\sigma_r closed under conjugation
    (\sigma_i I-A)v_i=b, \, \forall \, i=1,\ldots,r  % solve primal systems
    (\sigma_i I-A)^* w_i=c, \, \forall \, i=1,\ldots,r  % solve dual systems
    while relative change in \{\sigma_i\} > \text{tol}
        A_r = W_r^* A V_r  % reduced-order matrix
        \sigma_i = -\lambda_i(A_r), \, \forall \, i=1,\ldots,r  % update shifts, using poles of G_r
        (\sigma_i I-A)v_i=b, \, \forall \, i=1,\ldots,r  % solve primal systems
        (\sigma_i I-A)^* w_i=c, \, \forall \, i=1,\ldots,r  % solve dual systems
    end while
    return A_r=W_r^* A V_r, \, b_r=W_r^* b, \, c_r^T=c^T V_r  % reduced-order model
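
A minimal Python translation of this pseudocode, reusing the hermite_interpolant sketch from above; the max_iter cap and the sorting of shift sets are our additions, not part of the cited algorithm:

<syntaxhighlight lang="python">
import numpy as np

def irka(A, b, c, shifts, tol=1e-6, max_iter=100):
    """Iterate: interpolate at the current shifts, then mirror the reduced poles.
    Minimal sketch; a robust implementation would also guard against
    unstable intermediate models and repeated shifts."""
    shifts = np.asarray(shifts, dtype=complex)
    for _ in range(max_iter):
        A_r, _, _ = hermite_interpolant(A, b, c, shifts)
        new_shifts = -np.linalg.eigvals(A_r)      # sigma_i = -lambda_i(A_r)
        # sort both sets before comparing, since eigvals returns no fixed order
        old, new = np.sort_complex(shifts), np.sort_complex(new_shifts)
        shifts = new_shifts
        if np.max(np.abs(new - old) / np.abs(old)) < tol:
            break
    return hermite_interpolant(A, b, c, shifts)
</syntaxhighlight>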

Convergence

A SISO linear system is said to have symmetric state space (SSS) whenever A=A^{T}, \, b=c . Systems of this type appear in many important applications, such as the analysis of RC circuits and inverse problems involving the 3D Maxwell's equations. For SSS systems with distinct poles, the following convergence result has been proven: "IRKA is a locally convergent fixed point iteration to a local minimizer of the H_{2} optimization problem."
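
As a quick illustration, a small random SSS system can be generated and reduced with the sketches above; the construction below is hypothetical, chosen only so that A is symmetric and stable:

<syntaxhighlight lang="python">
import numpy as np

rng = np.random.default_rng(0)
n, r = 100, 4
M = rng.standard_normal((n, n))
A = -M @ M.T - n * np.eye(n)              # symmetric negative definite, hence stable
b = rng.standard_normal(n)
c = b.copy()                              # b = c: symmetric state space
sigma0 = np.linspace(1.0, 4.0, r)         # real starting shifts (closed under conjugation)

A_r, b_r, c_r = irka(A, b, c, sigma0)
print(np.linalg.eigvals(A_r).real)        # reduced poles should lie in the left half-plane
</syntaxhighlight>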

Although there is no convergence proof for the general case, numerous experiments have shown that IRKA often converges rapidly for different kinds of linear dynamical systems.

Extensions

The IRKA algorithm has been extended by the original authors to multiple-input multiple-output (MIMO) systems, and also to discrete-time and differential-algebraic systems [Remark 4.1].


References

{{Reflist}}