ETNA Electronic Transactions on Numerical Analysis. Volume 30, pp. 187-202, 2008. Copyright 2008, Kent State University. ISSN 1068-9613. Kent State University http://etna.math.kent.edu LOW-RANK ITERATIVE METHODS FOR PROJECTED GENERALIZED LYAPUNOV EQUATIONS TATJANA STYKEL Abstract. We generalize an alternating direction implicit method and the Smith method for large-scale projected generalized Lyapunov equations. Such equations arise in model reduction of descriptor systems. Low-rank versions of these methods are also presented, which can be used to compute low-rank approximations to the solution of projected generalized Lyapunov equations with low-rank symmetric, positive semidefinite right-hand side. Numerical examples are presented. Key words. projected generalized Lyapunov equations, alternating direction implicit method, Smith method, low-rank approximation AMS subject classifications. 65F10, 65F30, 15A22, 15A24, 1. Introduction. Consider a linear time-invariant descriptor system ! where , #"$&%(' % , )"*$%(' + , #"*$,-' % , ."/$% is the state vector, 0"*$&+ is the input and "1$, is the output. The matrix may be singular, but the pencil 2 4control 2 3 is assumed 9 3 :<;=> . Descriptor systems arise in many to be regular, i.e., 5768 (1.1) different applications including electrical circuit simulation, multibody dynamics and spatial discretization of partial differential equations, e.g., [6, 7, 8, 40]. Stability analysis and some control problems for (1.1) are strongly related to the projected generalized continuous-time algebraic Lyapunov equations (GCALEs) ? A @B C? @C 3 DE FG@ D E @ @ ? @ C? 3D H @ @ D H (1.2) (1.3) ? ? DH?CD H @ D E @ ?CD E and the projected generalized discrete-time algebraic Lyapunov equations (GDALEs) (1.4) (1.5) 2 L3 where DE DH ? @ 13 <?C @ *I E G @ I E@ @ ? 3K @ ?C *I H@ @ <I H ? JI H ? I H@ ? JI E@ ? I E I E NM 3/D E I H OM 3/D H and are the spectral projectors onto the left and right deflating subspaces of corresponding to the finite eigenvalues, and are the spectral projectors onto the left and right deflating subspaces corresponding to the eigenvalue at infinity. Let be in Weierstrass canonical form 2 3 > T > QPSR MU) (1.6) > VJW X and J*PSRZY> M %([ T W\X P and X are nonsingular, Y corresponds to the finite eigenvalues of where 2 ]3 theandmatrices V being nilpotent corresponds to the eigenvalue at infinity. The index ^ of _ Received March 10, 2006. Accepted for publication March 21, 2008. Published online on August 4, 2008. Recommended by P. Van Dooren. This work was supported by the DFG Research Center M ATHEON ”Mathematics for key technologies” in Berlin. Institut für Mathematik, MA 4-5, Technische Universität Berlin, Straße des 17. Juni 136, 10623 Berlin, Germany (stykel@math.tu-berlin.de). 187 ETNA Kent State University http://etna.math.kent.edu 188 T. STYKEL V nilpotency of is the index of the pencil be represented as 2 3 T > D E JP R M ) > > W P [a` (1.7) . Using (1.6) the projectors and 2 d3 D E and D H can T > D H X [` R M b > > W Xc It has been shown in [34] that if the pencil is stable, i.e., all its finite eigenvalues have negative real part, then the projected GCALEs (1.2) and (1.3) have the unique symmetric, positive semidefinite solutions which define the proper controllability and observability Gramians of the descriptor system (1.1). Furthermore, the projected GDALEs (1.4) and (1.5) have the unique symmetric, positive semidefinite solutions which are the improper controllability and observability Gramians of (1.1). The Gramians play a central role in analysis and control design problems for descriptor systems, such as the characterization of controllability and observability properties, computing or Hankel norm, minimal and balanced realizations as well as balanced truncation model order reduction [4, 34, 35, 36]. has been the topic The numerical solution of standard Lyapunov equations with of numerous publications [2, 3, 12, 13, 14, 17, 18, 22, 24, 29]. A variety of direct and iterative methods has been proposed there for computing the solutions of such equations, their Cholesky factors or low-rank approximations. The case of nonsingular has been considered in [5, 9, 15, 16, 21]. Until now, only direct methods have been extended to projected Lyapunov equations [33]. The solutions of (1.2) – (1.5) can be computed by the generalized Schur-Bartels-Stewart or the generalized Schur-Hammarling methods that are based on the preliminary reduction of the pencil to the generalized Schur form, solution of the generalized Sylvester and Lyapunov equations and back transformation. Since these methods cost operations and require memory location, they can be used only for problems of small or medium size. Due to the practical importance of the numerical solution of large-scale projected generalized Lyapunov equations that occur in balanced truncation model reduction of descriptor systems [35], the development of iterative methods for such equations is a challenging problem. In this paper we generalize the alternating direction implicit (ADI) method [17, 18, 22] and the Smith method [22, 29] to the projected generalized Lyapunov equations (1.2) and (1.4) with large sparse matrix coefficients. The dual equations (1.3) and (1.5) can be solved in a similar way. Low-rank versions of the ADI and Smith methods are also presented, which can be used to compute low-rank approximations to the solutions of (1.2) and (1.4) with a low-rank right-hand side. Such a problem arises, for example, in model reduction. Note that the number of columns of the matrix in (1.2) and (1.4) relates to the number of inputs of the underlying descriptor system (1.1) and it is usually small compared to the state space dimension of the problem. A major difficulty in the numerical solution of projected Lyapunov equations is that we need to compute the spectral projectors , or , . Fortunately, in many applications such as computational fluid dynamics and constrained structural mechanics, the matrices and have some special block structure. As the following examples show, this structure can be used to construct the projectors and in explicit form. E XAMPLE 1.1. Consider the descriptor system (1.1) with egf 4M 2 h3 i j f i jakl m j DE DH IE IH (1.8) DE `o` DH R > `n` > ` f W Q R n` ` ` f W f ` of f where is nonsingular. Such a system arises, for example, after linearization and spatial discretization of the Euler equation that describes the flow of a fluid through a supersonic ETNA Kent State University http://etna.math.kent.edu ITERATIVE METHODS FOR PROJECTED LYAPUNOV EQUATIONS 189 f ` D `na[ `E ` ` f D 3 H fnf is nonsingular, then the pencil 2 p3 in (1.8) and are given by D E )R M> q ` f 3 `n` `n[a` ` ` f > f ` `n[a` ` ` f 3 fnf ![` W D H R M 3 `n[a ` ` ` f q[ ` f ` `n[a3` ` ` f 3 [ ` fnf r[a`! f ` 3 Ms`n[a` g` q ` f f [a` ` `o[` ` 3 ` f 3 [afnf ` ![ `! fof W c f ` `n` ` f fof f ` f ` `n` ` f fof fnf E 1.2. Consider the descriptor system (1.1), where and have the form R > `n` >> W Q R `n` > ` f W c (1.9) f` Such systems arise in spatial discretization of the instationary incompressible Stokes equation [` [6, 7] and the convection equation [19]. If `n` and f ` `n` ` f are nonsingular, then the 2 t 3 in (1.9) is of index 2. In this case the spectral projectors D E and D H have the pencil following form D E uRwv > E 3 v E `o` `n[` ` ` f > q f ` `n[a` ` ` f r[a` W D H 4R 3 q [` r[v aH` [` H >> W f ` `o` ` f f ` `o` `n` v E )M 3 ` f f ` `n[` ` ` f ![` f ` `n[a` ` is a projector onto the kernel of f ` `n[` ` where v H /M 31 `n[a` ` ` f f ` `n[` ` ` f r[`x f ` `n[` ` v E `o` . along the image of ` f and v diffuser [40]. If the matrix is of index 1, and the projectors XAMPLE E XAMPLE 1.3. The motion of multibody systems with holonomic constraints can be described by nonlinear differential-algebraic equations of the first order [8, 28]. Linearization of these equations around an equilibrium state leads to the descriptor system (1.1) with > M > > M 3 > } > | > zy { Q }y { (1.10) > >}@ ~ > > >~ where | is a nonsingular mass matrix, is a stiffness matrix, 2 is a damping matrix and 4 3 is of index , and is a matrix of constraints. If H has full row rank, then the pencil E D D the spectral projectors and can be computed as 3 B D E y{ 3 v @ v M 3 v v > @ 3 v @ v | [v ` | [a` ` ` ~ > > > > > 3 v | [` v M 3 v D H y{ > c @` v 31 v | [a` qM 3 v @` v v >~ | [` @ | [a` @ r[a` and v QM 3 | [` @ | [a` @ r[` *M 31 ` is Here ` a projector onto the kernel of along the image of | [a` E @ ,H see E [28] forH details. D D In the following we will assume that the projectors , , I and I are given. Clearly, we do not compute these projectors explicitly. Instead, we use matrix-vector multiplication and linear system solvers if an inverse is required. In Section 2, we present a generalization of the ADI method and its low-rank version for the projected GCALE (1.2). In Section 3, ETNA Kent State University http://etna.math.kent.edu 190 T. STYKEL we discuss the numerical solution of the projected Lyapunov equations (1.2) and (1.4) via the (cyclic) Smith method. Section 4 contains some results of numerical experiments. Throughout the paper the open left half-plane is denoted by . We will denote by and the spaces of real and complex matrices, respectively. The real part of a complex number is denoted by Re . The matrix stands for the transpose of , denotes the complex conjugate and transpose of , and . An identity matrix of order is denoted by or simply . We will denote by the spectral matrix norm and by the Frobenius matrix norm of . $%(' + %7' + " $ %(' + <[ @ 9q<[a`U @ f jJ m j [ @ M% " M %(' + " %(' + 2. Alternating direction implicit method. The ADI method was originally proposed for linear systems [20] and then used in [17, 18, 22, 38] to solve standard continuous-time Lyapunov equations. The case of nonsingular has been considered in [16]. In this section, we present a generalization of the ADI method for the projected GCALE (1.2). Assume that the pencil is stable. Then the matrix is nonsingular, and the projected GCALE (1.2) is equivalent to the projected standard Lyapunov equation 2 3 (2.1) [a` ? ? q [a` @ 3DH [` GF@ [ @ D H @ ? DH?CD H @ c In this case an approximate solution of (1.2) can be computed by the ADI method applied to (2.1). The ADI iteration is given by ?. tq<[a` p M(r[a`- [` 3 M ?. [` <[a` 3 M( @ [` M(r[ @ (2.2) 3A Re [` M [` DH [` F @ [ @ D H @ [` M( [ @ ?. > and the shift parameters ` cUcc o " [ . It follows from with an initial matrix D H <[a` 3 M(9q<[` 3 M D H D H [` M![`t <[a` p M(r[a` D H ? D H ?-D H @ , i.e., the second equation in (1.2) and (2.1) is satisfied exactly. The that iteration (2.2) can also be written as (2.3) ? h :![`Z 3 : ? [` 3 : @ 3A Re p r[a` D E F @ D E @ 2 t3 ? :r[ @ :![ @ c ` cUcc " [ The following propositions give the convergence results for the ADI iteration. P ROPOSITION 2.1. If the pencil is stable and , then the ADI iteration (2.3) converges to the solution of the projected GCALE (1.2). Proof. Let be a solution of the projected GCALE (1.2). The error matrix can be computed from (2.3) recursively as ? (2.4) where (2.5) ?L3? ?3?B Q ? D H p [` 3 cUcc ` : [` 3 ` c Using the Weierstrass canonical form (1.6) and representation (1.7), we find that (2.6) X [` R Y > >> W X ETNA Kent State University http://etna.math.kent.edu ITERATIVE METHODS FOR PROJECTED LYAPUNOV EQUATIONS 191 2 h3 and 3 ` (Y Y . ,If t¡ Uc cc nis¢ ,stable lie inside ? the unit circle, and, hence, converges toward the solution . 2 £3 P 2.2. Consider the projected GCALE (1.2). Assume that the pencil ? ¢ is in Weierstrass canonical form (1.6), where Y is diagonal. Then the -th iterate of the N MG Y r[`Z M 3 Y cUcc MG ` Y r[a`-qM 3 Y ` cUcc " [ , then all the?B eigenvalues of M Y r[`Z M with ROPOSITION ADI method satisfies the estimate ?¤3K? Uf:¥/¦ f X ¨§ f ? f where ¦ X © X f X [a` f is the spectral condition number of the right transformation is the spectral radius of the matrix given in (2.5). matrix X in (1.6) and §ª (2.7) Proof. Estimate (2.7) immediately follows from (2.4) and (2.6). 2.1. Computing the shift parameters. As Proposition 2.2 shows, the convergence rate of the ADI iteration is determined by the spectral radius of the matrix as in (2.5) and depends strongly on the choice of shift parameters. The minimization of this spectral radius with respect to the parameters leads to the generalized ADI minimax problem ` ccUc o ¡ 3 ` ccUc Ç¡ 3 ( ¬ o ¯ ­ n ® ² ° 0 ± µ ³ ´ ± . ¯ ­ ¾ « ` c cUc (2.8) ¶o· '¹¸¹¸¹¸ ' ¶ º»¯¼-½ ¿ » wÀ Áà' ÄÅ&Æ Ç¡\ ` ccUc Ç¡p Æ Æ 2 p3 . TheÆ computation where SpÈ o: denotes the set of finite eigenvalues of the pencil of the optimal shift parameters is a difficult problem, since the finite eigenvalues of the pencil 2 L3 are, in general, unknown and expensive to compute. This problem is solved for ÉM and symmetric , e.g., [39], while the case of standard Lyapunov equations with complex eigenvalues of is still not completely understood; see [18, 27, 32, 39] for some Sp contributions. To compute the suboptimal ADI shift parameters for the standard problem, a heuristic algorithm has been proposed in [22]. This algorithm is based on Arnoldi iterations [25] applied to the matrices and . It can also be extended to the generalized problem (2.8). Due to the nonsingularity of this problem is equivalent to F[` 3 ` & cUcc 3 ¬ o ¯ ­ n ® & ° 0 ± à ³ ´ ± . ­ ¾ Ì (2.9) « ` Uc cc ¶ · '¹¸¹¸¹¸ ' ¶ º »¯¼ ½ ¿ » Á Ä ½ ·  qÅ ÊrË Æ p ` & cUcc ap Æ Æ denotes the spectrum of the matrix F[Æ ` . Thus, the suboptimal where Sp q[` ADI shift ` cUcc can be determined by the heuristic procedure parameters [22, Algorithm 5.1] from a set of largest and smallest (in modulus) non-zero approximate eigenvalues of [` . A conventional approach for computing largest and smallest eigenvalues of a matrix is to apply an Arnoldi process to this matrix and its inverse, respectively. However, if is singular, then the inverse of F[` does not exist. Observe that the recip rocals of the smallest non-zero eigenvalues of G[` are the largest finite eigenvalues of the 2 4 3 pencil D , where . The latter can be determined by an Arnoldi procedure applied to the matrix T > P [` D D H D H 3 I H [` t D E 3 I E : [` D E X [` R M >) >W and X , P are the transformation as in (1.6), see [30] for details. Similarly to the D H and D E , the matrix D matrices projectors can be obtained in explicit form using the special block structure of the matrices and . Sp ETNA Kent State University http://etna.math.kent.edu 192 T. STYKEL 2.2. Low-rank version of the generalized ADI method. Recently, an efficient modification of the ADI method has been proposed to compute low-rank approximations to the solutions of standard Lyapunov equations with large-scale matrix coefficients [17, 22]. This is the low-rank alternating direction implicit (LR-ADI) method. It was observed that the eigenvalues of the symmetric solutions of Lyapunov equations with low-rank right-hand side generally decay very rapidly, and such solutions may be well approximated by low-rank matrices [1, 10, 23, 31]. A similar result holds for projected generalized Lyapunov equations. In other words, it is possible to find a matrix with a small number of columns such that is an approximate solution of the projected GCALE (1.2). The matrix is referred to as the low-rank Cholesky factor of the solution of (1.2). A low-rank version of the generalized ADI iteration (2.3) can be derived analogously to the standard case [17, 22]. First of all note that the matrix in (2.3) is Hermitian, positive semidefinite, and the Cholesky factor of has the form ? Í Í Í:Í @ ?B Í ? Í Í p r[`Z 3 Í [`sÐ Í hÎUÏ 3A ReE U p :![` D E B hÎ Ñ ¯Òa-D ÓÑ [` Òa-ÔÕ¯Òa [` D E B cUcc ÓÑ ` ÒalÔÕ U Ô f Ò ` D E Ð 3A Re rl , Ò :4 Krwr[` and Ô A Q3 . Taking into account that where Ña: Ï Ò Ò : Ò Ò Ô [` Ô Ô [a` Ô Ò Ô :Q [` Ô Ò for ¢ , 4¡Z ccUc , the matrix Í can be rewritten as (2.10) Í tÎ nÖ [` £Ö [ f Ö [` cUcc £Ö ` Ö f UUnÖ [` Ð 4Ï 3A Re :![` D E ttÏ 3A Re D H p :![`U and where Ö t× Re r o Ø Ò Ô oØ ` 9× Re ! o Ø 1Ù M 3 oØ ` U p : [` :Ú c Re Re ` ` If we reenumerate the shift parameters in reverse order, then we obtain the following algorithm for computing the low-rank Cholesky factor of the solution of (1.2). A LGORITHM 2.1. The generalized LR-ADI method for the projected GCALE. D E "g$%(' % Û"£$%7' + ` ?O cUcà c o rÜÞÝqß " [ . Í Í of (1.2). Í E A 3 D Í Á ` Å 9Ï¢. ` p ` [a` Í ` Í Á ` Å ccUc Á Å t× Re M 3 p U p : [` Ú Á [` Å Í Í (2.11) [a` Re [` <Ù Í 9Î Í [a` Í Á Å Ð c I NPUT: , , , , shift parameters of the solution O UTPUT: A low-rank Cholesky factor 1. Re , . 2. FOR END FOR The ADI iteration can be stopped as soon as a normalized residual norm given by (2.12) á Í Í Í @ D E Í F Í @ D E @ @ D E F @ D E @ ETNA Kent State University http://etna.math.kent.edu ITERATIVE METHODS FOR PROJECTED LYAPUNOV EQUATIONS á Í ¥ Çâ-ã Çâ-ã 193 with a user-defined tolerance or a stagnation of norsatisfies the condition malized residual norms is observed. If the number of shift parameters is smaller than the number of iterations required to attain a prescribed tolerance, then we reuse these parameters in a cyclic manner. Note that computing the normalized residuals even via the efficient methods proposed in [22, 27] can still be quite expensive for large-scale problems. It should also be noted that for ill-conditioned problems, the small residual norm does not imply that the error in the computed solution is also small; see [33]. If converges to the solution of (1.2), then á Í ? Í Í ä å³Ã± æ Í Á Å Í Á Å ä å³µ± æ ?:3? [a` ² > c Therefore, just as in the standard case [17], the stopping criterion in Algorithm 2.1 can also be based on the condition wÍ Á Å :¥ Çâ-ã or Í Á Å wçÍ ¥ Çâ-ã with some matrix norm . R 2.3. The matrices J r[a` in Algorithm 2.1 do not have Eqto be com D è either puted explicitly. Instead, we solve linear systems of the form ¨p by EMARK computing (sparse) LU factorizations and forward/backward substitutions or by using iterative Krylov subspace methods [26]. In the latter case the generalized LR-ADI method has the memory complexity and costs flops, where is the number of outer ADI iterations and is the number of inner linear solver iterations. This method becomes efficient for large-scale sparse Lyapunov equations only if is much smaller than . R EMARK 2.4. In exact arithmetic the matrices satisfy and, hence, the second equation in (1.2) is fulfilled for the low-rank approximation . However, in finite precision arithmetic a drift-off effect may occur. In this case we need to project onto the image of by pre-multiplication with . In order to limit the additional computation cost we can do this, for example, at every second or third iteration step. R EMARK 2.5. If is nonsingular, but ill-conditioned with respect to inversion, then Algorithm 2.1 may provide a better result than the classical LR-ADI method [16, 17, 22] applied to the matrices and . Note that if at least one of the shift parameters is complex, then the low-rank Cholesky factor may be complex although the solution of (1.2) is real. As in the standard case [17, 22], in order to keep the real factors, we can take the complex shift parameters in complex conjugate pairs with and compute the iterates as follows i é ¢ Ä ê²Z¢ ë ì ím j j i q ¢Zì ír¢ Ä ê²ë m j Í DH Í (2.13) DH [` « Ø `¬ Í 9Î Í [`U Ø ` [a` ÁÍ Å Ð ¢ Äê²ë ¢(ì ín¢ Äê²ë m Í D @ H Í Í Í Í ÁÅ ? if Í is real Í 4Î Í [a` Í ` Á Å Ð (2.14) î Í Ø ` 4Î Í Í f Á Å Ð if ï Ø is` complex c Å 4ð Ø r[`U Á Å Á Å tð Ø r[` Á Å Á Í ,Í f Í and Í Á Å is Here Í ` ` ` as in (2.11). One Æ canÆ show that (2.13) and the double step (2.14) provide the real Cholesky factors of the solution of the projected GCALE (1.2). 2 h3 is stable P 2.6. Let , ñ"$&%(' % and u"$%(' + . Assume that ¬ . and that the complex shift parameters appear in complex conjugate pairs « Ø ` Then the sequence of the matrices Í in (2.13), (2.14) is real. Furthermore, for complex , we have Í Ø ` Í @ Ø ` tÎ Í [` Í Á Å Í Á Ø ` Å Ð Î Í [a` Í Á Å Í Á Ø `ÇÅ Ð c and ROPOSITION ETNA Kent State University http://etna.math.kent.edu 194 T. STYKEL ¢ ` Proof. We prove the first part of this proposition by induction on . If is real, then Í ` Í Á ` Å ð 3A ` p ` [a` D E ` , then we have is real. If ` is complex and f Á ` Å Ï 3 Re ` ` ` :![`F ` :![` D E B Í Í ` ` (2.15) Í&f hÎ Í ` Ï 3 Re ` Æ Æ ` :![` p ` :r[a` D E Ð c 3 [a` and : [` [` Next we show that the matrices Þ: [` F 2 t is stable, the matrix is nonsingular. are real for any complex . Since the pencil Therefore, ò:r[a`G p:![` Ù Ç<[a`¯ ò: Ú [` h [` Re Þ f r[` c Æ Æ Clearly, the inverse of a real matrix is also real. Furthermore, we have that [` [` * [` Þ: [` F p: [` is real. Thus, Í ` and Í f in (2.15) are both real. Assume now that the matrix Í is real, where ¢ is the index of a real parameter or of the [a` ¬ . Then the matrix second element in a complex conjugate pair « [` Í ó Á Åh 3 : Í Á Å hÏ 3A Re 3 p r[a`\ ccUc 3 ` p ` :![` D E is real. For real Ø ` , we obtain that Ø Í Á Ø `ÇÅ × ReRe ` p Ø ` : [` Í ó Á Å Ø is real, and, hence, Í Ø ` 4Î Í Í Á ` Å Ð is also real. If Ø ` is complex, then ÁÍ ` Ø ` Å Ø ` × Re Ø ` Ø ` : [` F Ø ` [a` Í ó Á Å Á Ø ` Å ×Æ ReÆ Ø Re` Ø : [a` Ø : [a` ó Á Å Íf ` ` Í Re Á Ø ` Å Ø Ð and Í Ø f 4Î Í Ø ` Í f Á Ø `ÇÅ Ð are also real. are real. Therefore, Í Ø ` tÎ Í Í ` 3 Ø ` Ø ` :r[a`! Ú Í Á Å and Further, for complex , we have Í Á ` Ų Ù M Í Ø ` Í @ Ø ` Í [a` Í @ [` Í ` Á Å Ù Í ` Á Å Ú @ Í f Á Å Ù Í f Á Å Ú @ Í [a` Í @ [` Í Á År Í Á Åo 3A Ø ` G Ø ` :![`Ã Í Á Å! Í Á Å 3F Ø ` Í Á Å Í Á ÅÇ @ Ø ` r[ @ :ô Ø ` f G Ø ` [` Í Á Å Í Á Å @ Ø ` [ @ Í [aÆ ` Í @ [Æ ` Í Á Å! ÍØ Á Åo Í Á Ø ` Å! Í Á Ø ` ØÅ dÎ Í [` Í Á År Í Á ` Å Ð Î Í [a` Í Á År Í Á `ÇÅ Ð c ETNA Kent State University http://etna.math.kent.edu ITERATIVE METHODS FOR PROJECTED LYAPUNOV EQUATIONS 195 Observe that (2.13) and (2.14) involve, in general, complex matrix operations. Complex arithmetic can be avoided if we rewrite (2.13) and (2.14) as ÁÍ Å × Re : [` Í ó Á [` Å Re [` Í dÎ Í [` Í Á Å Ð Í ó Á Å&4 3 Í Á Å and õÕù öµ÷!ø hú û ÷ úxü ý Reû þ û ÷-ÿ þ ù ý Re þ û ÷ ÿ ú û ÷ ú ÿ ù õ öµ÷ ù ø Re þ ÷ ù ÿ õÕ öµ÷!ø ü ý Reû þ û ÷¯ÿ ù þ ù ý Re þ û ÷ ÿ ú û ÷ ú ÿ ù õ öµ÷ ù ø ú û ú ù FõÕù öµ÷!ø ÷ Re þ ÷ ù ÿ Í Ø ` dÎ Í [` Í ` Á Å Í f Á Å Ð õ öµ÷ ù ø Re ! #"$ Re %&('*) +) ,-!./ ! ('0$ Re 123'*) +) ,4!. 6795 8 &: for real Re for a complex conjugate pair ; < =?>@<=ACBED <F=G . A drawback of this procedure is that the B inverse of HIJ HLKNM Re O <=%P2HQKQR <=+R SI is required. Solving linear systems with this matrix is usually more expensive than with HTKU<V=1I . 3. Smith method. For any parameter <#W#X J , the projected GCALE (1.2) is equivalent to the projected discrete-time Lyapunov equation (3.1) Y ZNY0[]\EZ^D_\a`bdc ca[e` g fb > ZhD `ib/ZN` 0 fb > where B YjDjO HTKQ<+I.PJ crDjs O Hk\ <lI.PmD \tM Re O <lPuO HKQ<+I.PJ n \UM Re O <lP/OoHpKq<+I.PJ Bv3w B I> Note that if the pencil xlHy\qI is stable, then ` b is the spectral projector onto the invariant subspace of the matrix Y corresponding to the eigenvalues inside the unit circle. In this case the Smith iteration (3.2) Z{z.D` b cmc [ ` fb > Z = DT` b c c [ ` fb KqY Z = J B Y [ can be used to compute an approximate solution of (3.1); see [29]. The number of iterations required for a desired accuracy in Z = depends on the parameter < . Note that the Smith method (3.2) is, in fact, the generalized ADI iteration (2.3) with a single parameter www <(D|< BqD D}<= . A modification of the Smith method has been proposed in [22] for computing a low-rank Cholesky factor of the solution of standard Lyapunov equations with a low-rank right-hand side. This version of the Smith method is based on the LR-ADI iteration with ~ shift parameters applied in a cyclic manner and referred to as the low-rank cyclic Smith (LR-Smith O~P ) method. It can be generalized for the projected GCALE (1.2) as follows: first one determines i www using the generalized LR-ADI method with the shift parameters < B > >2< and then solve the discrete-time Lyapunov equation Y ZNY [ \EZD\ [ > ETNA Kent State University http://etna.math.kent.edu 196 ¢. T. STYKEL . In summary, we have the following algorithm to compute where is as in (2.5) with the low-rank Cholesky factor of the solution of the projected GCALE (1.2). method for the projected GCALE. D E D H "£$%(' % , Û"£$%(' + , shift parameters ` ccUc o " [ . I : , , ?Oà Í Í of (1.2). O : A low rank Cholesky factor Í of the solution 1. Compute Í using Algorithm 2.1 and set Í Á `r' Å Í . cUcc 2. FOR ¢. Í Á ' Å ¡Í Á [a`r'& Å , FOR Á ' Å cUcc M 3 rwU pr [` :Ú Á ' [` Å Í Í (3.3) Ù Re END FOR Í tÎ Í Á [` Å D H Í Á ' Å Ð . A LGORITHM 3.1.The generalized LR-Smith NPUT UTPUT END FOR Í Í Note that if is real and the complex shift parameters appear in complex conjugate pairs, then the matrices are also real. R EMARK 3.1. The generalized LR-Smith( ) method is equivalent to the generalized LRADI method with cyclically repeated shift parameters. However, comparing (2.11) and (3.3), where and , respectively, one can see that Algorithm 2.1 is more efficient than Algorithm 3.1. R EMARK 3.2. At every iteration step in the generalized LR-ADI and LR-Smith methods the number of columns of the approximate solution factors and grows by and , respectively. To keep the low-rank structure in the Cholesky factors in case of large and/or slow convergence, we can replace the iterate by its low-rank approximation computed via the updated singular value decomposition; see [11] for details. Consider now the projected GDALE (1.4). If the matrix is nonsingular, then equation (1.4) is equivalent to the projected discrete-time Lyapunov equation Í Á [` Å" %(' + Í Á ' [ ` Å" %(' + Í m Í qãq m m ?3 q [a` ? q [` @ QI H [` F @ [ @ I @H ? JI H!? I @H c H is the spectral projector onto the invariant subspace of the matrix G[a` corNote that I H t<[a` I H is nilpotent with the responding to the zero eigenvalues. In this case I [` 2 3 . The unique solution of index of nilpotency ^ that is equal to the index of the pencil (3.4) the projected Lyapunov equation (3.4) is given by a[ ` q [` I H [` F @ [ @ I ? @ Thus, the Cholesky factor of the solution GDALE (1.4) has the form H [` ² [` I H [` B cUcc 4ÎoI ? T H@ q [` @ c of (3.4) and also of the projected [` [` I H [ ` Ð c It can be computed by the following algorithm that is a generalization of the Smith method for the projected GDALE (1.4). ETNA Kent State University http://etna.math.kent.edu ITERATIVE METHODS FOR PROJECTED LYAPUNOV EQUATIONS 197 A LGORITHM 3.2. The generalized Smith method for the projected GDALE. I H "£$%(' % Û"£$%(' + ? @ H Á ` ÅQI [`! ` Á `ÇÅ ¢0 cUcc ^ Á Ų/[` Á [a`ÇÅ 9Î [` Á Å Ð . I NPUT: , , and . O UTPUT: A Cholesky factor of the solution 1. , ; 2. FOR , END FOR ^ of (1.4). Á Å 2 Û3Çâ-ã Á Å Çâ-ã ¥ w ç G¥ IH Á Å Note that if the index of the pencil is unknown, then the iteration in Algorithm 3.2 can be stopped as soon as / or / with some matrix norm and a tolerance . If we want to compute the solution of (1.4) as accurate as possible, we should set to the machine precision. To avoid the drift-off of the columns of from the image of , the matrices should be pre-multiplied with after some iteration steps. In the case of singular , we can consider the Lyapunov equation Çâ-ã H I Çâ-ã ? 3 I < ?C I@ QI<GF@I<@ ñ H D H I H ![`¤ D E I E :![`I E . This equation has the same solution with ILhI as the projected GDALE (1.4), see [41]. Therefore, the Cholesky factor of the solution ? @ of (1.4) can also be computed by applying the Smith method to equation (3.5). H and I E , the matrix I can be constructed in explicit form using to the projectors I Similarly the special block structure of and . (3.5) 4. Numerical examples. In this section we present some results of numerical experiments. Computations were done on IBM RS 6000 44P Modell 270 with machine precision 2 using MATLAB 7. In all examples the suboptimal ADI shift parameters were computed as described in Section 2.1. E XAMPLE 4.1. Consider the 2D instationary Stokes equation that describes the flow of an incompressible fluid in a domain. The spatial discretization of this equation by the finite difference method on a uniform staggered grid leads to the descriptor system (1.1) with the matrices and as in (1.9), where , and . In our experiments the state space dimension of the problem is , and the matrix is chosen at random. Note that and are symmetric and is positive semidefinite. In this case the finite eigenvalues of are real. In Figure 4.1 we present the normalized residual norm as in (2.12) and the ratio for the generalized LR-ADI method with real shift parameters. The spectral radius of the matrix as in (2.5) is . One can see that the generalized LR-ADI method converges fast, and the solution of the projected GCALE (1.2) can be approximated quite accurately by a matrix of rank 30. The normalized residual norm stagnates on a relatively small level, which is caused by round-off errors. Note that does not decrease monotonically and more iteration steps are required to achieve than . Figure 4.2 shows the convergence history of the normalized residual norm for the generalized LR-ADI method and the generalized LR-Smith(10) method versus the iteration number. As expected, both methods give similar results. Furthermore, we computed the full rank Cholesky factor of the solution of the projected GDALE (1.4) using Algorithm 3.2. The Frobenius norms of the àh c p¡ > [` 2 3 Í ² Í Á Å UçÍ ? á Í Í Í ¥ Çâ-ã f f@ `n` *M n` ` Q @`n` jL¡Z ¡ > ô > ` f ` Q @` f "1$ %7' ` ñ á Í A¡ > §sq ` : > c > ¡ á Í ¥ Çâ-ã af "$%(' f ETNA Kent State University http://etna.math.kent.edu 198 T. STYKEL Normalized residual norms and relative updates PSfrag replacements ºº Generalized LR−ADI iterations, l=10 0 10 o −2 10 −4 10 −6 10 −8 10 −10 10 0 10 20 30 40 50 60 F IG . 4.1. Example 4.1: convergence history for the generalized LR-ADI method. Generalized LR−ADI and LR−Smith(10) iterations 0 10 LR−ADI, l=10 LR−Smith(10) −2 Normalized residual norm 10 Iteration k −4 10 −6 10 −8 10 −10 10 −12 10 0 5 10 15 20 25 Iteration k 30 35 40 F IG . 4.2. Example 4.1: normalized residuals for the generalized LR-ADI and LR-Smith(10) methods. Á ` Å c 2 ôÕB3 ¡ > , / Á f Å 9¡ c % ©¡ > f is of index . update matrices are ? is not surprising because the pencil Ák Å ¥ and / . This E XAMPLE 4.2. Consider the descriptor system (1.1) with non-symmetric matrices and as in (1.9) that has been obtained by the finite element discretization of a convection problem with a boundary control, see [19] for details. The problem has the state space dimension , and the matrix results from the boundary control. In Figure 4.3 we present the convergence history in terms of the normalized residual norms for the generalized LR-ADI method and the generalized LR-Smith(11) method. One can see that the solution of the projected GCALE (1.2) can be approximated by a matrix of rank 35. The spectral radius of is . The solution of the projected GDALE (1.4) has been computed in factored form with the full rank Cholesky factor . The Frobenius norms of the update matrices are and . , / j > f f@ d9Î > f@ Ð @ "$%(' ` `n` s§ q n` ` ² > c > ¡ ªf "9$K%(' f¡ > [ k f Ç ` Å Å 9¡ Á Á c c ? Ák Å ¥ ETNA Kent State University http://etna.math.kent.edu 199 ITERATIVE METHODS FOR PROJECTED LYAPUNOV EQUATIONS Generalized LR−ADI and LR−Smith(11) iterations 0 10 LR−ADI, l=11 LR−Smith(11) −2 Normalized residual norm 10 −4 10 −6 10 −8 10 −10 10 −12 10 0 5 10 15 20 25 Iteration k 30 35 40 45 F IG . 4.3. Example 4.2: normalized residuals for the generalized LR-ADI and LR-Smith(11) methods. *¡w E XAMPLE 4.3. Consider a damped mass-spring system with masses, see [37, Section 3.9]. The -th mass of weight is connected to the -st mass by a spring and a damper with constants and , respectively, and also to the ground by another spring and damper with constants and F , respectively. Additionally, we assume that the first mass is connected to the last one by a rigid bar and it can be influenced by a control. The vibration of this system is described by the descriptor system (1.1) with the matrices and as in (1.10). For , we obtain a problem of the state space dimension with . The system parameters are and ¢ ¦ m j #¡ >>Z> ¡ Q >>Z> m ` c cUc m 9¡ >Z> ¢ ` cUcUc Q¢ [ ` Q¢0 ¦ ` cUcUc ¦ ¦ ôò ` cUcUc [` ` ccUc (c Í and the relative updates Í á Figure 4.4 shows the normalized residual norms for the generalized LR-ADI method with :4¡ complex ADI shift parameters. We see that the low-rank Cholesky factor Í of the solution of the projected GCALE (1.2) computed with the stopping criterion Í ¥ Çâ-ã has about twice more columns than those computed with the stopping criterion á Í ¥ Çâ-ã . Since the spectral radius §sq ` f > c >Z> is very small, the generalized LR-ADI iteration converges very fast. > c and > ec that results Furthermore, we change the damping constants to > ª § ¡ in residual norms á Í ` f c . Figure 4.5 shows that in this case the normalized s § q decay quite slowly compared to the previous experiment with ` f > c >Z> . This example Û"g$%(' ` demonstrates that the convergence rate of the generalized LR-ADI method strongly depends on the spectral radius of the iteration matrix in (2.5). Finally, we investigate the impact of the growing number of columns in on the convergence of the LR-ADI method. Assuming that the first masses are influenced 2¡ ¡ by controls, we obtain , where ¡ denotes the -th column of . Figure 4.6 shows the normalized residual norms for the generalized LR-ADI method applied to the problems with , and inputs. One can see that for different the convergence is about the same. ¤]Î Ø ` ccUc Ø + Ð ¡> m ¡ m m " $%(' + Mf Ø ` 5. Conclusion. In this paper we have discussed the numerical solution of large-scale projected generalized Lyapunov equations that arise, for example, in model reduction of descriptor systems. We have presented the generalized low-rank alternating direction implicit ETNA Kent State University http://etna.math.kent.edu 200 T. STYKEL Normalized residual norms and relative updates PSfrag replacements ºº Generalized LR−ADI iterations, l=12 0 10 ¢ −5 10 −10 10 −15 10 0 10 20 30 40 Iteration k 50 60 70 F IG . 4.4. Example 4.3: convergence history for the generalized LR-ADI method. Generalized LR−ADI iterations, l=12 0 Normalized residual norms 10 PSfrag replacements d = 3, δ = 7 d = 0.3, δ = 0.7 −5 10 −10 10 −15 10 0 50 100 150 Iteration k 200 250 F IG . 4.5. Example 4.3: convergence history for the generalized LR-ADI method for different sets of damping parameters. method and the generalized low-rank cyclic Smith method for computing low-rank approximations to the solutions of these equations. The efficiency of these methods has been demonstrated by numerical experiments. REFERENCES [1] A. C. A NTOULAS , D. C. S ORENSEN , AND Y. Z HOU , On the decay rate of the Hankel singular values and related issues, Systems Control Lett., 46 (2002), pp. 323–342. [2] R. H. B ARTELS AND G. W. S TEWART , Solution of the equation £C¤¦¥¤9§¦¨*© , Comm. ACM, 15 (1972), pp. 820–826. [3] U. B AUR AND P. B ENNER , Factorized solution of Lyapunov equations based on hierarchical matrix arithmetic, Computing, 78 (2006), pp. 211–234. [4] D. J. B ENDER , Lyapunov-like equations and reachability/observability Gramians for descriptor systems, IEEE Trans. Automat. Control, 32 (1987), pp. 343–348. [5] P. B ENNER AND E. S. Q UINTANA -O RT Í , Solving stable generalized Lyapunov equations with the matrix sign function, Numer. Algorithms, 20 (1999), pp. 75–100. ETNA Kent State University http://etna.math.kent.edu ITERATIVE METHODS FOR PROJECTED LYAPUNOV EQUATIONS Generalized LR−ADI iterations, l=12 0 10 Normalized residual norms 201 LR−ADI, m = 1 LR−ADI, m = 5 LR−ADI, m = 10 −5 10 −10 10 −15 10 0 5 10 15 20 Iteration k 25 30 35 F IG . 4.6. Example 4.3: normalized residual norms for different number of inputs. [6] K. B ERNERT , Differenzenverfahren zur Lösung der Navier-Stokes-Gleichungen über Orthogonalen Netzen, Wissenschaftliche Schriftenreihe 10/1990, Technische Universität Chemnitz, 1990. [German] [7] K. E. B RENAN , S. L. C AMPBELL , AND L. R. P ETZOLD , The Numerical Solution of Initial-Value Problems in Differential-Algebraic Equations, Elsevier, North-Holland, New York, 1989. [8] E. E ICH -S OELLNER AND C. F ÜHRER , Numerical Methods in Multibody Dynamics, B.G. Teubner, Stuttgart, 1998. [9] J. D. G ARDINER , A. J. L AUB , J. J. A MATO , AND C. B. M OLER , Solution of the Sylvester matrix equation £C¤ §ª ¥3©i¤9«]ª{¨#¬ , ACM Trans. Math. Software, 18 (1992), pp. 223–231. [10] L. G RASEDYCK , Existence of a low rank of H-matrix approximation to the solution of the Sylvester equation, Numer. Linear Algebra Appl., 11 (2004), pp. 371–389. [11] S. G UGERCIN , D. C. S ORENSEN , AND A. C. A NTOULAS , A modified low-rank Smith method for large-scale Lyapunov equations, Numer. Algorithms, 32 (2003), pp. 27–55. [12] S. J. H AMMARLING , Numerical solution of the stable non-negative definite Lyapunov equation, IMA J. Numer. Anal., 2 (1982), pp. 303–323. [13] D. Y. H U AND L. R EICHEL , Krylov-subspace methods for the Sylvester equation, Linear Algebra Appl., 172 (1992), pp. 283–313. [14] I. M. JAIMOUKHA AND E. M. K ASENALLY , Krylov subspace methods for solving large Lyapunov equations, SIAM J. Numer. Anal., 31 (1994), pp. 227–251. [15] V. B. L ARIN AND F. A. A LIEV , Generalized Lyapunov equation and factorization of matrix polynomials, Systems Control Lett., 21 (1993), pp. 485–491. [16] J.-R. L I AND T. P ENZL , Square root method via Cholesky factor ADI, in Proceedings of the 14th International Symposium of Mathematical Theory of Networks and Systems, Perpignan, France, 2000, CD-ROM. [17] J.-R. L I AND J. W HITE , Low rank solution of Lyapunov equations, SIAM J. Matrix Anal. Appl., 24 (2002), pp. 260–280. [18] A. L U AND E. WACHSPRESS, Solution of Lyapunov equations by alternating direction implicit iteration, Comput. Math. Appl., 21 (1991), pp. 43–58. [19] K. H. A. O LSSON , Model Order Reduction in FEMLAB by Dual Rational Arnoldi, Master thesis, Department of Mathematics, Chalmers University of Technology and Göteborg University, Göteborg, Sweden, 2002. [20] D. W. P EACEMAN AND H. H. R ACHFORD , The numerical solution of parabolic and elliptic differential equations, J. Soc. Indust. Appl. Math., 3 (1955), pp. 28–41. [21] T. P ENZL , Numerical solution of generalized Lyapunov equations, Adv. Comput. Math., 8 (1998), pp. 33–48. [22] , A cyclic low-rank Smith method for large sparse Lyapunov equations, SIAM J. Sci. Comput., 21 (2000), pp. 1401–1418. [23] , Eigenvalue decay bounds for solutions of Lyapunov equations: the symmetric case, Systems Control Lett., 40 (2000), pp. 139–144. [24] Y. S AAD , Numerical solution of large Lyapunov equations, in Signal Processing, Scattering, Operator Theory, and Numerical Methods, M. A. Kaashoek, J. H. Van Schuppen, and A. C. M. Ran, eds., Birkhäuser, Boston, 1990, pp. 503–511. [25] , Numerical Methods for Large Eigenvalue Problems, Series in Algorithms and Architectures for Advanced Scientific Computing, Manchester University Press, Manchester, 1992. ETNA Kent State University http://etna.math.kent.edu 202 T. STYKEL , Iterative Methods for Sparse Linear Systems, PWS Publishing Company, Boston, 1996. [26] [27] J. S ABINO , Solution of Large-Scale Lyapunov Equations via the Block Modified Smith Method, Ph.D. thesis, Department of Computational and Applied Mathematics, Rice University, Houston, 2006. [28] R. S CH ÜPPHAUS , Regelungstechnische Analyse und Synthese von Mehrkörpersystemen in Deskriptorform, Fortschritt-Berichte VDI, Reihe 8, Nr. 478, VDI Verlag, Düsseldorf, 1995. [German] [29] R. A. S MITH , Matrix equation ¤9£0¥3§¤r¨­© , SIAM J. Appl. Math., 16 (1968), pp. 198–201. [30] V. I. S OKOLOV , Contributions to the Minimal Realization Problem for Descriptor Systems, Ph.D. thesis, Institut für Mathematik, Technische Universität Chemnitz, 2006. [31] D. C. S ORENSEN AND Y. Z HOU , Bounds on eigenvalue decay rates and sensitivity of solutions of Lyapunov equations, Technical Report TR02-07, Rice University, Houston, 2002. [32] G. S TARKE , Optimal alternating direction implicit parameters for nonsymmetric systems of linear equations, SIAM J. Numer. Anal., 28 (1991), pp. 1431–1445. [33] T. S TYKEL , Numerical solution and perturbation theory for generalized Lyapunov equations, Linear Algebra Appl., 349 (2002), pp. 155–185. , Stability and inertia theorems for generalized Lyapunov equations, Linear Algebra Appl., 355 (2002) [34] pp. 297–314. [35] , Gramian-based model reduction for descriptor systems, Math. Control Signals Systems, 16 (2004), pp. 297–319. , On some norms for descriptor systems, IEEE Trans. Automat. Control, 51 (2006), pp. 842–847. [36] [37] F. T ISSEUR AND K. M EERBERGEN , The quadratic eigenvalue problem, SIAM Rev., 43 (2001), pp. 235–286. [38] E. WACHSPRESS, Iterative solution of the Lyapunov matrix equation, Appl. Math. Lett., 1 (1988), pp. 87–90. [39] , The ADI minimax problem for complex spectra, in Iterative Methods for Large Linear Systems, D. Kincaid and L. Hayes, eds., Academic Press, 1990, pp. 251–271. [40] K. W ILLCOX AND G. L ASSAUX , Model reduction of actively controlled supersonic diffuser, in Dimension Reduction of Large-Scale Systems, P. Benner, V. Mehrmann and D. Sorensen, eds., Lect. Notes in Comput. Sci. Eng. Vol. 45, Springer, Berlin, 2005, pp. 357–361. [41] L. Z HANG , J. L AM , AND Q. Z HANG , Lyapunov and Riccati equations for discrete-time descriptor systems, IEEE Trans. Automat. Control, 44 (1999), pp. 2134–2139.