IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 49, NO. 10, OCTOBER 2004 1607 Operator Theoretic Approach to the Optimal Two-Disk Problem Seddik M. Djouadi, Member, IEEE Abstract—The nonstandard two-disk problem plays a fundamental role in robust feedback optimization. Here, it is shown via Banach space duality theory that its solutions satisfy an extremal identity, and may be viewed as a dual extremal kernel of a particular 1 -optimization problem. A novel operator theoretic framework to characterize explicitly its solutions is developed, in particular, the two-disk optimization is shown to be equal to the induced norm of a specific operator defined on a projective tensor product space involving a non-Hilbert version of a vector valued 2 space. Moreover, this operator is shown to be a combination of multiplication and Toeplitz operators. Under certain conditions, existence of maximal vectors is established leading to an explicit formula for the optimal controller. An “infinite matrix” representation with respect to a canonical basis is derived, together with an algorithm to compute it. The norm of the relevant operator is approximated by special finite dimensional optimizations whose solutions lead to solving semi-definite programming problems involving the computation of a matrix projective tensor norm. (3) , is known where as the trace-class norm. denotes the unit disk of the com. denotes the plex plane, . If is a subset boundary of , , then denotes the complement of in . deof notes the normalized Lebesgue measure on the unit circle , . is the label used for “Lebesgue almost everywhere.” For a matrix or vector-valued function on the unit is the real-valued function defined on the unit circle circle, , . If denotes a finite diby , , stands mensional complex Banach space, for the Lebesgue-Bochner space of th power absolutely inteunder the norm grable -valued functions on Index Terms—Feedback systems, functional analysis, optimal control, optimization methods, robustness, uncertainty. for for NOTATION stands for the field of complex numbers. denotes either the inner or duality product depending on the context. denotes the identity map. If is a Banach space, then de, where denotes notes its dual space. For an -vector is the Euclidean norm. the -dimensional complex space, is the space of matrices , where is the largest , and denote the complex singular value of . Banach space of -vectors , , and (4) (5) where , denotes the norm on [1], and is the abbreviation used for the essential supremum. If , , the th Fourier coefficient is defined by , which define the well-known , , is the Fourier series representation of . Hardy space of -valued analytic functions on the unit disk , . In fact these spaces can viewed as a closed subspace of be realized as with, respectively, the norms if and Clearly, , of (1) is the dual space of and vise versa. , , and denote the complex Banach space matrices , , is defined as such that . denotes the space of continuous -valued Finally, . functions defined on The space with, respectively, the following norms: (2) Manuscript received June 29, 2001; revised May 13, 2002, November 2, 2002, and May 26, 2004. Recommended by Associate Editor V. Balakrishnan. The author is with the Electrical and Computer Engineering Department, University of Tennessee, Knoxville, TN 37996-2100 USA (e-mail: djouadi@ece.utk.edu). Digital Object Identifier 10.1109/TAC.2004.835356 (6) I. INTRODUCTION I T IS WELL KNOWN that there are mainly two reasons for using feedback, the first is to reduce the effect of any unmeasured disturbances acting on the system, the second is to reduce the effect of plant uncertainty. The objective of this paper is to investigate the ability of feedback to reduce the effect of unceroptimization known as tainty by solving the nonstandard the two-disk problem 0018-9286/04$20.00 © 2004 IEEE (7) 1608 where IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 49, NO. 10, OCTOBER 2004 is a stable plant and are outer weighting functions. It is well known that the problem of worst-case disturbance rejection in a feedback system subject to a pair of uncertain disturbances representing, for example, output and sensor noises, without the simplifying assumption that these are mutually orthogonal reduces to (7) [2], [3]. More importantly, the two-disk problem reflects the fundamental tradeoff between sensitivity , and complementary sensitivity , as outlined in [2], where it was termed the robust performance problem. Other fundamental feedback control problems related to (7) are the optimal robust disturbance attenuation problem (ORDAP) [4]–[7], where an uncertain plant is subject to disturbances at the output, and where the objective is to find a feedback control law which provides the best possible uniform attenuation of output disturbances in spite of uncertainty in the plant model; a plant uncertainty reduction or filtering problem posed by Zames [4, p. 316], where the following question is posed: If a plant lies in some uncertainty set what is the best achievable uncertainty reduction if a feedback control is to be used?, as well as questions of well posedness in feedback optimization studied by Smith [8] and rank one -synthesis. The two-disk optimization problem (7) was considered by, among others, Francis, Owen, and Zames [5]–[7]. In particular, they showed that ORDAP reduces to finding the smallest fixed point of the function shown in (8) at the bottom of the page. In the single-input–single-output (SISO) case, (8) takes the form shown in (9) at the bottom of the page, where the right-hand side (RHS) member is equivalent to the optimization (7). In [5] and [6], it was shown that there is a parametric verproblem which provides sion of the standard two-block an approximate solution based on the standard two-block problem. Zames and Owen showed that such an approximation can be arbitrary poor, and is never better than a factor of , and therefore they stressed the need for an “exact” theory for such problems. The point is that in situations where the , a more fixed-point is sensitive to correct estimation of exact optimization theory is required than that provided by the quadratic two-block problem [9], [10], [7]. Using duality theory they showed that under certain conditions there exists a solution to the two-disk problem, which satisfies an allpass or flatness condition, and is unique in the SISO case. By gridding the frequency domain they proposed a numerical algorithm based on convex programming. However, this algorithm assumes uniform Lipschitz continuity of the problem data (plant and weights), which may be restrictive and has the drawback of solely ap- proximating the optimal solution at a finite number of points. Moreover, duality theory has not fully been exploited, and there has been no closed form solution reported in the literature. To some extent the two-disk problem has also been considered by Holohan and Safonov in the context of optimizing the stability and robustness margins, where part of the duality of the problem was characterized in the SISO case [11]. Recently, Besson and Shenton proposed an approximate graphical ad-hoc solution to the suboptimal version of (7) [12]. The nonstandard nature of the norms involved in the optimizations (7)–(9) make this class of problems intractable by state-space techniques, simply because these norms (as shown later) are induced on certain Banach (non-Hilbert) spaces for which the Parseval Theorem does not hold. Here, it is resorted to developing tools from functional analysis and operator theory parallel to those used during the control theory in the early eighties (see, e.g., emergence of [6], [13], and [14]). In particular, we develop a Banach space duality theory which leads to an exact solution in terms of operator theory. First, it is shown that an extremal measure in the dual is absolutely continuous with respect to the Lebesgue measure, a consequence of this result is that the optimal solution of the two-disk problem satisfies an extremal identity. It is then shown that the optimal solution is a dual extremal function for an space. approximation problem in a particular vector valued Next, it is shown, in particular when flatness holds, that the optimal performance in the two-disk optimization is equal to the induced norm of a special operator. This operator is a Banach space projection of a multiplication operator. The projection is computed explicitly showing that our operator is in fact a combination of multiplication and Toeplitz operators defined on and a subspace of the the projective tensor product of non-Hilbertian (Banach) vector valued space. Under certain conditions, the existence of maximal vectors is proved leading to an explicit formula for the optimal controller in the SISO case. An infinite matrix representation with respect to a canonical basis is given together with general formulas and a recursive algorithm to compute its columns. An approximation of the operator norm by particular convex optimizations subject to a matrix projective tensor norm is also provided, and allows the computation of optimal performance as closely as desired. Finally, it is shown that the computation of the matrix tensor norms reduce to solving a particular semidefinite programming (SDP) with linear matrix inequalities. The latter results in a uniform approximation of the optimal solution unlike the pointwise approximation suggested in [7], [10] under the assumption of (8) (9) DJOUADI: OPERATOR THEORETIC APPROACH TO THE OPTIMAL TWO-DISK PROBLEM uniform Lipschitz continuity of the plant and weights. Although the results here are developed for stable plants, generalization to unstable plants can be made without much effort by working with coprime factorization techniques [15]. In view of the isoon the unit disk and on metric isomorphism between the right-half plane, we will confine ourselves to the unit disk. The results obtained apply to both discrete and continuous time systems. Parts of the results here appeared in [15]–[19]. II. EQUIVALENT DISTANCE PROBLEM IN 1609 III. BANACH SPACE DUALITY STRUCTURE AND EXISTENCE OF OPTIMAL SOLUTIONS denotes the space of Lebesgue measurable, Let and absolutely integrable functions defined on the unit circle with values in under the norm (13) where was defined earlier as the Banach space of (eswith sentially) bounded analytic functions in the unit disk , the norm is given by values in , of , , 2. Let , and the singular values be the quotient space , where (10) such that where . Since is analytic is a subharmonic function there in , then [1], hence, it satisfies the maximum principle and, therefore (11) Letting have inner-outer factorization , and inner–outer factorization , the outer part of , then (7) is equivalent to [10], [9] (12) to the Equation (12) is the shortest distance from subspace and are outer in where [9] and [10], if we assume that of , and (14) the space is obtained by taking complex conjugates. The quotient norm is (15), as shown at the bottom of denotes the coset associated to . It has the page, where is isobeen shown in [21], [22], [9], and [10] that . Since metrically isomorphic to the dual space of may be viewed as a subspace of , and is the preannihilator (or preorthogonal) in , then is isoof metrically isomorphic to the dual space of and we write . Every defines a continon given by uous linear functional , (16) inner. As in then is a closed subspace of , and there exists an , such that outer spectral factor of . Assumption (A1) precludes strictly proper plants, but as in the standard theory, our theory can be extended to them by modifying (see, e.g., [20], [9], and [15]). Moreover, has the following equivalent representation [10], , where , , , so that , Lebesgue almost everywhere ), that is, is inner. This notation should not be confused ( with that for rational functions in . Banach space duality theory is a very important tool in solving optimization problems in normed linear spaces, in particular, the so-called duality product plays a role analogous to the inner product in Hilbert space. In Section III, we give a similar duality structure to [9], [10] which will prove to be more useful for our purpose. where space of . Let be the sub- defined by (17) where denotes the direct sum of two subspaces, is the identity map, and (18) The following Lemma characterizes the preannihilator (or preorthogonal) of . is the preannihilator of in . Lemma 1: Proof: Since is the preannihilator of , the Lemma follows from Lemma 1 in [10], [7], and [3]. From a standard result in Banach space duality theory relating the distance from a vector to a subspace and an extremal (15) 1610 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 49, NO. 10, OCTOBER 2004 functional in the predual [23, Th. 2, p. 121], we deduce the following theorem, which asserts the existence of at least one optimal (controller) . Theorem 1: Under assumption (A1), that is, , , there exist at least one optimal such that is given by the space The dual space of of -valued functions of bounded variations on the norm [21], [9], [10], [3] under (22) , and denotes the sum of the total where of all entries of and . Let variations on be the space of -valued functions which are continuous on the closure of the unit disk and analytic in . is known as the disk algebra. The annihilator of is completely described by the vector-valued version of the F. and M. Riesz Theorem [25], which yields (19) denotes the coset norm in . where In general, the supremum on the left-hand side of (19) is not . The best we can say at this point achieved by any is that, it follows by the Bishop–Phelps Theorem [24], that the for which the supremum in set of functions . However, if we (19) is attained is norm dense in assume is continuous on the unit circle and where (20) i.e., when the open unit disk analyticity constraint ), is removed, (in the scalar case ( ), then the optimal solution is flat [9], [10], [7], and the supremum in question is always achieved. This motivates the following section, where an alignment condition and an extremal expression are shown to characterize the optimal solution. Note that assumption (A2) is weaker than a similar assumption in [10], [7], and [9]. IV. ABSOLUTE CONTINUITY AND EXISTENCE OF AN EXTREMAL IDENTITY FOR THE OPTIMUM (23) The dual space of norm is then given by [26], , under the quotient . Next define the subspace . It folis lows from Lemma 3 [10], [3] that the annihilator of given by (24), as shown at the bottom of the page, where . In the following lemma, we establish that the distance from the to the subspace is the same as vector function to . Lemma 2: Under assumptions (A1) and (A2), the following holds: (25) (26) A. Alignment in the Dual Recall that denote the space of continuous functions on with values in . If we set (21) It turns out that under assumptions (A1) and (A2), the maximum in (26) is achieved by a measure absolutely continuous with respect to the Lebesgue measure. It will then follow by the F. and M. Riesz Theorem that the supremum in (19) is in fact achieved . in (24) DJOUADI: OPERATOR THEORETIC APPROACH TO THE OPTIMAL TWO-DISK PROBLEM Lemma 3: Under assumptions (A1) and (A2), we have 1611 . The optimal performance is then a dual extremal function for the approximation problem . , Proof: “Only if” by assumption , and such that where (27) where denotes the coset norm in . Proof: See the Appendix. Lemma 3 implies that under mild assumptions the optimal solution is flat or allpass. It follows also from the proof that the optimal solutions in the dual are aligned. (31) but the integrand B. Dual Extremal Property and Extremal Identity for the Optimum -spaces of scalar valued A well-known fact in the theory of contains a representative functions is that every coset in of the least possible coset norm [27], [28], [26]. More precisely, , consider the linear functional on given a kernel defined by . Knowing that , find a kernel function , such that , and . A dual extremal function is a function , such that, [29], [27], [26]. We will show that these remarkable properties hold , and are inherited by . The opfor can then be intertimal solution preted as the dual extremal function of an optimization problem . in Lemma 4: [16], [15], [19] , there exists a) Given such that (28) b) Given there exists such that (29) is then an extremal kernel or function for . Proof: See the Appendix. Combining Lemmas 3 and 4, we obtain the following theorem which looks familiar in the theory of extremal problems [26]. Theorem 2: Under assumptions (A1) and (A2), , , is an extremal kernel for , and is optimal if and only if (30) integrating implies equality must hold throughout, and (30) holds. This, combined with flatness, implies (32) “If” suppose that (30) holds, integrating it yields hence, equality must hold throughout and is then optimal. Remark 1: Theorem 2 shows that if the convex programming method of [7] and [10] is to be used, the search in the predual which is convex can be restricted to functions which , , (or satisfy , , in the scalar case), moreover, expression (30) can be used as a test for optimality . The RHS of (30) shows that the optimal primal and for dual solutions are “pointwise” aligned in the dual. Note that is not constant, ( ), and are nonzero, in case ), and under assumptions (A1), (A2), , ( , That is at each frequency the dual extremal function , (at least in the scalar . From the case), is an extreme point of the unit ball of flatness condition the RHS of expression (30) is equal almost everywhere to , and points to a tradeoff between the optimal nominal sensitivity represented by , and the optimal nominal complementary sensitivity at each frequency . Since (30) is equal to , it also shows how the weighting functions affect the optimal solution at each frequency. A novel operator theoretic framework to characterize explicitly the solutions to the two-disk problem is developed next. 1612 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 49, NO. 10, OCTOBER 2004 V. OPERATOR THEORETIC SOLUTION TO THE TWO-DISK PROBLEM First, introduce a key multiplication operator acting on parand spaces which are not Hilbert. ticular vector valued As pointed out previously, the induced norm of this multiplication operator is equal to the norm involved in the two-disk, and clearly shows that the non-Hilbertian nature of the vector valued and preventing using state–space or time domain techniques to solve this problem. However, these spaces essentially capture the geometry of the problem. Proof: See the Appendix. This proposition shows that these spaces although Banach (non-Hilbertian) are not only reflexive, but also isomorphic to and , respecthe standard Hilbert spaces tively. Moreover, their norms are equivalent. This will allow us to bring some Hilbert space structure into the problem. In the next section, we characterize the optimal solution in terms of a particular operator. VI. EXACT OPERATOR THEORETIC SOLUTION A. Key Multiplication Operator For the remainder of this paper, we will assume that (A1) and (A2) hold. is the complex space of Lebesgue inRecall that -valued functions defined on the unit circle , tegrable under the norm A. Optimal Performance for (33) Note that is not given by any Hilbert space is not a Hilbert space, but is innorm, so that , where deed isomorphic to the Hilbert space is the standard -dimensional Euclidean space. Let , the multiplication operator associated with , and mapping the standard Lebesgue into is denoted by . More precisely space It is clear that the identity map from the Hardy space onto the standard Hilbert space is isomorphic, since these spaces are topologically identical and their norms are equivalent. be the orthogonal projection on the closed Let subspace of , where is understood to be the orthogonal , . complement of Next, define the bounded linear projection as follows: (34) In the next proposition, we show that the operator induced norm is equal to , and in fact it suffices to work in the standard Hardy space . , then Proposition 1: Let 1) (37) where is the usual composition operation. It is clear from the definition of , and the properties of the orthogonal projection that every vector function can be decomposed uniquely as the sum of two vector functions as follows: where (35) 2) (36) Proof: See the Appendix. is finite dimensional, then the dual space of Note that is clearly given by the Banach space [21]. is reflexive. That is It follows that (38) are orthogonal in the sense that . Also, by uniqueness, note that Since the orthogonal projection gives the best approxiby functions in the subspace mation of elements of in the -norm. The projection operator plays a similar role as but for approximations in -norm by elements of . the Now, define the following key bounded linear operator: and , by (39) Define the following continuous bilinear transformation: Recall that ( ) is the closed sub( ) consisting of space of ( )-bounded pairs of analytic functions in the unit disk. In the next proposition, their dual spaces are characterized. and be defined as Proposition 2: Let before, then the following statements hold. 1) . . 2) Hence, and are reflexive Banach spaces. (40) (41) Then, the norm of is given by (42) DJOUADI: OPERATOR THEORETIC APPROACH TO THE OPTIMAL TWO-DISK PROBLEM The bilinear map may be linearized by invoking a projective tensor product space [30]. To see this, define a bounded linear operator on the projective tensor product , by . The projective norm of is defined as shown in the first equation at the , for bottom of the page. More explicitely, letting , , may be written as , in which case (43) A useful property of the projective tensor norm is that it is finitely generated [30], that is, the search in (43) can be restricted and to finite dimensional subspaces of . This fact will be exploited in Section VI-D to derive . The operator plays a central role in an SDP to compute finding an explicit solution to our problem through the following theorem which quantifies optimal performance. Theorem 3: Let be the optimal performance index defined by (7), then under assumptions (A1) and (A2), the following hold. i) is equal to the induced norm of , namely 1613 operators are defined on the standard Hilbert spaces (or suband , and where a multiplication by a cerspaces of) tain unitary operator transform the problem to a one block problem. The difference here is in the definition of the nonHilbertian norm of the range space of , which precludes any transformation to one block problem by pre- or postmultiplication by unitary operators. The existence of a maximal vector for the operator under the previous conditions, implies that a well-known Hankel–Toeplitz operator, which solves the stanproblem, achieves its norm on its discrete dard two-block spectrum for (possibly) infinite-dimensional systems [15]. This settles an open question in [32] and [33]. The existence and computation of maximal vectors, although difficult, is important and lead to the exact computation of the optimal controller as shown in Section VI-B for the SISO case. be arbitrary but fixed, there Proof of Theorem 3: Let , , and exists , , such that (46) and has the form However, It follows then , . , thus (44) ii) There exist maximal vectors for , i.e., -norm 1, and , , or putting , then of such that (45) Remark 2: The operator may be viewed as an analog to the Sarason operator [31], and Young’s operator [13]. In parproblem, where these ticular, for the standard two-block since . Note that (47)–(49), as shown at the bottom of the page, hold true. Since is arbitrary, we get (50), as shown at the bottom of the page. The last equality follows from Theorem 1. where and (47) therefore (48) (49) (50) 1614 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 49, NO. 10, OCTOBER 2004 Now, we would like to prove the reverse inequality. Note that (51) , similarly and has norm 1. The member on the left-hand side of (53) is equal to (54)–(55), as shown at is the orthogonal projecthe bottom of the page, where to . In fact, tion from . is the left multiplication operator by and has norm , acting from to By Lemmas 3 and 4 and Theorem 2, there exists , with having , and denotes the duality -norm 1, and such that product in (52) By the proof of Theorem 2, , ; and there exists , , such that However, . Moreover, then , since for some , i.e., , and . ; see the first equation shown at Hence, , the bottom of the page. Noting that . , since , so . Therefore, . Likewise, . Therefore, . Now, let denote the identity matrix, then (53) Exploiting an idea in [34], the subspace consisting of functions each of whose columns belongs to can be identified with the tensor , and will henceforth be denoted so. Likewise . We have , , and (56) (57) . Expressions Hence, , and equality in (55) (50), (52), and (55) imply that must hold inferring existence of maximal vectors. The orthogonal projection plays a crucial role in solving problem, our problem as well as the standard two-block e.g., mixed sensitivity problem. Thus, it is important to compute it explicitly, hence the following theorem. be the orthogonal projection from Theorem 4: Let to , then (58) is the standard Riesz projection, i.e., if where has Fourier coefficient , , then and has Fourier coefficients such that , and , Remark 3: As noted previously, the projection not only twosolves our problem, but also the standard quadratic as acting on the Hilbert space block problem by viewing , and without having to rely on unitary transformations to transform the problem to an equivalent one block problem (see, e.g., [32] and [33].) The same remark applies to problem (when ) by viewing as the standard acting on instead (see, e.g., [35]). (54) (55) DJOUADI: OPERATOR THEORETIC APPROACH TO THE OPTIMAL TWO-DISK PROBLEM Proof of Theorem 4: For , let us compute 1615 Hence, equality must hold throughout, in which case it is posso as to satisfy sible to choose the optimal , Remark 4: For SISO systems ( ), is a scalar function, , and by uniqueness of the optimal solution, we can obtain a formula for the optimum (64) (59) is indeed a projection. Clearly, the adjoint of , although defined on different itself, so that is spaces, is equal to an orthogonal projection. Next, we show that the null space of , . then Let , since , then and, therefore, . Hence, . Conversely, let , then so thus , so and, therefore, It follows from [23, Th. 1, p. 155] that . (60) The following Corollary follows then from Theorems 3 and 4, and provides an explicit solution to the two-disk problem. is given exCorollary 1: The optimal performance index plicitly by the following expression for some , , and , : (61) Next, the optimal controller for the two-disk problem is characterized explicitly. which determines through we get uniquely. Multiplying and dividing , hence, , The optimal con, where , troller achieves the optimal robust performance. For the general case (MIMO systems) the optimal controller is in fact highly nonunique, and a more involved procedure is required in order to compute it. We need sufficiently many independent directions to determine the whole symbol . A good starting point are [34], [36], and [37]. The norm of the operator gives the optimal performance and controller. It is therefore important to develop an algorithm to compute it. This motivates the next section. C. Computation of the Operator Norm In this section, we show how it is possible to compute the and hence the optimal performance norm of the operator , has a power index . First, note that every , for some . series expansion Therefore, , ; ; form , where , is the usual a basis for . To determine a representation of with canonical basis of , ; ; respect to the basis it suffices to study the action of on each element of the for a fixed . We have then for basis say, for example, , , where is the th column of . In the representation of , the th column will be the representation of with respect to the basis , ; ; of B. Optimal Controller Theorem 1 implies that there exists a vector function such that . Hence, we can interpret (62) Moreover by Proposition 1, may be viewed as a acting from into multiplication operator such that . Equivmay be viewed as an operator acting from alently, into , the projectve tensor product . We have by Theorem 4 (63) , for each , as the matrix function whose columns are projections of the columns of onto the subspace . With this in mind, the action of on is then determined by . Therefore, with respect to the canonical basis , has the following representation: (65) Each column in (65) is the image of a basis vector after applying . These columns are determined by studying the action of on the basis vectors , as explained above; the th column corresponding to the representation of with ; is then formed by the coeffirespect to the basis in with respect to . To compute cients of 1616 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 49, NO. 10, OCTOBER 2004 these coefficients consider, , and belong to ries expansion in the unit disk , since , they all admit power se- At the th step, we have • Remark 5: The previous formulas show that when the first column is computed, all the remaining columns follow easily. However, from a computational point of view, it is more efficient to compute these columns recursively. Hence, the following recursive algorithm. Then, we have , and Recursive Algorithm: For • (66) (67) Where ; . The operator has then the following infinite matrix representation , and for , with respect to the basis , , the action of on is determined by , where We get also . Now, each column • • , . Hence, straightforward computations yield the following. • .. . .. . .. . .. . .. . .. . .. . .. . .. . .. . .. . .. . (68) of By the proof of Theorem 3, there exist -norm 1, of -norm 1, which form a maximal vector for . Next, we show that is such that • (69) That is , , ; we will get If , for (70) DJOUADI: OPERATOR THEORETIC APPROACH TO THE OPTIMAL TWO-DISK PROBLEM Equality (69) is important since it allows us to bring in some Hilbert space structure as shown by (70). Indeed alignment in , we the dual implies that (71) holds and, for some have (71) 1617 over . Clearly, ; and that is uniformly bounded. The defined as the dimension , is such that . The of the image space of operators , , have rank less than can be represented by a finite or equal . The operators matrice formed by the first columns and rows of the infinite matrix representing in (68). We show that in fact as tends to infinity converges to in the strong operator topology (SOT), i.e., for (79) (72) (73) Indeed, by adding and subtracting implies , the triangle inequality (74) Applying Cauchy–Schwarz inequality, we get (80) (75) thus Since is finite dimensional, then by the Riesz–Fischer Theorem (cf. [38, Lect. VI]) the two terms on the right-hand side of (80) tend to zero and, thus, (79) must hold. has the following Noting that any in form for some , to the similar arguments apply then by restricting over . Calling the orthogonal prospan of , , the following holds jection on as . Therefore, under assumptions (A1) and (A2) we have (76) (77) Hence, equality must hold throughout, and from (73) and (74) we deduce as and Note that the bilinear form (81) are finite rank operators. Defining (82) (78) hence , , and (69) and (70) must hold. This fact wil be used in approximating the operator norm in (84) as discussed in the next section. D. Approximation by Special Matrix-Norms In what follows, we will show that it is possible to approximate the norm of by the norms of finite-dimensional operators or matrices. Indeed, if we restrict to finite dimensional subspaces of its domain, then we will obtain a sequence of finite rank operators whose norms are “nearly” optimal, in the sense that they approach the norm of as the dimension of the initial spaces increases. More explicitly, let denote the orthogonal projection from to the subspace , ; and spanned by the orthogonal projection from onto the span of the corresponding linear operator is then defined by . Identity (81) implies as in the SOT. Moreover, with that , where respect to the canonical basis is the standard basis of , can be represented by a matrix obtained by truncating the infinite . matrix representation in (68) of , i.e., can be computed recursively using the The columns of previous recursive algorithm. Similarly, , can be represented by a matrix which columns can be computed as in the previous recursive algorithm , where each entry developed in Section VI-C, in belongs to , . Putting , . and , are the 1618 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 49, NO. 10, OCTOBER 2004 vectors containing the th-first Fourier coefficients of respectively. Expression (82) of reduces then to and subject to (83) (88) Each can be decomposed as Call , appropriately in such a way that , . Partitioning . , where , , are matrices, the rows of are the even numbered rows of , and ’s rows are the odd numbered rows of . Note that , for some (constant) permutation matrix of appropriate dimension. The optimum can then be approximated by If as , then the minimization (87) can also be written (89) subject to (90) .. . (84) where as . To show that the optimization in (84) is in fact convex, note first that is a bilinear form in and . To linearize it let , and invoke again the projective and , tensor product of , under the projective norm defined below. Observe that the space of bilinear forms on is isometrically isomorphic to the dual space of [30]. This means that the bilinear form may be replaced by a bounded linear functional on . We have then shown that the optimization (84) can be rewritten as (85) is the projective norm of and is defined as shown where in (86) at the bottom of the page. The computation (86) has been shown in [39] to reduce to the following SDP with linear matrix , inequality (LMI) constraints, by noting that and in fact it is a little easier to work with the latter. (87) (91) Therefore, the problem reduces to solving the optimization (85) subject to the SDP with LMI constraints (89) and (91). Remark 6: It is therefore possible to approximate uniformly as closely as desired the norm of , and hence , by computing the special matrix norms defined by (89). From expression (85), we see that explicit computation of the maximal vectors of , (or ), will shed light on the form of the maximal vector of (respectively, ) , and therefore the optimal controller . As showed this is realized by solving a convex optimization problem with LMI constraints. VII. CONCLUSION This paper was concerned with the so-called two-disk problem, which arises in many control problems, including the fundamental tradeoff between sensitivity and complementary sensitivity functions [2]. The two-disk problem is intractable by established methods in the standard theory, and the duality theory developed in the work of Owen and Zames does not characterize explicitly exact solutions, in particular when certain uniform Lipschitz conditions (see [9]) do not hold. This provided the motivation of the study undertaken here. We started by a re-examination of the duality theory of the problem working in a smaller (matrix-valued) space (rather than a larger (matrix-valued) ). The two-disk problem was then interpreted as a distance minimization, allowing predual and dual representations under a weaker continuity assumption for the latter. Useful geometric properties were deduced, for example, an extremal identity for optimal performance/controller which provides a test of optimality; the extremal function in the predual space satisfies a norm constraint. Then a new operator theoretic framework was developed based on the introduction of (86) DJOUADI: OPERATOR THEORETIC APPROACH TO THE OPTIMAL TWO-DISK PROBLEM two non-Hilbertian versions of the vector valued space. The optimal solution was shown, under specific conditions, to be equal to the norm of a certain operator, which is a combination of multiplication and Toeplitz operators subject to a projective tensor norm. Existence of at least one maximal vector was deduced, which resulted in an explicit formula for the optimal controller in the SISO case. An infinite matrix representation of the operator with respect to a canonical basis was provided, together with approximation by norms of finite rank operators, whose computations involve solving an SDP problem. It should be noted here that the results obtained are valid, at least in theory, for infinite dimensional systems. The theory developed also generalizes to unstable systems using coprime factorization techniques [15]. A similar duality and operator theory, (although the latter was already developed in the eighties), has problem in [15], been developed for the quadratic two-block which may be useful for infinite-dimensional systems. In particular, a numerical solution based on convex programming along the lines of [10] and [7] is possible for this class of systems. 1619 The reverse inequality is clear since follows from [23, Th. 1, p. 121]. . The third equality B. Proof of Lemma 3 By Lemma 2, there exists a measure , , such that (26) holds, i.e., if we let to be the sum of the total variations of the entries of , and its linear extension to which satisfies for all . Then, by (26), we have (92)–(95), as shown at the bottom of the page, where , and isfies written as . The extremal measure (96) APPENDIX A. Proof of Lemma 2 The second equality follows from the same argument used to prove Lemma 2 [10], [7], [3], where unlike here it is assumed is continuous on the unit circle. Let , be that . Define the scaling of the unit disk, and , and note that , and is continuous on the unit circle by assumption (A2), then Note that satcan be for some suppose that and and and , and . Next, is a Borel subset of , such that , . Define for , , for , then , which implies , on , and from a standard result from matrix theory , on , but since , then must be identically the zero matrix. is restricted Therefore the maximum in (26) remains , if , where (97), as shown at the bottom of the page, holds, to but , therefore, by duality is bounded above by , since is subharmonic and satisfies the maximum principle. By continuity as . Hence (98) contradicting hypothesis (A2) (i.e., solutely continuous with respect to ). Hence, is ab- . Let (92) (93) (94) (95) (97) 1620 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 49, NO. 10, OCTOBER 2004 be the Lebesgue decomposition of part. Since we have that , where is its singular By a), there exists such that (28) holds. On the other hand, by uniqueness of the limit we have thus , and then It follows by the F. and M. Riesz Theorem that is absolutely continuous w.r.t. the Lebesgue measure , therefore likewise for , and hence (27) holds. C. Proof of Lemma 4 a) There exists a sequence such that (99) have Equation (99) implies that the measures uniformly bounded total variations, by Alaoglu’s Thehas a cluster orem [23], the sequence say. Hence, point as we get . Putting , . D. Proof of Proposition 1 1) Equality (35) follows from a measure theoretic argument; is obvious since for first note that and ; if then we have (100) , However, therefore , and implying annihilates we deduce . Thus, , by the F. and M. Riesz Theorem (101) and, hence, a) holds. b) There exists a sequence where , , such that (102) and then by the same argument as (a), converges in the topology, we have for as hence, . For the reverse inequality, , have singular values arranged in decreasing let , , ; and order singular value decomposition , , 2, where and are unitary -matrix valued functions, and . Again by the Riesz Theorem, there exist such that Moreover, for each the columns of , , form a complete orthonormal basis of eigenvectors for each of , . The first column of , , , corresponds to the largest singular value of , , . Now let ; then by definition of of positive Lebesgue the supremum there exists a set measure such that , Let be the characteristic function of , and . Then L , and we define have . Therefore we have and . Replacing by their singular value decompositions, yields (103) Since However, with (103) imply that , and together and are unitary, we obtain then DJOUADI: OPERATOR THEORETIC APPROACH TO THE OPTIMAL TWO-DISK PROBLEM Since and correspond to the largest eigenvalues of and , respectively, then Since is arbitrary, then conclude that 2) Clearly by 1) , and we . (104) be a trigonometric polynomial in and . Then Let , where Multiplying by the th power of 1621 coincide, if and only if the positive Riesz projection on is bounded, i.e., if has , , then Fourier coefficient and has Fourier coefficients such , and , . We set that . Then, is a linear operator on with . The negative Riesz projection is values in defined analogously, it associates to its negative Fourier , where has Fourier coefficoefficients, i.e., such that , , and cients , . will be used later to prove a forthcoming theorem. Since the trigonometric polynomials with coare dense in , it suffices to efficients in , , prove that , trigonometric polynomial , ,with . Hence, we have , yields However, since the trigonometric polynomials are dense in then and so is bounded. 2) Follows in a similar fashion. ACKNOWLEDGMENT so (36) must hold and we are done. E. Proof of Proposition 2 1) The dual space of sociated space is equal to sociated norm can be identified with its as(Theorem 4 [40]), where , but under the as- (105) , and . Note that can be regarded [40]. Define the identity as a subspace of , then is conmapping , tinuous. By Theorem 5 [40], i.e., the associated norm and the norm of where This project was initiated while the author was a Ph.D. student under the supervision of the late Prof. G. Zames. The author would like to take this opportunity to extend his sincere appreciation to Prof. Zames for his guidance and kindness by dedicating this paper in his memory. REFERENCES [1] E. Hille and R. S. Phillips, Functional Analysis and Semi-Groups. Providence, RI: AMS, 1957. [2] J. C. Doyle, B. A. Francis, and A. R. Tannenbaum, Feedback Control Theory. New York: MacMillan, 1990. [3] J. G. Owen and G. Zames, “Robust disturbance minimization by duality,” Syst. Control Lett., vol. 19, pp. 255–263, 1992. [4] G. Zames, “Feedback and optimal sensitivity: Model reference transformations, multiplicative seminorms, and approximate inverses,” IEEE Trans. Automat. Contr., vol. AC-26, pp. 301–320, Apr. 1981. [5] J. F. Bird and B. A. Francis, “On the robust disturbance attenuation,” in Proc. IEEE Conf. Decision Control, 1986, pp. 1804–1809. [6] B. A. Francis, “On disturbance attenuation with plant uncertainty,” presented at the Workshop on New Perspectives in Industrial Control System Design, Oxford, U.K., 1986. [7] J. G. Owen and G. Zames, “Duality theory of robust disturbance attenuation,” Automatica, vol. 29, no. 3, pp. 695–705, 1993. 1622 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 49, NO. 10, OCTOBER 2004 [8] M. C. Smith, Well Posedness of H Optimal Control Problems. Philadelphia, PA: SIAM, 1988, vol. 28, pp. 342–358. [9] J. G. Owen, “Performance optimization of highly uncertain systems in H ,” Ph.D. dissertation, Dept. Elect. Eng., McGill Univ., Montreal, QC, Canada, 1993. [10] G. Zames and J. G. Owen, “Duality theory for MIMO robust disturbance rejection,” IEEE Trans. Automat. Contr., vol. 38, pp. 743–752, May 1993. [11] J. Holohan and M. Safonov, “Neoclassical control theory: A functional analysis approach to optimal frequency domain synthesis,” Control Dyna. Syst., vol. 40, pp. 297–329, 1992. [12] V. Besson and A. T. Shenton, “An interactive parameter space method for robust performance in mixed sensitivity problems,” IEEE Trans. Automat. Contr., vol. 44, pp. 1272–1276, June 1999. [13] C. Foias, H. Ozbay, and A. R. Tannenbaum, Robust Control of Infinite Dimensional Systems. New York: Springer-Verlag, 1996. [14] J. W. Helton, “Operator theory, analytic functions, matrices and electrical engineering,” in Proc. CBMS Regional Conf. Series Mathematics, vol. 68, 1987. [15] S. M. Djouadi, “Optimization of highly uncertain feedback systems in ,” Ph.D. dissertation, Dept. Elect. Eng., McGill Univ., Montreal, QC, Canada, 1998. , “Banach space optimization of uncertain systems in H ,” Ph.D. [16] dissertation, Dept. Elect. Eng. McGill Univ., Montreal, QC, Canada, 1997. [17] S. M. Djouadi and G. Zames, “Operator theoretic solution to the optimal robust disturbance attenuation problem,” in Proc. World Multiconf. Systemics, Cybernetics, and Informatics, vol. 3, July 1997, pp. 25–29. , “On optimal robust disturbance minmization,” in Proc. Amer. Con[18] trol Conf., vol. 3, 1998, pp. 1531–1535. [19] S. M. Djouadi, “Exact solution to the nonstandard H problem,” in Proc. IEEE Conf. Decision and Control, vol. 3, 1998, pp. 2843–2848. [20] M. G. Safonov, “Imaginary-axis zeros in multivariable H -optimal control,” in Modeling, Robustness and Sensitivity Reduction in Control Systems, R. F. Curtain, Ed. Berlin, Germany: Springer-Verlag, 1987, vol. F34, NATO ASI Series, pp. 71–78. [21] J. Dieudonnée, “Sur le théorème de Lebesgue Nikodym V,” Canadian J. Math., vol. 3, pp. 129–139, 1951. [22] R. Schatten, Norm Ideals of Completely Continuous Operators. Berlin, Germany: Springer-Verlag, 1960. [23] D. G. Luenberger, Optimization by Vector Space Methods. New York: Wiley, 1968. [24] E. Bishop and R. R. Phelps, “A proof that every banach space is subreflexive,” Bull. Amer. Math. Soc., vol. 67, pp. 97–98, 1961. [25] R. Ryan, “The F. and M. Riesz theorem for vector measures,” Indag. Math., vol. 25, pp. 408–412, 1963. [26] J. B. Garnett, Bounded Analytic Functions. San Diego, CA: Academic, 1981. [27] P. L. Duren, Theory of H Spaces. San Diego, CA: Academic, 1970. [28] V. P. Havin, “Spaces H and L = ,” J. Soviet Math., vol. 39, pp. 120–148, 1974. H H [29] W. W. Rogosinski and H. S. Shapiro, “On certain extremum problems for analytic functions,” Acta Mathematica, vol. 90, pp. 287–318, 1953. [30] A. Defant and K. Floret, Tensor Norms and Operator Ideals. Amsterdam, The Netherlands: North-Holland, 1993. [31] D. Sarason, “Generalized interpolation in H ,” Trans. Amer. Math. Soc., vol. 127, pp. 179–203, 1967. [32] E. A. Jonckheere and M. S. Verma, “A spectral characterization of H -optimal feedback performance and its efficient computation,” Syst. Control Lett., vol. 8, pp. 13–22, 1986. [33] G. Zames and S. K. Mitter, “A note on essential spectra and norms of mixed Hankel-Toeplitz operators,” Syst. Control Lett., vol. 10, pp. 159–165, 1988. [34] N. J. Young, “The Nevanlinna-Pick problem for matrix-valued functions,” J. Oper. Theory, vol. 15, pp. 239–265, 1986. [35] B. A. Francis and J. C. Doyle, “Linear control theory with an H optimality criterion,” SIAM J. Optim. Control, vol. 25, pp. 815–844, 1987. [36] V. V. Peller and N. J. Young, “Superoptimal analytic approximations of matrix functions,” J. Functional Anal., vol. 120, pp. 300–343, 1994. [37] V. V. Peller and S. R. Treil, “Approximation by analytic matrix functions: The four block problem,” J. Functional Anal., vol. 148, pp. 191–228, 1997. [38] H. Helson, Lectures on Invariant Subspaces. New York: Academic, 1964. [39] S. M. Djouadi, “MIMO disturbance and plant uncertainty attenuation by feedback,” IEEE Trans. Automat. Contr., 2004, submitted for publication. [40] S. Levi, “A note on Hardy spaces of vector valued analytic functions,” Bollettino U.M.I., vol. 5, no. 4, pp. 53–62, 1972. Seddik M. Djouadi received the B.Sc. degree (with first class honors) from Ecole Nationale Polytechnique, Algiers, the M.A.Sc. degree from Ecole Polytechnique, Montreal, QC, Canada, and the Ph.D. degree from McGill University, Montreal, QC, Canada, all in electrical engineering, in 1989, 1992, and 1999, respectively. He is currently an Assistant Professor in the Electrical and Computer Engineering Department, University of Tennessee, Knoxville. He was an Assistant Professor with University of Arkansas, Little Rock, and held postdoctoral positions in the Air Force Research Laboratory and the Georgia Institute of Technology, Atlanta, where he was also with American Flywheel Systems (AFS), Inc., Medina, WA. His research interests are power control for wireless networks, robust control, active vision, and identification. Dr. Djouadi received three U.S. Air Force/National Research Council Summer Faculty Fellowship awards, the Tibbet Award with AFS in 1999, and a finalist for the American Control Conference Best Student Paper Certificate (best five in competition) in 1998.