Proceedings of the 42nd IEEE Conferenceon Decision and Control Maui, Hawaii USA, December2003 ThP04-6 Characterization of the Optimal Disturbance Attenuation for Nonlinear Stochastic Uncertain Systems Charalambos D. Charalambous, t Farzad Rezaei and ISeddik M. Djouadi +School of Information Technology and Engineering University of Ottawa Ottawa, Ontario, KIN 6N5, Canada E-mail: {chadcha,frezaei} @ site.uottawa.ca $System Engineering Department, University of Arkansas at Little rock Little Rock, AR 72204 E-mail: msdjoudi@ualr.edu Abstract-This paper is concerned with an abstract formulation of stochastic optimal control systems, in which uncertainty is described by a relative entropy constraint between the nominal measure and the uncertain measure, while the payoff is a functional of the uncertain measure. This is a minimax game in which the controller seeks to minimize the pay-off, while the disturbance described by a set of measures aims at maximizing the pay-off. When stochastic dynamical systems are considered, this problem is known to be equivalent to the optimal disturbanceattenuationproblem arising in Hm control. The objective of this paper is to derive several properties of the optimal solution. One such property is the characterization of the optimal disturbance attenuation, in terms of the nominal measure and an estimate of the uncertain measure. The characterization is important for computing, as well as comparing, the solution of sub-optimal disturbance attenuation problems to the optimal one. Key Words: Nonlinear Uncertain Stochastic Systems, Large Deviations, Relative Entropy, Minimax Games, Duality Properties. I. INTRODUCTION Subsequent to the publication of Zames [ l ] seminal paper on H" control, several tools have been introduced to extend the H" techniques to nonlinear deterministic and stochastic systems. Three pay-off functionals which received significant attention are deterministic minimax games [3], risk-sensitivity pay-offs 141, [51, [61, [71, 181, [91, 1101, [ l l l , [12], [13], [15], andistochastic minimax games [ll], [12]. For nonlinear problems robustification is captured through dissipation inequalities [2]. Another class of problems aiming at robustification are described through relative entropy constraints, using duality relations between relative entropy and free energy. The problems are equivalent to risk-sensitive problems [18], [17], [15], and thus equivalent to the minimax game formulation [15] of the H m disturbance attenuation problem. For linear fully observable or partially observable problems, the characterization of the optimal disturbance attenuation is expressed in terms of certain Ricatti equations. Unfortunately, in nonlinear 0-7803-7924-1/03/$17.00 02003 IEEE problems such Ricatti equations are not available. Therefore, one considers the sub-optimal problem by either fixing the structure of the controller or by considering minimax dynamic games that result in strategies which ensure a certain dissipation inequality with respect to the supply rate (energy functional) [3]. Ideally, one would like to have an explicit expression which describes the optimal disturbance attenuation level for general nonlinear fully observed and partially observed (output feedback) problems. Such an expression will be useful in determining how far any sub-optimal solution is from the optimal one. The main contributions of this paper are the following. First, several properties associated with the optimal solution are derived, Second, the optimal disturbance attenuation level is characterized explicitly through the relative entropy between the worst possible estimate of the uncertain measure and the nominal measure. The results are quite general; they are applicable to both fully observed and partially observed stochastic uncertain dynamical systems. The abstract formulation is the following. Let (E, d) denote a complete separable metric space (Banach Space), and (E, B ( C ) ) the corresponding measurable space in which B ( C ) are identified as the Bore1 sets generated by open sets in E. Let M ( C ) denote the set of probability , the set of admissible controls, measures on ( C , B ( C ) ) Uad and let L" : C + R be a real-valued measurable function, bounded from below, which depends on the control U E U a d . Here, M (E) denotes the set of all possible measures induced by the stochastic systems, while Lu denotes the energy function associated with a given choice of the control law E Uad. Given a control law U E Uad and a nominal measure p" E M(E) induced by the nominal stochastic system, the pay-off is defined by 4260 J ( u ,v") = E,* (e") = J, Pdv" 0.1) Using duality relations, it is shown that "s*" is the optimal disturbance attenuation level associated with H m disturbance {vu E M ( C ) ; H ( v " l p " ) I R, vu << p " } attenuation problems. Thus, computing s* corresponds to where R E (O,oo),1-1" E M ( C ) and H ( v " ( p " ) denotes the computing the optimal disturbance attenuation level. One of relative entropy between the uncertain measure vu and the the main contributions of this paper is the characterization nominal measure p" (see Definition 2.1), and << denotes of the optimal disturbance attenuation s*. In particular, absolute continuity of v" E M ( C ) with respect to p" E the Lagrange multiplier s*, which corresponds to the optimal disturbance attenuation is characterized through the M (E). For a given control law U E Uad, and nominal measure p" E relative entropy between an estimate of the true measure M ( C ) ,let vu?* E M ( C )denote the measure which achieves vug* E M ( C ) and the nominal measure p" E M ( C ) . Thus, since s* corresponds to the optimal disturbance attenuation the supremum in (1.1) defined by level associated with the H" disturbance attenuation J ( u ,v " ~ ' ) (1.2) problem, this characterization provides an explicit formula for computing as well as comparing the solution of the SUP e"dv" { U ~ € M ( C ) ; H ( V ~ ~ ~ " ) ~ R , ~ ~ < <C~ ~ , / L " Esub-optimal M ( C ) } disturbance attenuation problem to the optimal one. The robust control problem is to find a control law U* E i?&d In addition, several monotonicity properties of the dual which impacts functional are derived, including a lower and upper bound J ( u * )= inf J ( u , vu>*) on the dual functional (ps'(u,R). UEUad subject to fidelity J = inf "EUod ( 11. DUALITY RELATIONS AND TILTED MEASURES In this section the basic definitions and duality relations between relative entropy, free energy, and cumulant moment generating functions are introduced. These quantities are employed in subsequent sections to derive the results discussed J ewvu> {~"EM(C);H(V~I~~)~R,V~<<~".~~€M(C)} SUP C (1.3) k" M(C)} The constraint E M(C);H(v"lp") 5 R, v" << p", p" E describes the set of all admissible in the introduction. measures induced by the stochastic system, relative to the nominal measure. The pay-off EuU P represents average Definition 2.1: For a given U E Uad, let U", p" E M ( C ) energy with respect to the unknown measure v" E M ( C ) , and C" : C -+ R a measurable function. 1) The moment generating function of e" with respect to p" such as integral quadratic constraints, tracking errors, etc.. The parameter R describes the admissible level of uncertainty is defined by associated with a family of stochastic systems with respect to the nominal one. Since H(v"1p") = 0 if and only if U" = p", the parameter R is always in the interval ( O , o o ) , where s E R. otherwise the trivial solution vu** = p" will maximize 2) The cumulant generating function of e" with respect to (1.2). Clearly, larger values of R imply larger uncertainties p" is defined by measured by the relative entropy. Such a problem formun lation is also considered in [18] for the case of discreteQJpu(s) = logMpy(s)=log ese"dp" E (--oo,oo] (II.6) time stochastic systems and in [17] for discrete-time linear systems. However, the aim of this paper is fundamentally where s E R. different from the results of [18], [17], in both the general 3) The free energy of e" with respect to p" is defined by conclusions as well as the application considered. E,U(C") 5 Sp(1) E (-Oo,oo]; Specifically, the current paper is concerned with computing 4) The relative entropy of vu with respect to p" is defined the optimal strategies and pay-off by using duality relations by between the primal problem 0.1)- (1.3) and its dual unconstrained problem. In the dual space, the problem is described H(u"1p") 5 (D.7) by the infimum over s 2 0 of the Lagrangian functional, namely, U L A pS*( U , R)= inf s1°{ v " E M ( C ) ; SUP vu<<pU, p " E M ( C ) } { P d v " - s ( H ( v " J p " )- R ) } (1.4) otherwise It can be shown that EpU(CU) as a function of C" is convex, H(v"Ip") as a function of p", U" E M ( C )is convex in both 4261 arguments, H(v"1p") 2 0, and H(v"1p") = 0, if and only if ,U" = vu, and Alpu(s),Q p u ( s ) are convex functions of s E 8. In addition, the following result, is a generalization of a theorem found in [ 151. Theorem 2.2: For every measurable function C" : C + 8 bounded below and p" E M ( C ) and s > 0 ( log L e S d p " ) ' = Next, for every s E X,define the Lagrangian associated with the problem of Definition 3.1 A J S . R ( ~ , v u=) E,=(C")- s(H(v"1p") - R) and its associated dual functional J37R(U, VU>*) <</A",puEM (E)} JS(u, V U ) (111.11 ) SUP { v u EM (E); v u < <pU, SUP {vu EM (E);v' (111.10) pu EM (E)} In addition, define the quantity Pdv" - sH(v"Ip")} (11.8) (III.12) Moreover, if Cue$ E L 1 ( p u ) ,then the supremum is attained at Proof. The derivation is &har which treats the case s = 1. io the one found in [15], The above duality relation is important in solving the dual problem associated with the primal problems stated in the introduction. 111. PROPERTIES OF DUALFUNCTIONAL This section is concerned with reformulating the constrained optimization described in the introduction, as an unconstrained optimization using the theory of Lagrange functionah. Thus, the dual functional is introduced and for a fixed U E Uad several properties of the pay-off are presented. These include the range of values of the Lagrange multiplier associated with the dual functional, for which the constrained and unconstrained problems are equivalent. In addition, the value of the Lagrange multiplier for which the dual functional is minimized is explicitly characterized. Because the Lagrange multiplier corresponds to the attenuation level associated with disturbance attenuation problems, this characterization is important in computing the optimal solution of disturbance attenuation problems. The primal problem stated in the introduction is now defined precisely as follows. Dejinition 3.1: Let U E e" measurable and bounded below, C" E L l ( p " ) , p" E M ( C ) which is a fixed nominal measure, and R E (0,CO). Find vu>* E M ( C ) which achieves the supremum in (I.]), that is, J ( u , VU(*) - SUP 1 C"dv" which may or may not exist. In general J s ~ R ( ~ , v Uis* * ) not defined throughout the entire real line with respect to s E 3,and there might be regions in [ O , o o ) for which P R ( u ,vu>*)= CO.Next, several properties of the dual functional are derived, and the regions for which J37R(u,vu,*) is finite, are identified. These results are employed in Theorem 3.3, to show equivalence between the unconstrained functional (In. 12) (dual functional) and the constrained problem of Definition 3.1 (primal problem). In addition, the value of s* that achieves the infimum of the dual functional is characterized, and it is shown that s* occurs on the boundary of the constraint. Lemma 3.2: For a given U E Uad. 8" a measurable function bounded from below, and for some 0 < s 5 s* the following statements hold. 1) The dual functional J S y R (vU3*) ~ , is related to the cumulant generating function of C" with respect to p" E M ( C ) via J 3 3 ( U , VU?*)= s 1 - 1 s c SUP { v ' € M ( C ) ; uu<<pu, pUEM(C)} + P d v " - H ( v U I p " ) } sR = s l o g L e CL" Tdp" + s R = s Q p u ( -S1) + sR { (III.13) (III.14) Moreover, if cue$ E L 1 ( p U ) the supremum in (III.13) is attained at vu?* E M ( C ) given by (111.15) 2) The dual functional P R ( u ,vu") is convex in s > 0. A 3) The function rpU(s) = s Q p u ( $ ) is a non-increasing function of s E (0, CO), that is, { ~ u E M ( C ) ; H ( ~ U I ~ u ) I R , u Y < < p U , p L " ECM ( C ) } "9) where R E (0, m). 4262 where 0 < s2 I SI. 4) The infimum of the dual functional J s % R ( u , ~ " 9over *) s > 0 defined by and if E L l ( p u " )then A @i,(R)= cp"*(u,R)= inf s>o is a concave functional of R 2 0. Moreover, if there exists q > 0 such that Pe@' E L1(p"), then a) q,(o) = {e.} = Jc e v p u (III.18) where 0 b) lim s l o g k e q d p " = k P ' d p u = (ai,(O) S'CC {e.} 5 ( R )= cps*(u,R ) 5 R + E,%{e.} (111.20) 7) If e$ E L1(pcL"), E Ll(pu")and ( l u ) 2 e $ E L1(p") then the infimum of the functional JStR(u,vu?*)over s > 0 is uniquely attained at H ( ~ " ' * l p . " ) l ~= = ~R* (nI.21) Remark 3.4: According to Theorem 3.3, at s = s* the maximizing measure vu?* E M ( C ) occur on the boundary of the relative entropy constraints, and satisfies the monotonicity property (III.25). If the above abstract formulation is applied to stochastic dynamical systems, then the value of the Lagrange multiplier s* corresponds to the optimal disturbance attenuation parameter associated with stochastic optimal H" disturbance attenuation problems. Using the results of the previous Lemma 3.2, the equivalence between the unconstrained and constrained problems is established in the next theorem. Theorem 3.3: For a given U E Uad,C" a measurable function bounded from below, e? E L1(pcL") and for some 0 < s 5 s* the following statements hold. The primal constrained problem of Definition 3.1 and the unconstrained dual problem are equivalent, that is, inf s>o { J S ~ R vu,*)} ( ~ , = ps*(u, R) (III.22) J,eudv") { = (pa* (21,R) = E,= eu}ls=s* Proof. The derivation is found &-[20]. where vu?*is given by (III.15). That is, assuming the constrained and unconstrained problems are equal (which is shown next), a necessary condition for the infimum of the dual functional J s ~ R ( u , ~ "over ~*) s > 0 is that s* occurs on the boundary of the relative entropy constraint. Proof. The derivation is found in [20]. J S * ~ R vu?*) (~, = < s* I:s1 5 s2 and the dual functional is given by JS*'R(~, (III.19) 6) If there exists r] > 0 such that t"e@y E L 1 ( p u ) ,then the ) s > 0 is infimum of the dual functional J s , R ( ~ , v " i *over bounded above and from below as follows. E,, Moreover, if in addition Cue!eu E L1(pu),(eu)2ete" E L l ( p " ) then the infimum over s > 0 in (III.22) is uniquely attained at s* > 0 given by (111.23) 4263 IV. REFERENCES G. Zames, "Feedback and optimal sensitivity: Model reference transformations, multiplicative seminoms, and approximate inverses," IEEE Transactions on Automatic Control, vol. 26, no. 2, pp. 301-320, 1981. J. Willems, "Dissipative dynamical systems part I: General theory," Arch. Rational Mech. Anal., vol. 45, pp. 321-351, 1972. A. Isidori and A. Astolfi, "Disturbance attenuation and X m-control via measurement feedback in nonlinear systems," IEEE Transactions on Automatic Control, vol. 37, no. 9, pp. 1283-1293, 1992. D. Jacobson, "Optimal stochastic linear systems with exponential performance criteria and their relation to deterministic differential games," IEEE Tramactions on Automatic Control, vol. 18, no. 2, pp. 124-131, 1973. J. Speyer, "An adaptive terminal guidance scheme based on an exponential cost criterion with applications to homing missile guidance," IEEE Transactions on automatic control, vol. 21, pp. 371-375, 1976. P. Whittle, "Risk-sensitive Linear/Quadratic/Gaussian control," Advances in Applied Probability, vol. 13, pp. 764-777, 1981. [7] A. Bensoussan and J. H. van Schuppen, “Optimal control of partially observable stochastic systems with an exponential-of-integral performance index,” SIAM Joumal on Control and Optimization, vol. 23, no. 4, pp. 599-613, 1985. [8] K. Glover and J. Doyle, “State-space formulae for all stabilizing controllers that satisfy an H,-norm bound and relations to risk-sensitivity,” Systems and Control Letters, vol. 11, pp. 167-172, 1988. [9] P. Whittle, “A risk-sensitive maximum principle,” Systems and Control Letters, vol. 15, pp. 183-192, 1990. [lo] P. Whittle, “A risk-sensitive maximum principle: The case of imperfect state observations,” IEEE Transactions on Automatic Control, vol. 36, no. 7, pp. 793-801, 1991. [l 11 W. H. Fleming and W. M. McEneaney, “Risk-sensitive control and differential games, pp. 185- 197,” in Stochastic Theory and Adaptive Control, eds., T.E. Duncan and B. Pasik-Duncan, (New York), Springer-Verlag 1992. [12] M. James, J. Baras, and R. Elliott, “Risk-sensitive control and dynamic games for partially observed discretetime nonlinear systems,” IEEE Transaction on Auromatic Control, vol. 39, no. 4, pp. 780-792, 1994. [13] C. Charalambous and J. Hibey, “Minimum principle for partially observable nonlinear risk-sensitive control problems using measure-valued decompositions,” Stochastics and Stochastics Reports, vol. 57, 1996. [14] C. Charalambous, “The role of informations state and adjoint in relating nonlinear output feedback risksensitive control and dynamic games,” IEEE Transactions on Automatic Control, vol. 42, no. 8, pp. 11631170, 1997. [15] P. D. Pra, L. Meneghini, and W. Runggaldier, “Some connections between stochastic control and dynamic games.”Mathematics of Control, Signals, and Systems , vol. 9, pp. 303-326, 1996. [16] R. Elliott, L. Aggoun, and J. Moore, Hidden Markov Models Estimation and Control. New York: SpringerVerlag, 1995. [17] V. A. Ugninovskii and I. Petersen, “Minimax LGG control for stochastic partially observed uncertain systems,” SIAM Joumal on Control and Optimization, vol. 40, no. 4, pp. 1189-1226, 2001. [18] I. Petersen and M. R. James and P. Dupuis “Minimax optimal control of stochastic uncertain systems with relative entropy constrains,” IEEE Transactions on Automatic Control, vol. 45, pp. 398412, 2000. [I91 W. H. Fleming and E. Pardoux, “Optimal Control of Partially Observed Diffusions,” SIAM Journal on Control and Optimization, vol. 20, no. 2, pp. 261-285, 1982. [20] C. D. Charalambous and F. Rezaei and M. Djouadi, “Optimal Disturbance Attenuation for Nonlinear 4264 Stochastic Uncertain Systems: A Measure-Theoretic Minimax Approach,” Preprint, pages-45, 2003.