Optimal Control for a Scalar One-Step Linear System with Additive

2010 American Control Conference Marriott Waterfront, Baltimore, MD, USA June 30-July 02, 2010 WeB10.1 Optimal Control for a Scalar One-Step Linear System with Additive Cauchy Noise Moshe Idan, Amir A. Emadzadeh and Jason L. Speyer Abstract— An optimal control scheme is developed for scalar discrete linear dynamic systems driven by Cauchy distributed process and measurement noises. Since the Cauchy density has infinite variance, a cost function is defined for which the unconditional expectation with respect to the Cauchy densities produces a cost criterion that exists. After showing that this cost criterion allows a dynamic programming solution for the multistage problem, an optimal controller is determined for one step time update. Characteristics of the optimal controller is compared with the linear exponential Gaussian (LEG) controller. The dramatic performance difference between the Cauchy and the LEG controllers is studied. Furthermore, through different numerical examples, some interesting properties of the Cauchy controller are examined. I. I NTRODUCTION There are many engineering applications where one encounters random processes or noises which cannot be described by Gaussian probability distribution. Atmospheric and underwater acoustic noises are examples and are the governing noises in radar and sonar applications, which have a very impulsive character [1]. A class of probabilistic models used to represent impulsive noises is called stable non-Gaussian or symmetric alpha-stable (Sα-S) distributions (see [2] for a comprehensive treatment of Sα-S densities). In this class, the Gaussian distribution corresponds to α = 2, whereas α = 1 leads to the Cauchy probability density function (pdf). It has been shown that a Cauchy pdf better characterizes such impulsive types of sensor noise compared to the Gaussian [3], [4]. For example, in detecting a radar signal in clutter, it was shown that the in-phase component of radar clutter time series agrees extremely well with a SαS pdf with α = 1.7 [3]. In that study both the maximum likelihood Gaussian (MLG) and Cauchy (MLC) detectors were developed. The result is that for all α ∈ [1, 2] the MLC is very close to the Cramer-Rao bound, whereas the MLG deviates significantly from the Cramér-Rao bound as α goes from 2 to 1. The above observations about the nature of the impulsive noises and the robustness characteristics of the Cauchy This work was partially supported by Air Force Office of Scientific Research, Award No. FA9550-09-1-0374, and by the United States - Israel Binational Science Foundation, Grant 2008040. Moshe Idan is an Associate Professor at the Faculty of Aerospace Engineering, Technion, Haifa, 32000, Israel. moshe.idan@technion.ac.il Amir Emadzadeh is a Post Doctoral Scholar at the the Mechanical & Aerospace Engineering Department, University of California, Los Angeles, CA 90024, USA. amire@ee.ucla.edu Jason Speyer is a Professor at the Mechanical & Aerospace Engineering Department, University of California, Los Angeles, CA 90024, USA. speyer@seas.ucla.edu 978-1-4244-7425-7/10/$26.00 ©2010 AACC detectors motivated the development of sequential estimators for linear systems with additive Cauchy noises [5], [6]. Such heavy tailed pdfs do not posses well defined and/or finite moments of any order [7], which make the associated estimation problem significantly more challenging compared to the Gaussian case. Utilizing the characteristics and properties of the Cauchy estimators in [5], [6], a new optimal control paradigm for dynamic systems driven by Cauchy distributed noises is developed in this paper. The main challenge is the fact that in such systems the unconditional, time propagated pdf of the system state is not defined. This requires posing a different performance criterion than the ones normally used in, e.g., linear-quadratic-Gaussian (LQG) or robust control settings. The rest of the paper is organized as follows. The optimal control problem is formulated in Section II. A dynamic programming structure is developed in Section III. The solution to the problem for one step time update is presented in Section IV. The Gaussian equivalent optimal control problem is presented, solved, and discussed in Section V. Numerical examples are given in Section VI. Concluding remarks and suggestions for future work are presented in Section VII. II. P ROBLEM S TATEMENT Consider the following linear, discrete-time, scalar stochastic system, xk = Φxk−1 + uk−1 + wk−1 (1a) zk = xk + vk (1b) where xk is the state, uk is the control signal, zk is the measurement, and k is the time index. The signals wk and vk are process and measurement noise sequences, respectively, that are assumed to be independent of each other and Cauchy distributed with pdfs β/π , β>0 + β2 γ/π fVk (vk ) = 2 , γ > 0. vk + γ 2 fWk (wk ) = wk2 (2a) (2b) In addition, the initial state is assumed to be also Cauchy distributed α/π , α > 0. (3) fX0 (x0 ) = (x0 − x̄0 )2 + α2 In posing an optimal control problem for the model in (1), the commonly used cost criteria, such as quadratic, exponential, and more, cannot be used since the expectations requited to evaluate those criteria are infinite when the system noise inputs have Cauchy pdfs. Rather, in the context of the 1117 system with the heavy-tailed Cauchy noises it turns out that a new criterion that resembles in its form the Cauchy pdf’s is required to allow for an analytical derivation of a controller. Note that the stochastic system (1) can be decomposed into deterministic and stochastic parts by exploiting the linearity of the system. Let x̄k and z̄k be deterministic, and x̃k and z̃k be stochastic variables such that xk = x̄k + x̃k zk = z̄k + z̃k . Then, the cost function is a product of the membership or penalty functions of (8), defined by ψ(XN , UN −1 ) = = (4a) (4b) N −1 Mx (xk+1 )Mu (uk ) k=0 N −1 k=0 1 1 · 2 . 2 x2k+1 + ηk+1 uk + ζk2 (10) with initial condition x̄0 , and the stochastic part is given by The optimal control problem is to maximize the unconditional expectation E [ψ(XN , UN −1 )] with respect to the control history γ(0, N − 1), where E[·] is the unconditional expectation. An important characteristic of this cost criterion is that for the Cauchy densities the unconditional expectation exists and therefore lends itself to a dynamic programming solution, as demonstrated in the next section. x̃k = Φx̃k−1 + wk z̃k = x̃k + vk . III. A DYNAMIC P ROGRAMMING S OLUTION Then, the deterministic part is described by x̄k = Φx̄k−1 + uk−1 (5a) z̄k = x̄k (5b) (6a) (6b) The stochastic output is determined as the difference between the actual measurements zk and the output of the deterministic model, as z̃k = zk − z̄k . The process and measurements noise pdfs were defined in (2a) and (2b), while the initial condition of this stochastic model is Cauchy distributed with fX̃0 (x̃0 ) = α/π , α > 0. x̃20 + α2 (7) The above decomposition will be used to derive the onestep Cauchy controller. A general cost criterion is defined which is compatible with the Cauchy pdfs and is reminiscent of the choice of the cost criterion for the linear-exponential-Gaussian (LEG) [8]. There, a multiplicative cost composed of exponentials of quadratic functions of the state and control, which resemble Gaussians, are used to penalize these variables. An original motivation was to think of these exponential functions as membership function, as proposed in fuzzy set theory [9]. Here, these membership functions are chosen as a rational polynomial. In particular, they take the form resembling a Cauchy pdf, where the state and control are penalized using the membership functions Mx (xk ) = x2k 1 , + ηk2 Mu (uk ) = u2k 1 . + ζk2 Define the information sequence as Ik+1 := {Ik , zk+1 , uk }, k = 0, . . . , N − 1, I0 = {z0 }. (11) Since Ik+1 uses the most up-to-date measurements, it is a current information pattern. Dynamic programming is applied by starting with the cost function J∗ = max γ(0,N −1)∈UN −1 = ∞ where Uk is the class of piecewise continuous functions of Zk . (12) −∞ E [ψ(XN , γ(0, N − 1))] max E E ψ(XN , UN −1 ) IN γ(0,N −1)∈UN −1 ∞ max = u0 −∞ ∞ max u1 −∞ u2 −∞ ∞ ∞ ∞ max ··· max −∞ uN −1 −∞ ψ(XN , γ(0, N − 1))f (XN |IN ) dXN f (zN |IN −1 )dzN · · · f (z3 |I2 )dz3 f (z2 |I1 )dz2 f (z1 |I0 )dz1 f (z0 )dz0 (8) (9a) (9b) (9c) E [ψ(XN , γ(0, N − 1))] . Expanding the expectation on the right-hand side results in Let the state, measurement, and control histories be defined as Xk := {x0 , . . . , xk }, Zk := {z0 , . . . , zk }, γ(l, k) := {ul , . . . , uk }, γ(l, k) ∈ Uk max γ(0,N −1)∈UN −1 (13) where f (· |· ) is the conditional density. The Fundamental Lemma [8] was used to interchange the maximization and expectation operations. Since the strategy is to apply dynamic programming to this problem, the optimization of (13) is manipulated into the form of a recursion rule for the propagation of an optimal 1118 value function. To get this function, first note that E ψ(XN , γ(0, N − 1)) Ik+1 = max γ(k+1,N −1)∈UN −1 ∞ max uk+1 ∞ ∞ max uk+2 −∞ Cauchy control problem is stated as 1 z̃0 JC∗ = max E E u0 x21 + η 2 C C (19) = max E Ψ = E max Ψ · · · max u0 uN −1 −∞ ψ(XN , UN −1 )f (XN |IN ) dXN f (zN |IN −1 )dzN −∞ · · · f (zk+3 |Ik+2 )dzk+2 f (zk+2 |Ik+1 )dzk+2 (14) and f (Zk+1 |γ(0, k − 1) ) dZk+1 = f (zk+1 |Ik )dzk+1 · f (zk |Ik−1 )dzk · · · f (z2 |I1 )dz2 · f (z1 |I0 )dz1 f (z0 )dz0 (15) where the conditioning on the control sequence given the measurement sequence is not a random variable, and appears only for notational convenience. Using these two facts, the cost function can be more compactly rewritten as J∗ = ∞ max −∞ u0 ∞ ∞ ∞ ∞ · · · max max max −∞ u1 −∞ subject to (1), while, for simplicity, setting Φ = 1. JC∗ explicitly denotes the optimal Cauchy cost criterion, and 1 C z̃0 Ψ =E x2 + η 2 1 1 z̃0 =E (20) (x̃1 + x̄1 )2 + η 2 where x̄1 = x̄0 + u0 . The optimal control problem is to find the control signal at k = 0, u∗0 , such that it maximizes the cost function (19) given the measurement at time k = 0, i.e. u∗0 = arg max ΨC . To maximize the cost function (20), the conditional pdf fX̃1 |Z̃0 (x̃1 |z̃0 ) is needed. It is computed following the steps presented in [5], [6]. Using the Chapman-Kolmogorov equation, it can be computed as ∞ fX̃1 |Z̃0 (x̃1 |z̃0 ) = max γ(k+1,N −1) (22) The conditional density function fX̃0 |Z̃0 (x̃0 |z̃0 ) is constructed as fX̃0 |Z̃0 (x̃0 |z̃0 ) = E ψ(XN , γ(0, N − 1)) Ik+1 f ( Zk+1 |γ(0, N − 1)) (17) Jk∗ (Ik ) = max uk ∗ Jk+1 (Ik+1 ) dzk+1 . Although a dynamic programming recursion exist, the solution maybe difficult to obtain analytically. Hence, initially, to understand the properties of the optimal control problem, the simplest Cauchy control problem is first considered in the next section. In particular, the maximization indicated in (18) is performed only for k = 0. IV. O NE -S TEP C AUCHY O PTIMAL C ONTROLLER fZ̃0 (z̃0 ) . (23) fX̃0 ,Z̃0 (x̃0 , z̃0 ) = fZ̃0 |X̃0 (z̃0 |x̃0 ) fX̃0 (x̃0 ) = fV0 (z̃0 − x̃0 )fX̃0 (x0 ) γ/π α/π = · . (z̃0 − x̃0 )2 + γ 2 x̃20 + α2 (18) −∞ fX̃0 ,Z̃0 (x̃0 , z̃0 ) The joint density function fX̃0 ,Z̃0 (x̃0 , z̃0 ) is computed using the measurement equation (6b) and the noise pdf (2b), and is given by the sought after dynamic programming recursion rule becomes ∞ fX̃1 |X̃0 (x̃1 |x̃0 ) fX̃0 |Z̃0 (x̃0 |z̃0 ) dx̃0 . −∞ If the optimal value function is defined as ∗ Jk+1 (Ik+1 ) (21) u0 uk −∞ E ψ(XN , γ(0, N − 1)) Ik+1 max γ(k+1,N −1)∈UN −1 . (16) f (Zk+1 |Uk ) dZk+1 · · · −∞ u2 u0 (24) Since z̃0 is a sum of two Cauchy random variables, it is a Cauchy random variable with fZ̃0 (z̃0 ) = (α + γ)/π . z̃02 + (α + γ)2 (25) From (23), (24) and (25), and using a standard partial fraction expansion, the conditional pdf fX̃0 |Z̃0 (x̃0 |z̃0 ) is given by Initially, the cost function of (12) will include the state xk only with N = 1 and hence k = 0. Thus, the optimal 1119 fX̃0 |Z̃0 (x̃0 |z̃0 ) = a1 x̃0 + b1 a2 x̃0 + b2 + 2 2 2 x̃0 + α z̃0 − x̃0 + γ 2 (26) where a1 = 2z̃0 C = −a2 (27a) b1 = [z̃02 + γ 2 − α2 ]C (27b) b2 = [3z̃0 − γ 2 + α2 ]C 1 αγ/π · . C= α + γ z̃02 + (α − γ)2 Next, from (6a) and fX̃1 |X̃0 (x̃1 |x̃0 ) is given by (2a), fX̃1 |X̃0 (x̃1 |x̃0 ) = fW0 (x̃1 − x̃0 ) = the conditional (27c) (27d) pdf β/π . (28) (x̃1 − x̃0 )2 + β 2 Substituting (26) and (28) into (22) and solving the integral, the conditional pdf fX̃1 |Z̃0 (x̃1 |z̃0 ) is determined as fX̃1 |Z̃0 (x̃1 |z̃0 ) = where c1 = Note that ζ here is the weighting parameter. If it is small, it puts more constraints on u0 . Hence, the control signal is expected to become more suppressed compared to the unweighted case. As ζ becomes larger, the optimal control is expected to converge to the un-weighted controller since the control signal is less penalized. V. G AUSSIAN O PTIMAL C ONTROLLER To compare the performance of the Cauchy controller with, for example, one that is designed assuming Gaussian noises, one has to consider: (a) the parameters of the Gaussian pdfs that best approximate the the Cauchy pdfs, and (b) the cost criterion used to design the Gaussian controller to be comparable to the one used in the Cauchy noise setting. Those two items are addressed first in this section, following the solution to the Gaussian controller problem. a1 x̃1 + c1 a2 x̃1 + c2 + 2 A. Normal pdf Least Squares Fit of a Cauchy pdf + (α + β) [x̃1 − z̃0 ]2 + (γ + β)2 (29) To construct a normal or Gaussian pdf that best fits a given Cauchy pdf, the following optimization problem is solved x̃21 α+β b1 , α c2 = β γ+β z̃0 a2 + b2 . γ γ Using the conditional pdf (29), and performing the integration associated with the conditional expectation in (20), the Cauchy control cost function is evaluated analytically as η+α+β η(α+β) c1 a1 α (u0 ∞ 2 C N fX (x) − fX (x) dx, σ = arg min (30) + x̄0 ) + 1 C Ψ = 2 π u0 + x̄0 + (η + α + β)2 η+γ+β + u0 + x̄0 ) + η(γ+β) c2 + . (31) 2 z̃0 + u0 + x̄0 + (η + γ + β)2 ∗ σ d C Ψ = 0. (32) du0 Since the expression in (31) is fairly complex, in the sequel we will analyze several numerical cases, in an attempt to deduct some general results for the minimization problem at hand. As in standard LQG or LEG problems, one may be interested in weighting the control signal in the cost function, as was presented in (10). With N = 1, the optimal cost function for this case is 1 1 z̃0 , (33) · ΨCw = E x21 + η 2 u20 + ζ 2 where the notation ΨCw is introduced to distinguish the case with control signal weighting from ΨC that weights the state only. Therefore, from (20) ΨCw = u20 1 ΨC . + ζ2 (34) The optimal control signal may be found by solving d C Ψ = 0. du0 w (35) −∞ where the Cauchy pdf is δ/π , δ>0 x2 + δ 2 and the normal pdf is given by C (x) = fX 2 a2 γ (z̃0 To find the optimal controller defined in (21), the following equation must be solved (36) N fX (x) = (37) 2 e−x /(2σ ) √ , σ > 0. 2πσ (38) Solving (36) analytically leads to a complex nonlinear equation relating σ ∗ to δ. Solving the latter numerically yields σ ∗ = k0 δ, k0 ≈ 1.4. (39) Hence, the equivalent corresponding pdfs for the Gaussian optimal control problem are chosen as 2 e−(x0 −x̄0 ) /(2M0 ) √ , M0 = k02 α2 2πM0 2 e−wk /(2W ) fWk (wk ) = √ , W = k02 β 2 2πW 2 e−vk /(2V ) √ fVk (vk ) = , V = k02 γ 2 . 2πV fX0 (x0 ) = (40a) (40b) (40c) B. Linear Exponential Gaussian (LEG) Controller To best approximate the Cauchy cost criterion of (33), in the Gaussian case an exponential cost criterion is chosen, given by −(qx21 +ru20 ) (41) ΨN z̃0 . w =E e Similarly to fitting the pdfs discussed above, the parameters in the Gaussian cost can be chosen to best fit the Cauchy cost. That is, 1 1 (42) q = 2 2, r = 2 2. 2k0 η 2k0 ζ 1120 The LEG problem for the model in (1) and noise characteristics in (40) were addressed in [8]. The optimal controller is given by (43) u∗0 = −Λ0 [1 + 2P0 S0 ]−1 x̂0 where q q + r + 2qrW k 2 α2 γ 2 P0 = 02 α + γ2 qr . S0 = q + r + 2qrW Λ0 = Thus the optimal controller can be expressed as q u∗0 = − x̂0 . q + r + 2qr(P0 + W ) (44) K = P0 V α2 = 2 . α + γ2 Equation (55) is a 3rd order polynomial, whose solution is fairly complex. Adding measurement noise, i.e γ = 0, increases the order of the polynomial, and makes it even harder to solve, thus demonstrating the difficulty in solving the Cauchy optimal control problem. VI. N UMERICAL E XAMPLES (46) (47) (48) where −1 2u∗0 3 + 3z̄0 u∗0 2 + [ζ 2 + (η + β)2 + z̄02 ]u∗0 + ζ 2 z̄0 = 0. (55) (45) Furthermore, x̂0 can be computed as x̂0 = x̄0 + K z̃0 and, from (35), the optimal control satisfies (49) Substituting the parameters q, r, W , and P0 into (47) shows that the slope of u∗ with respect to z0 is independent of k0 . In other words the corresponding parameters of the Gaussian cost function may be chosen as q = 1/η 2 and r = 1/ζ 2 . This LEG controller is called risk adverse when r and q are positive. If r and q are negative, then the controller is an H∞ controller [8]. However, this type of cost is not allowed for the Cauchy densities, because the unconditional cost criterion will not exist. If the control signal is not included in the cost function (41), i.e. r = 0, the LEG cost function becomes, 2 (50) ΨN = E e−qx1 z̃0 . Through some numerical examples, the characteristics of the Cauchy optimal controller, obtained from either (32) or (35), is studied for both un-weighted and weighted control scenarios. For each case, the results are compared against the equivalent LEG controller, obtained from (51) or (47), respectively. A. Nominal Example First the un-weighted control scenario is considered. The parameters for Cauchy signals are chosen as α = 3, β = 3, η = 1, γ = 5, x̄0 = 0. Substituting these parameters into (31), the cost function becomes 3(29z 2 + 20uz − 288) 1 C Ψ = π 16(z 2 + 4) ((u + z)2 + 81) 10(7z 2 − 6uz + 112) + 16(z 2 + 4)(u2 + 49) α = 3, β = 3, γ = 5, η = 1, x̄0 = 0 0.06 0.05 0.04 ΨC (51) C. Discussion (57) where 0 and ∼ in z̃0 and u0 are dropped for brevity. The cost function is plotted in Fig. 1. In this case, from (44) and (46), Λ0 = 1 and S0 = 0. Therefore, using (43), the optimal controller is independent of q and is given by, u∗0 = −x̂0 . (56) 0.03 0.02 Consider the case where there is no measurement noise, i.e. γ = 0. Hence, from (31), the Cauchy cost function becomes η+β 1 C η Ψ = . (52) π (z̄0 + u0 )2 + (η + β)2 0.01 0 50 50 0 0 Using (32), the optimal control equals u∗0 = −z̄0 = −x̄0 z0 (53) which is the same as Gaussian optimal controller obtained from (48). Furthermore, the weighted Cauchy cost function becomes η+β 1 1 C η Ψw = 2 · π u0 + ζ 2 (z̄0 + u0 )2 + (η + β)2 Fig. 1. −50 −50 u0 Cost function for Example VI-A. The optimal controller can be obtained by minimizing (57) with respect to u. This required the solution of (54) 1121 l 5 u 5 + l 4 u 4 + l3 u 3 + l 2 u 2 + l 1 u + l 0 = 0 (58) α = 3, β = 3, γ = 5, η = 1, x̄0 = 0 where 10 l4 = 92z 2 (59a) 8 (59b) 6 l3 = 6(2016 + 25z ) (59c) l2 = z(19176 + 125z 2 ) (59d) l1 = 7(94176 + 1290z 2 + 5z 4 ) (59e) l0 = 147z(1476 + 5z 2 ). (59f) Cauchy Gaussian 4 2 u* l5 = 32 0 −2 −4 −6 −8 −10 −30 −20 −10 0 10 20 30 z0 Fig. 2. Optimal controllers for Example VI-A. α = 3, β = 3, γ = 5, η = 1, x̄0 = 0 0.08 Cauchy Gaussian 0.07 0.06 * 0.05 Ψ Equation (58) is solved numerically, and the optimal control signal is plotted in Fig. 2 along with the LEG controller, obtained from (51). The cost function (20) evaluated with both the Cauchy optimal control and the LEG controller is plotted in Fig. 3. Fig. 2 shows that the Cauchy controller is symmetric and linear about z̃0 = x̄0 = 0. Interestingly, its slope is nearly identical to that of the LEG controller. Furthermore, the figure shows that the Cauchy controller approaches zero when |z̃0 | becomes large, while the Gaussian controller remains linear with respect to the measurement z̃0 . This is a significant difference between the Cauchy and Gaussian optimal controllers, which can be deducted analytically from (58). Since u∗ is finite, the dominant term in (58) as |z̃0 | → ∞ is l1 u∗∞ , or lim|z̃0 |→∞ u∗∞ → 0. A similar conclusion can be deducted by examining the variance of the estimation error at k = 0, which is given by [5] z̃02 2 ˆ +1 . (60) E (x̃0 − x̃0 ) z̃0 = αγ (α + γ)2 0.04 0.03 Equation (60) shows that the variance of estimation error grows unboundedly when z̃0 increases. Hence, the optimal Cauchy control strategy correctly reduces the controller gain when the estimation error variance is increasing dramatically. This difference in the control strategy between the Cauchy and LEG controller results can be seen from better expected performance of the former, as presented in Fig. 3. There, one can clearly observe that the cost function ΨC > ΨN for large |z̃0 |. 0.02 0.01 0 −30 −20 −10 0 10 20 30 z0 Fig. 3. Optimal cost function for Example VI-A. B. Changing x̄0 In this example the offset parameter x̄0 in (56) is changed to x̄0 = 5. For this case, the coefficients of (58) are computed as (Note: z = z0 = z̃0 + x̄0 ) l5 = 37z 2 − 505z + 1856 3 2 4 3 (61a) l4 = 157z − 1565z + 1236z + 19720 (61b) 2 l3 = 300z − 2860z + 27196z − 239720z + 849468 (61c) l2 = 250z 5 − 1750z 4 + 32756z 3 − 187380z 2 − 592542z + 5636730 (61d) l1 = 70z 6 + 400z 5 + 4590z 4 + 78760z 3 − 78498z 2 − 9815700z + 49314906 (61e) l0 = 350z 6 − 2780z 5 + 67450z 4 − 138228z 3 + 998670z 2 − 28884924z + 137064150. (61f) The Cauchy optimal controller is plotted along with the LEG control signal in Fig. 4. The plots show that the Cauchy controller is almost linear and symmetric about x̄0 = 5. Furthermore, the slope has not changed compared to the Example VI-A. This can be expected from (31), where x̄0 shows a shifting effect. As |z̃0 | grows, the Cauchy control signal approaches −x̄0 = −5. This fact can be verified using (61e) and (61f), where u∗∞ = −l0 /l1 → −5 as |z̃0 | → ∞. C. Changing η In this example the parameter η that expresses the width of the membership function in (20) is varied. Here η = 5 is 1122 α = 3, β = 3, γ = 5, η = 1, x̄0 = 5 α = 3, β = 3, γ = 5, η = 5, x̄0 = 0 4 8 Cauchy Gaussian Cauchy Gaussian 2 6 0 4 −2 2 u u* * −4 0 −6 −2 −8 −10 −4 −12 −6 −14 −30 −20 −10 0 10 z Fig. 4. 20 30 −8 −30 40 −20 −10 0 10 Optimal controllers for Example VI-B. Fig. 5. chosen, leading to the cost function 30 Optimal controllers for Example VI-C. where 1 C 3(−416 + 20uz + 33z 2 ) Ψ = π 80(z 2 + 4) ((u + z)2 + 169) −176 + 6uz − 11z 2 . (62) + 8(z 2 + 4)(u2 + 121) l7 = 64 (66a) l6 = 196z l5 = 22336 + 297z (66b) 2 (66c) l4 = 5z(7940 + 47z 2 ) The coefficients in (58) become (Note: z = z̃0 ) (66d) l3 = 2(792288 + 12633z 2 + 35z 4 ) l5 = 64 l4 = 184z (63a) (63b) l2 = 987612z + 6250z l3 = 36608 + 270z 2 (63c) 2 l2 = z(58768 + 205z 2 ) (63d) l1 = 11(363584 + 2410z 2 + 5z 4 ) (63e) l0 = 363z(3848 + 5z 2 ). (63f) As in the previous examples, Fig. 5 shows the slope of Cauchy controller has not changed due to a change in η. This behavior is similar to the LEG controller, whose slope is independent of q when r = 0. D. Weighted Controller In the following two examples, cost functions are examined that weight specifically the control signal u0 . 1) ζ = 1: In this example the Cauchy cost parameters are given in (56) while setting ζ = 1 in (34). Using (55), the optimal control is found when there is no measurement noise, by solving 2u∗0 3 + 3z̄0 u∗0 2 + [17 + z̄02 ]u∗0 + z̄0 = 0. 20 z0 0 (64) The resulting optimal controller is plotted in Fig. 6. Using (42), the LEG cost function parameters in (41) are chosen as q = 1/η 2 = 1 and r = 1/ζ 2 = 1. From (35), the Cauchy optimal controller is found by solving l7 u 7 + l6 u 6 + l 5 u 5 + l 4 u 4 + l 3 u 3 + l 2 u 2 + l 1 u + l0 = 0 (65) (66e) 3 (66f) 2 4 l1 = 7(3523392 + 63471z + 250z ) l0 = 147z(1476 + 5z ). (66g) (66h) The LEG controller is determined from (47). Both controllers are plotted in Fig. 6. Surprisingly, the Cauchy controller still shows a linear behavior about x̄0 . Furthermore, its slope is slightly different from the slope of the LEG control signal. Also, the symmetric property of the controller about z̃0 = x̄0 and its asymptotic behavior when |z̃0 | → ∞ is preserved. A noticeable difference, as expected, is that the control signal is suppressed compared to the previous cases. This is the direct effect of penalizing u0 in the cost function. It is also interesting to note that when the measurement is perfect, the control signal acts in a similar manner to the case where the measurement is noisy. 2) ζ = 104 : The weighting parameter is increased to ζ = 104 . First, using (55), the optimal control is found when there is no measurement noise. The control signal satisfies 2u∗0 3 + 3z̄0 u∗0 2 + [108 + 1 + z̄02 ]u∗0 + 108 z̄0 = 0. (67) As expected, the optimal control is the same as the case where the control signal is not weighted, i.e. u∗0 = −z̄0 , as stated in V-C. The equivalent Gaussian parameters are q = 1, and r = 10−4 . In both cases, those parameters correspond to 1123 α = 3, β = 3, η = 1, ζ = 10000, x̄0 = 0 α = 3, β = 3, η = 1, ζ = 1, x̄0 = 0 10 0.2 Cauchy, γ=5 Gaussian Cauchy,γ=0 0.15 Cauchy, γ=5 Gaussian Cauchy,γ=0 8 6 0.1 u* u* 0.05 0 4 q=1 2 r = 1e-008 0 −2 −0.05 −4 −0.1 −6 −0.15 −8 −0.2 −30 −20 Fig. 6. −10 0 z0 10 20 −10 −30 30 −20 −10 0 z 10 20 30 0 Fig. 7. Optimal controllers for Example VI-D.1. a significant decrease in the weighting of the control signal. Thus it is expected that the resulting control signals will be nearly identical to those obtained in Example VI-A, where there was no weighting on the control signal. In this case, the coefficients in (65) are l7 = 64 l6 = 196z (68a) (68b) l5 = 3200022304 + 297z 2 (68c) 2 l4 = z(9200039608 + 235z ) (68d) 2 4 l3 = 2(604800786240 + 7500012558z + 35z ) 2 l2 = z(1917600968436 + 12500006125z ) (68e) (68f) 2 l1 = 7(9417603429216 + 129000062181z + 500000245z 4 ) (68g) l0 = 14700000000z(1476 + 5z 2 ). (68h) The optimal controllers are plotted in Fig. 7. As expected, the plots show that the Cauchy controller converges to the optimal control signal for the case with no control weighting. VII. C ONCLUSIONS AND F UTURE W ORK A new mathematical framework was developed for optimal control problems when the dynamic system is driven by Cauchy distributed signals. This entailed a newly developed cost function, addressed through dynamic programming. As an initial step, one time-step controller was derived and examined analytically. The results where compared against an equivalent LEG controller. It was shown that both controllers behave in a similar way when the measurement values are small. A dramatic difference was observed between the two controllers when the measurement values become large. The Cauchy optimal controller saturates when the measurement value increases, while the LEG controller is always linearly proportional to the measurements. To extend the results, it is necessary to obtain the general properties of Optimal controllers for Example VI-D.2. the Cauchy controller. Furthermore, the optimal controller must be determined for each time update. In addition, a dynamic programming framework must be developed for the proposed optimal control problem. These will be considered in future studies, utilizing the insights presented in this current work. R EFERENCES [1] E. E. Kuruoglu, W. J. Fitzgerald, and P. J. W. Rayner, “Near Optimal Detection of Signals in Impulsive Noise Modeled With Asymmetric alpha-stable Distribution,” IEEE Communications Letters, vol. 2, no. 10, pp. 282–284, Oct. 1998. [2] G. Samorodnitsky and M. S. Taqqu, Stable Non-Gaussian Random Processes: Stochastic Models with Infinite Variance. New York: Chapman & Hall, 1994. [3] P. Tsakalides and C. L. Nikias, Deviation from normality in statistical signal processing: Parameter estimation with alpha-stable distributions; in A Practical Guide to Haevy Tails: Statistical Techniques and Applications. Boston: Birkhauser, 1998, pp. 379–404. [4] G. A. Tsihrintzis, Statistical Modeling and Receiver Design for MultiUser Communication Networks. Boston: Birkhauser, 1998, pp. 379– 404. [5] M. Idan and J. L. Speyer, “Cauchy Estimation for Linear Scalar Systems,” in IEEE Conference on Decision and Control, Cancun, Mexico, 2008, pp. 658 – 665. [6] ——, “Cauchy Estimation for Linear Scalar Systems,” IEEE Transactions on Automatic Control, vol. 55, no. 5, May 2010. [7] L. N. Johnson, S. Koltz, and N. Balakrishnan, Continuous Univariate Distributions. John Wiley & Sons, Inc., 1994, vol. 2, ch. 16. [8] J. L. Speyer and W. H. Chung, Stochastic Processes, Estimation, and Control, 1st ed. SIAM, 2008. [9] H.-J. Zimmermann, Fuzzy Set Theory and its Applications, 4th ed. Springer, 2001. 1124

Optimal Control for a Scalar One-Step Linear System with Additive

Related documents

Products

Support

Optimal Control for a Scalar One-Step Linear System with Additive

Related documents

Add this document to collection(s)

Add this document to saved

Suggest us how to improve StudyLib