CONTROL-THEORY HEURISTICS FOR IMPROVING Cuneyt M. Ozveren

CONTROL-THEORY HEURISTICS FOR IMPROVING THE BEHAVIOR OF ECONOMIC MODELS John D. Sterman Cuneyt M. Ozveren October, 1988 WP # 2074-88 Control-Theory Heuristics for Improving the Behavior of Economic Models CUneyt M. zveren John D. Sterman Abstract The usefulness of control simulations is examined. theory techniques in economic A model of the capital investment accelerator is used for illustration. It is shown that behavior approaching optimality can be achieved in a highly nonlinear system with simple heuristics based on linearization and constant state feedback with an appropriate choice of desired closed-loop poles. Applications to real life situations are discussed and suggestions are made for approaching potential problems using similar control theory techniques. September 1988 Laboratory for Information Decision Systems, MIT, Cambridge MA 02139 *Sloan School of Management, MIT, Cambridge MA 02139 0 Introduction One can approach a system dynamics problem from two extreme points of view: mathematical and intuitive. provides global knowledge of simple models. Thus, Mathematical analysis the solutions but is only feasible in it is very hard to analyze large, realistic models mathematically. On the other hand, with the intuitive approach, one can investigate more comprehensive and realistic models, but it is all too easy to become lost in simulation and trial and error in designing policies to improve performance. We seek to answer several questions: 1. How useful are the techniques of modern control theory in simulation models of social systems? 2. Can "approximately optimal" policies be designed which provide significant performance improvement relative to the original policies of the decisionmakers? By "approximately optimal" is meant a set of policies based on heuristic methods rather than the closed-form solution of the problem, since such analytic solutions are generally impossible to determine in the high order, nonlinear systems typically of concern in system dynamics modelling. 3. How different are the improved and original policies? That is, are the improved rules consistent with the likely availability of information and bounded rationality of real managers? The goal of this paper is to motivate the heuristic use of some 1 III mathematical outlined as 1. tools to aid the systems analyst. The approach can be follows: Start from the full simulation model portraying the problem of interest. 2. Analyze the model linearization, of 3. formally, using decoupling, etc., where appropriate model reduction, to decrease the burden the mathematical analysis. Design an appropriate controller (policies for improved performance). 4. Project the results back to the original model and test robustness of the new policies in the full We assume that system. 1 is given and concentrate on 2, paper complements previous applications of control design in social and corporate systems the 3, and 4. The theory to policy (e.g. Mohapatra and Sharma 1985, Coyle 1985, Kivijarvi and Tuominen 1986) by applying formal methods to a nonlinear model with complex dynamics. existence of the the showing how people actually manage experimental results system provides a means Further, for directly assessing improvements which may be obtained through the performance the use of control techniques. We next describe some of the basic mathematical above approach is applied to Sterman's Kondratiev cycle or general long wave. techniques applicable The of the The particular example illustrates to a variety of situations such as business cycles (Mass 1975, Lyneis stabilization (Meadows 1969), (1985) model tools. 1980), commodity price market growth (Forrester 2 1968) and others. Although this paper is mostly heuristic, we hope to motivate the need for rigorous analysis based on the outlined method. The last section summarizes our results and makes recommendations for further analysis. Mathematical Tools Linearization is the most common tool for formal analysis of nonlinear models. A model of the form x(t) = f(x(t)) + g(u(t)) (1) where x is the vector of states, u is the vector of inputs in the model and f(-), g(-) state and input functions, respectively, could be linearized around a nominal trajectory x n(t) using the Jacobian matrices F = [fi,j], 1,3 G= [gi,j], ~1,3fi,. = afi(x)/ax. i 3 gij = (2.a) gi(u)/cu (2.b) Then the linearized model bx(t) = F6x(t) + Gu(t) (3) describes deviations from the nominal trajectory. The actual trajectory x(t) = x (t)+bx(t) is driven by the input u(t) = u (t)+bu(t) where f(x n(t))+g(un(t)) 0. This approximation is good for "slowly varying" nominal trajectories. analyzed using modern control The model can be theory (Guckenheimer and Holmes 1983; for "slowly varying" see Vidyasagar 1978). It should always be remembered that linearization only provides information about the local region of phase space, and is not a 3 III reliable guide to global nonlinearities is saturation, this case, the class bang-bang, discontinuous, dynamics. A special class of of piecewise roundoff etc. and therefore linear functions, such as Their derivatives are the Jacobian matrix is undefined. In the analysis can be performed for different regions where the derivatives of these nonlinearities are defined. Once the of control linearized model can be applied. placement using full state (3) is constructed, various methods The method used in this paper is pole feedback (Kailath 1980). In essence, a weighted sum of each state is fed back to the system as The method these weights, H, given set is to calculate of closed the system (3) that loop poles (i.e. is controllable, in order the input. to achieve a the eigenvalues of F+GH). If then a matrix H can be chosen such the eigenvalues of F+GH are as desired. The input u(t) is picked as: 6u(t) = Hx(t) (4) The closed loop system then becomes: 6x(t) = (F+GH)6x(t) and has (5) the desired closed loop poles. to sustain the nominal trajectory. The nominal For example, input is picked the nominal trajectory might be given by the equilibrium of the system, which may be changing with exogeneous conditions. To achieve an approximately optimal 2 poles, one rule of thumb is closed loop system such that equal response for a system with to select a pair of poles for the these poles are complex conjugates with magnitude of real and imaginary parts and negative real part. The response to a step input exhibits a 4 slight overshoot and is optimal in the sense of minimum overshoot and settling time (Roberge 1975). The distance from the poles to the origin is a trade-off between overshoot and settling time. An important aspect of any control design is robustness. We consider robustness by perturbation of the control parameters and by introducing uncertainty in the states. A Model of the Kondratiev Cycle Various models of the Kondratiev cycle or long wave have been developed to analyze recent economic difficulties (Rasmussen, Mosekilde, and Sterman 1985; Sterman 1985; Sterman 1986; Vasko 1987). model This paper adopts the model of Sterman 1985. The Kondratiev is chosen since it is a nonlinear system which exhibits complex dynamics, yet its structure is simple enough to allow for a considerable amount of mathematical analysis to be done by hand to illustrate the proposed techniques. systems dynamics literature. It is also well known in the The model has also been converted into a simulation game, Strategem-2 (Sterman and Meadows 1985). Strategem-2 provides a simulated economy in which the human decisionmakers replace the decision rule of the original model. Thus the game constitutes a laboratory for experimental testing of the model. Strategem-2 motivates the design of an "optimal" controller to "play" the game as we can compare the behavior of actual managers to the "optimal" response, thus providing a rough measure of the value of improved performance through formal analysis. 5 III The long-wave model portrays the process governing capital investment in the aggregate economy. The capital producing sector strives to satisfy the demand for capital of sector as well as its own capital needs. the goods producing The model represents the capital "self-ordering" feedbacks created by the fact that capital is an input to its own production. This dependency implies that the total demand for capital can only be filled when there is sufficient capacity, but also that the capacity of the capital-producing sector can only be increased by first ordering additional capacity and adding to the demand. Intuitively, such a positive feedback must destabilize the adjustment of capital producers to shocks, and in fact, the model typically exhibits large-amplitude limit cycles. It can also generate chaotic behavior, where no periodicity can be attributed to the model (Sterman 1988b). In the experimental version, subjects play the role of managers for the capital producing sector and strive to balance the supply and demand for capital. Subjects seek to minimize their costs or "score" during each trial. The score is the average absolute deviation between supply (production capacity, PC) and demand (desired production, DP) over the T periods of the game: S = h[DPt-PCtI. When presented with an unanticipated step input in demand, a sample of about 50 subjects produced an average score of more than 500 (Sterman 1987, 1988a). The optimal score for the same situation is 19.1 The poor 1The optimal score was determined by grid search of the score space as a function of successive decisions. The optimal strategy presumes that the step in demand is unanticipated, consistent with 6 performance of the subjects shows controller may offer substantial The states the goods sector capital improvements in the model are capital placed by the capital by that even an approximately optimal sector (BGS). (BKS), The (K), follows (Figure K1 K~ ~ the backlog of orders and the backlog of orders placed inputs are new orders from the sector NKS (specified by the player), goods sector NGS (exogenous). in performance. The model and new orders of the can be summarized as 1): r BKS K NCAT LS BK - BGS BKS FD NCAT [D1 + NKS NCAT FDS (6.a) NGS FDS = Fraction of Demand Satisfied Min(Desired Production,Capacity) Desired Production BKS + BGS(6c) NCAT Desired Production = Capacity = where K COR the normal of capital (ALC), the experiment the same (6.d) capital acquisition and capital in this paper, COR = 2 are used; Also, to time output ratio (NCAT), average lifetime (COR) are constants. the values NCAT = 2, For ALC = 20 and these are the values used in the Strategem-2 game. time step (2 years), and score function, as results (6b) the game. simulation length (70 years), the game, were used in order to compare our The acquisition lag and capital/output ratio the information available to the subjects. 7 II were deliberately set equal the STRATEGEM-2 game. to the time step In the original model based on econometric evidence and the integration error was not significant. differential equation versions to facilitate play in the parameters were time step reduced so that The difference and of the model produce the same types of behavior and respond to parameter changes in the same fashion. Linearizat ion: The only nonlinearity requiring piecewise analysis arises from the fraction of demand satisfied, can only fully satisfy equal to capacity. production. FDS. The capital-producing sector the demand when that demand is FDS equals less than or 100% when capacity exceeds desired As desired production rises above capacity, FDS falls, constraining the growth of two distinct regimes: capital itself. The model operates in FDS = 1 (capacity exceeds demand), and FDS < 1 (insufficient capacity). Considering each regime in turn, linearization of Equation (6) yields: 1. For FDS = 1, r6K 6K {SK 6BKS [BGS = F2 6BKS + (7) G2 tBGS 8 -1/ALC where F2= 0 1/NCAT 0 -1/NCAT 0 0 and G 2 = . -1/NCAT 0 2. For FDS < 1 6K 6K [6BKS * 1 6BKS = F1 6BGS where F1 6NKS + G ~~~~6NGSJ (8) 6BGS is KxBGS/COR(BKS+BGS) 2 -KxBKS/COR(BKS+BGS) 2 BKS/COR(BKS+BGS)-1/ALC -BKS/COR(BKS+BGS) -KxBGS/COR(BKS+BGS) 2 KxBGS/COR(BKS+BGS) 2 -KxBKS/COR(BKS+BGS) 2 -BGS/COR(BKS+BGS) GI = 0 KxBKS/COR(BKS+BGS) 2 and X denotes the value of state X at the operating 0 1 point. Let p = (K/COR)/(BKS+BGS). Note that q = NCATxFDS where FDS is the fraction of demand satisfied at the operating point. let r = BKS+BGS and note that = NCATxP, production at the operating point. BKS/CORx F1 = For FDS=l, - 1/ALC is the desired Then, we have xBGS/T -xBKS/I -BKS/CORxr -pxBGS/r -BGS/CORxr VxBGS/ the entire model is highly nonlinear. where Also, pxBKS/r -xBKS/r is linear, and for FDS<1, the model Note the following observations from the analysis of the above models: 9 III 1. FDS = 1 a. The system is stable with time constants ALC lifetime of capital, and NCAT (normal delivered within the normal capital capital stock depreciates with time constant ALC. stable. sector the goods gets all 1 The eigenvalues are 0, -, and BKS/CORxw-1/ALC. The Jacobian matrix has a zero eigenvalue, which that and thus it desires, BGS is also decoupled from K and BKS. is not controllable. FDS ( BGS is uncontrollable In other words, K can be controlled by NKS, but as long as FDS=1, a. orders to be acquisition time, and K and BKS are controllable via NKS. but 2. capital acquisition In this regime, excess capacity permits all time): b. (average higher order terms important. Thus, no the Jacobian matrix. to be safer seems ignored by the implies linearization are stability conditions can be inferred from Due to its highly nonlinear nature, to keep the system away from this it region whenever possible. b. The is linearized to design a controller which stabilizes region, or better into still, gets the the goal the system in this system out of this region the stable region where FDS=1. An interesting characteristic of equilibrium point, which for a given NGS, making Therefore, system is controllable. this model is that lets us calculate the nominal is exactly at the boundary of the system harder to control. 10 trajectory these two Specifically, at the regions, the equilibrium, FDS = K (9.a) = ALCxCOR NGS ALC-COR x NGS n BKS NKS BGS where X 1 n (9.b) n NCAT xCOR = NCATCOR x NGS ALC-COR X G (9.c) n COR = ALC-COR (9.d) x NGS x NGS = NCAT x NGS n denotes Thus, (9.e) the nominal control trajectory for X. to sustain the nominal COR be chosen as ALCOR RxNGS + - trajectory given NGS, NKS should NKS where 6NKS is chosen in order the deviations from the nominal to trajectory. Controller Design: The model, controllers full implemented in STELLA, (one for each region), state feedback. the capital instant of to perform pole placement using The controller determines the new orders of sector NKS and is designed to calculate H the eigenvalues of Fi+GCiH. time such that for each region, i = 1, 2. equilibrium rate of capital of was extended to support two the consumer sector. The nominal 1 at every are as desired, trajectory is given by the sector orders given the current orders Then, the resulting input is: 5K NKS = ALCCORXNGS + H ALC-COR 6BKS = NKS ~~n + 6NKS. i (10) 6BGS Note that as long as the desired poles are chosen to be nonzero and 11 III stable, the trajectories will be nominal trajectories. usually linearized around the equilibrium point and the poles are set accordingly. stable around In the control Here, we place the slowly varying literature, the system is the poles for all the system (except perhaps at a few number of points where controllability is lost). Although this is not backed up by rigorous results, merits further research in view of the successful it results shown below. Figure 2 shows a policy structure diagram of controller (equations for Appendix). First, current state of entries of H 1 the controller are supplied in the H1 and H2 are calculated on the basis of the the system and the desired closed loop poles. were found such that loop matrix (determinant polynomial ((X-desired pole 1)(X-desired pole 2) and 6NKS2 are calculated as H1 and H2 respectively, of I-(F+GHi)) the weighted sum, is the desired etc.). Then, 6NKS by multiplication via the current value of NGS and This stabilizes the dynamics of the error modes of the closed loop system and thus drives the states the system is more robust to since the steady the error will state values. Thus, go the response will not be very sensitive to to zero and of of the deviation from the nominal which are determined by Equation (9). The the characteristic polynomial the closed trajectory, the overall perturbations in H 1 and H2 . Finally, 6NKS 1 depending on the value of FDS, and NKS or NKS 2 is chosen is added to calculate NKS. n Three poles in the region FDS < 1 were placed in the left half plane on the circumference of a circle centered at 12 the origin. One pole was placed on the real axis with a complex conjugate pair with equal real and imaginary parts. The radius of the circle determines a trade-off between overshoot and settling time (the larger the radius, the more overshoot and faster settling). In the region FDS = 1, two poles were placed on the real axis in the left half plane, both at the same location. Their distance from the origin determines the settling time in this region. Positive shocks to NGS from equilibrium put the system in the unstable, nonlinear region FDS 1. The controller is designed to move the system into the stable region in an "optimal" quickly and with minimum overshoot. fashion - In the region FDS = 1, the system is expected to settle at the equilibrium in an overdamped way, thus providing the overall desired response. NGS put the system in the region FDS = 1. Negative jumps in The response will be overdamped in that case. In intuitive terms, region FDS increasing the speed of adjustment in the 1 may result in building the capital stock too high, and thus degrading performance by increasing the score. the speed when FDS 1 may cause with inadequate investment raising the score. Decreasing the system to linger in this region to escape, also degrading performance by Increasing the speed in the region FDS = 1 may cause the system to reenter the region FDS time intervals used in the Strategem-2 game. 1 due to the discrete This is not expected to cause any problems since capital will be bounced back up again, but may cause small amplitude (probably damped) oscillations. Decreasing the speed in this region may cause slow settling to the equilibrium (overconservative response), 13 and thus increase the III score. Experimental Results: for pole Various speeds were tried, and simulation results locations -0.38, illustrated in Figure 3. DesiredK = 1, and -1, -0.27±0.27j for FDS for FDS = 1 are desired capital In the figure, the sector input supports The capital (BKS+BGS)/NCAT. -1 intuition that a positive jump in the goods sector demand is input NKS such followed by an amplified positive jump in the control the share of that production is the capital sector demand in the desired increased enough to the desired production input drops close to zero for a while to its equilibrium value. about 10 years. To 10% FDS the goods sector. The score investigate for robustness in the response, and for the the initial Capital jump in the falls to 80%, but recovers in the simulation is 15.2'3 the speed in both directions independently. changes Later, to compensate for the the desired response, while compensating backlog of and supply the region FDS = 1). (bounced to overdemand, and finally settles shows increase the capital inputs were perturbed by This resulted in negligible the increase in the average score per 2 In the Strategem-2 game, all quantities are rounded 10 units to simplify the decisionmaking task of simulation analyzed here permits values. As a result, the optimal 3 Compare score of the behavior the states to the nearest the subjects. The to take continuous the score in the simulation can be less than 19 in the experiment. in Figure 3 to Figure 4 which shows typically produced by subjects of 14 the experiment. the cycles period was less than 1 for all perturbations. Another robustness test was performed by introducing uncertainty in choosing the appropriate controller around the equilibrium. 0.95 shows Specifically, this boundary was chosen as the region: (desired production)/(production capacity) 1.05. Figure 5 the simulation result when the controller for FDS < 1 was chosen for this boundary. The response exhibits some underdamped fluctuations due to the complex conjugate pair, unimportant. but they seem to be Figure 6 shows the simulation result when the controller for FDS = 1 was chosen for this boundary. Since the controller for FDS = 1 was overdamped, when used in the boundary, never bounces the system into the region FDS = 1. Thus, FDS never makes it to 1, and gets into a cycle where it slides down, (but below 1), slides back down, etc. it jumps up Recall that there was an extra degree of freedom in the controller for FDS = 1, which was not used, since BGS is uncontrollable. included in the controller. Specifically, BGS was not To correct for the above deficiency, multiple of BGS was added to the controller for FDS = 1. a This has no effect in the region FDS = 1, since the capital sector is decoupled from the goods sector in this region. In the boundary, this could be used to correct the undesired effect of the uncertainty. Figure 7 illustrates the simulation result when 12 BGS was added to the controller of the region FDS = 1. of The response is greatly improved, and FDS settles at 1 in this case. Intuitively, when faced with an unstable region and uncertainty in the states, a robust controller will err on the side of caution by moving the system rapidly to the stable zone even at the cost of 15 III possibly moving results for goods farther into the stable zone than necessary. other simulations where different waveforms for sector demand were chosen are summarized The controller perturbations in Table the 1. seems to be robust with respect to perturbations in the desired pole locations, uncertainty The which also correspond in the feedback gains. to The discrepancy due to the in choosing the appropriate controller around equilibrium was corrected by using the additional degree of freedom in the results are not contingent on Thus the region FDS = 1. precise knowledge of the system parameters. The controller designed in this paper could be modified to follow ramp inputs better, but response to other More advanced control inputs. linear quadratic regulation, is likely to give worse results in could be applied to requires the use of packaged programs to solve techniques, such as this problem, but the necessary equations. Conclusions and Further Analysis This paper presents a heuristic application of control to an economic model. Although we considered one case, theory the same approach can be applied to a wide class of problems which exhibit the same structure, that is, any system with a region in which resources are fully utilized and a results region of slack. In general, the show that a problem which is highly nonlinear can be controlled quite satisfactorily with the aid of simple mathematical analysis and basic control concepts. 16 We have considered two key techniques for approaching such nonlinear problems: 1. Linearization around the nominal used technique. trajectory: This is a commonly The results of Vidyasagar (Vidyasagar 1978), although complicated, are particularly useful in finding the class of "slowly varying" trajectories that the system would be stable around. We verified this for step inputs by simulation. Ramp inputs and sinusoids seem to present some problems. 2. Analyzing piecewise-linear nonlinearities in linear regions separately: We considered a common example of this case, saturation. However, since the general applications of control the system to operate in the unsaturated region, theory require there is not much rigorous analysis to be found in the literature for the case when the operating point is required to be on the boundary between the two regions, as in many system dynamics problems. This perhaps merits a rigorous analysis. Comparing the "Optimal" Policy with Actual Dectsion-Makhing Practice: Many economists argue that dysfunctional oscillations such as the long wave cannot exist since economic agents with rational expectations would behave in an optimal fashion. shows that The analysis here there are in fact optimal strategies for investment which can avoid instability. But, it is very unlikely that the results presented in this paper would be achieved in practice through conventional decision making processes. by the experimental results This assertion is supported in Sterman 1987 and Sterman 1988a. The decision rule developed here requires information a manager in real life is unlikely or unable to have. 17 It then processes that III information in a highly sophisticated fashion. optimal rule utilizes knowledge of system to compute firms. the stock and new orders. demand. But firm'to be able to is excess or insufficient capacity compared to in reality, an individual firm is unable to whether customers' orders represent long run needs, adjustments, fooled, or self-ordering effects. The optimal ordering still more in response to the their own attempt to raise capacity. It single decision process which is full the Rather, as in each regime, experiments with locally rational important (but leads to poor large gap between the requirements of optimal type, the trend of the inputs in practice than precise values. robustness also suggest errors, tend to use a system). control and the reality of bounded rationality, problems of this we stress that The tests of controller that small variations due to modelling the analysis is the critical determining the best strategy. The main role of nonlinearity in Performance can be dramatically 18 in is much more etc. still produce very satisfactory results. result of into is unlikely that actual managers, economists, and students confirm, people To overcome rule is not firm cannot distinguish with certainty which regime the economy is operating in. in the transient stock rise in demand induced by firms use dramatically different decision rules performance tell the Strategem-2 game frequently are, as players of particularly since a The strategies depending on which regime one is operating in, requiring a there the full economy is not known to real The rule also requires very different decide whether the the equilibrium structure of targets for capital equilibrium structure of In particular, improved by determining which regime one reacting accordingly, even if only approximately correct. is operating in and the decision rules for each regime are As in any policy analysis, implementation will depend on the modeller's ability to explicate the rationale for the policy. reinforces rather than replaces justification of The use of control concepts the need for managerially oriented the proposed policies. 19 III REFERENCES 1985. The Use of Optimization Methods Coyle, R. G. in a System Dynamics Model, J. Forrester, Investment, Guckenheimer, System Dynamics Review, 1968. Market Growth as Ind. for Policy Design 1:81-91. Influenced by Capital Man. Rev. J. and Dynamical Systems, P. Holmes 1983. Nonlinear Oscillations, and Bifurcations of Vector Fields, Springer-Verlag. Kailath, T. 1980. Linear Systems, Prentice-Hall Kivijarvi, H. and M. 1986. Tuominen, J. 1980. Corporate Solving Optimal Control System Dynamics Review, Problems with System Dynamics, Lyneis, Inc. 2(2):138-149. Planning and Policy Design, Cambridge, MIT Press. Mass, N. 1975. Economic Cycles: An Analysis of Underlying Causes, Cambridge, MIT Press. Meadows, D. L. 1969. Dynamics of Commodity Production Cycles, Cambridge, MIT Press. 20 Mohapatra, P. K. J. and S. Sharma, 1985. Synthetic Design of Policy Decision in System Dynamics Models: A Modal Control Theoretical Approach, System Dynamics Review, Rasmussen, S., 1:63-80. E. Mosekilde and J. D. Sterman 1985. Bifurcations and Chaotic Behavior in a Simple Model of the Economic Long Wave, System Dyanmics Review, 1:92-110. Roberge, J. K. 1975. Operational Amplifiers: Theory and Practice, John Wiley and Sons Inc. Sterman, J. D. 1985. A Behavioral Model of the Economic Long Wave, Journal of Economic Behavior and Organization, 6:17-53. Sterman, J. D. and D. Meadows 1985. Strategem-2, Simulation and Games, 16(2):174-202. Sterman, J. D. 1986. The Economic Long Wave: Theory and Evidence, System Dynamics Review, 2(2):87-125. Sterman, J. D. 1987. Testing Behavioral Simulation Models by Direct Experiment, Management Science, 33(12):1572-1592. Sterman, J. D. 1988a. Misperceptions of Feedback in Dynamic Decision Making, Organizational Behavior and Human Decision Processes, forthcoming. 21 III Sterman, J. D. 1988b. Deterministic Chaos in Models of Human Systems: Methodological Issues and Experimental Results, System Dynamics Review, 4:148-178. Szymkat, M. and E. Mosekilde 1986. Global Bifurication Analysis of an Economic Long Wave Model, presented at Simulation Congress, Antwep, Belgium. Vasko, T. Vidyasagar (ed.) 1987. the 2n European The Long Wave Debate, Berlin, Springer. 1978. Nonlinear Systems Analysis, Prentice Hall. 22 APPENDIX Equations for the Controller FDS <1: Given desired polynomial X 3+aX2+bX+c, the feedback vector is: Hi = [ h 1 h2 h3] where 2 - r h 1 = (/(xBGS))(rx where r = -BKS/(CORxr) + 1/ALC + r -a h2= h3 + axr - b - CORxc/(xBGS(r-COR))) = CORxc/(0xBGSx(r-COR)) FDS = - (BKS/BGS)( + r -a) 1: Only two poles can be placed. 2 polynomial X +aN+b, h2 h3 ] Thus, given a desired second order the feedback vector is: where H2 = [h 1 h = axNCAT/ALC - NCAT/ALC 2 h2 = a - bxNCAT - 1/NCAT - 1/ALC The value of h 3 does not effect the characteristic polynomial. Recall that we used this degree of freedom to improve robustness. 23 CAPITAL SECTOR r Figure l.a Simulation Model 24 Equations of the Long Wave model: version to simulate the STRATEGEM-2 game. New orders for capital placed by the consumer goods and services sector (NGS) are exogenous. In the experiment, the new orders of the capital sector (NKS) are determined by the player. Inthe analysis presented here, NKS is determined by the controller. Capital Sector: FDS = PR/DP Fraction of Demand Satisfied (dimensionless) DP = (BKS+BGS)/NCAT Desired Production (units/year) NCAT = 2 Normal Capital Acquisition Time (years) PR = MIN(DP,PC) Production Rate (units/year) PC = K/COR Production Capacity (units/year) COR = 2 Capital Output Ratio (years) K= K + dt* (CA- CD) CD = KIALC Capital Stock (units) Capital Discard Rate (units/year) ALC = 20 Average Life of Capital (years) CA = FDS*BKS/NCAT Capital Acquisition Rate (units/year) BKS = BKS + dt* ( NKS- CA) Backlog of orders of the capital sector (units) NKS = Determined by subject New Orders of the Capital Sector (units/year) Goods and Services Sector: BGS = BGS + dt * ( NGS - GCA) GCA = FDS*BGS/NCAT Backlog of orders of the goods sector (units) NGS = Exogenous Goods Sector: Capital Acquisition Rate (units/year) New Orders of the Goods Sector (units/year) dt = 2 Simulation time step (years) Figure 1.b Equations of the Long Wave 25 Model III K BKS BGS I .11 II Calculate Calculate H1 Desired poles (SPEED 1 ) H2 - - Desired poles (SPEED 2 ) _.~~~~~~~~~~~~~~~~ 1 6K, 6BKS, Input 1 -Input 2 BGS l it LI r- 2 Multiplier Multiplier i . I6NKS 2 6NKS 1 I I Is FDS ) NKS FDS<1? K ,BKS, BGS n n n - YES :Pick 6NKS 2 l EQUATION NGS Pick 6NKS1 - NKS n 9 Figure 2 Block diagram of controller. 26 3 NKS 2 DesiredK 1K 800.0 600.0 12 12 12 --- 1 2- 12 12 400.0 200.0 3 -T/3,-r0.0 0.0 1 FDS X 3 54.0 54.0 36.0 Time 18.0 3 2 NKSn . 72.0 NKS 100.0 1.0 62.5 0.8 25.0 0.5 -12.5 0.3 -50.0 0.0 NKS n SNKS Time Figure 3 Simulation of the basic run: Response to a 10l step increase in demand. Score = 15; K, DesiredK. NKS: units; FDS: X. 27 FDS 111 Units Trial 6: SCORE = 908 Units Trial I : SCORE= 651 ---- r 0 Years Units Trial 16: SCORE= 479 Units Trial 24: SCORE = 366 Years Yeers Trial 40. SCORE = 161 Units Trial 30: SCORE = 212 Units IZUUr I uuu r- 1003 1\ 800 800 600 600 400 400 200 200 0 / I1. , 0 I0 I I , , 20 .1 I% .j 30 , , 47---- 1 Years 40 I 50 ..... . . -- I 60 , I 70 n u , 0 1O 0 , . I. 20 30 Years 40 I 50 , II 60 I 70 Desired Production Production CaDacity .................. Subects Orders for Capital Figure 4 Typical experimental results. Note the large amplitude and long period of the cycles generated by the subjects. scales differ. 28 N.B.: vertical 2 DesiredK 1K 800.0 2 600.0 1 1 is ~1 - 12_----12 12 12 400.0 200.0 0.0 _ ' . ' . .. . . . 1. .I 1 FDS NKS ' l t 36.0 Time 54. 54.0 72.0 2 NKS hhh _ LUU .U I 18.0 0.0 . q - .- 1 4 1.0 150.0 0.8 1oo.o o.5 50.0 0.3 0.0 0.0 I . .- I _e. 3 Time Figure 5 Controller for FDS<1 used in the boundary whenever .95 DP/PC 1.05. Score = 18; K, DesiredK, NKS: units; FDS: X. 29 FDS 111 I K 2 DesiredK 800.0 600.0 1 2 21 1 2- ' 12 12- 400.0 200.0 0.0 . . . .*. - 0.0 ~~18.0 1 18.0 I '' '' ' ' ' ' 1 FDS - -- - - - - - - - - -- -- - - - - - - -- - - 54.0 72.0 2 NKS 200.0 NKS v 36.0 Time 1 1-- 1.0 1 150.0 0.8 100oo.o 0.5 50.0 0.3 0.0 0.0 I Time Figure 6 Controller for FDS=l used in the boundary whenever .95 < DP/PC 1.05. Score = 21; K, DesiredK, NKS: units; 30 FDS: X. FDS 1K 2 DesiredK 800.0 600.0 1 = i 2----11 2- I 12.... 1 2- 12- 400.0 200.0 0.0 ...... w i 0.0 - ' I ' I. . . . . . 1 FDS L .I 0 ' ' ' ' _7_ _ _1_36.0 Time I 1 8.0 - - - - - - - - - - - - - - - - - - - - -_ __ _ 54.0 I 54.0 72.0 2 NKS 111^^ 5U .U 1.0 150.0 NKS 0.8 oo.o 0.5 50.0 FDS 0.3 0.0 C 0.0 54.0 Time Figure 7 Score = Improvement 19; K, of controller DesiredK, NKS: for FDS=1 used units; 31 FDS: X. in the boundary. III Table 1 Summary of simulations with various waveforms for NKS. FDS K NGS Ramp with a lag No equilibrium Ramp 225+2.5(t-2) Above 90% Avg score per period Settles around 100 Sinusoidal 225+50 2rt -2) 72sin Gaussian Noise 225+20xNORMAL Periodic between Distorted sinusoidal _ __ 100% and 70% _ Fluctuates between Noisy but stable 100% and 60% 32 Settles around 125 Below 200

CONTROL-THEORY HEURISTICS FOR IMPROVING Cuneyt M. Ozveren

Related documents

Products

Support

CONTROL-THEORY HEURISTICS FOR IMPROVING Cuneyt M. Ozveren

Related documents

Add this document to collection(s)

Add this document to saved

Suggest us how to improve StudyLib