2.3 Steady State Probabilities 2.3.1 Finding Steady State Probabilities In section 2.2.1 we saw how to compute the powers Pn of the transition matrix P. We saw that each element of Pn was a constant plus a sum of multiples of powers of numbers i whose absolute value is no greater than one. In fact, in most cases the i have absolute value strictly less than one, so their powers go to zero as n . So Pn approaches a limiting matrix, which we shall denote by P, as n tends to . The rows of this limiting matrix contain the probabilities of being in the various states as time gets large. These probabilities are called steady state probabilities. Let's look at the examples of the previous section. Example 1. If it is sunny one day there is a 90% chance that it will be sunny the next day and a 10% chance that it will be cloudy. If it cloudy on a particular day, there is a 60% chance that it will be sunny the next day and a 40% chance that it will be cloudy. We model this by a Markov chain with two states with state 1 corresponding to being sunny on a particular day and state 2 to being cloudy. We let n count the days with n = 0 being today so that Xn is the state of day n, either sunny or cloudy. The transition matrix is (1) P = 0.9 0.1 0.6 0.4 In the previous section we saw that 7 + 7(0.3) 6 - 6(0.3) 7 7 6 Pn = 1 n n 1 1 - (0.3)n 7 7 1 6 + (0.3)n 7 7 As n one has 7 + 7(0.3) 6 - 6(0.3) 7 7 6 Pn = 1 n n 1 1 - (0.3)n 7 7 1 6 + (0.3)n 7 7 7 6 7 6 P = 1 7 1 7 since (0.3)n 0. In fact within a week (7 days) one has Pn quite close to P since (0.3)7 0.0002. So for a day more than a week in the future one has Pr{sunny} Pr{cloudy} 6 7 1 7 no matter whether it is sunny or cloudy today. In the long run it is sunny six days out of seven. These probabilities 2.3.1 - 1 1 = 2 = lim Pr{ Xn = 1 | X0 = 1} = n lim Pr{ Xn = 2 | X0 = 1} = n lim Pr{ Xn = 1 | X0 = 2} = 6 7 lim Pr{ Xn = 2 | X0 = 2} = 1 7 n n are called the steady state probabilities of being in states 1 and 2. In many applications of Markov chains the steady state probabilities are the main items of interest since one is interested in the long run behavior of the system. For the time being we shall assume the following about the transition matrix P. (1) The eigenvalue one is not repeated. (2) The other eigenvalues have absolute value strictly less than one. Then Pn = TDnT -1 0 0 1 t12 1 t22 = 1 tn2 t1n t2n 1 0 (2)n tnn 0 T ( ) 0 0 n -1 n In general each entry of Pn is a constant plus a sum of multiples of powers of numbers of absolute value less than one. As n approaches infinity Pn approaches the matrix P of the constants. Thus as n we get P 1 t12 1 t22 = 1 tn2 = 0 1 1 0 1 0 t1n 1 t2n 0 tnn 0 0 0 0 0 0 0 0 0 0 T -1 21 2 22 n 2n n1 n2 nn 1 0 1 1 0 = 1 0 0 T -1 1 2 2 n n 1 2 n 1 = 0 0 where we let 21 2 22 n 2n n1 n2 nn 1 2 2 n n 1 2 n 1 T -1 = So 1 P = So the rows of P are all equal to the vector which is the first row of T-1. If we take the equation P = TDT -1 and multiply both sides on the left by T-1 we get T -1P = DT -1. The ith row of T -1P is the ith row of T-1 times 2.3.1 - 2 P. The ith row of DT -1 is i times the ith row of T-1. Thus the ith row of T -1P is i times the ith row of T-1. So the ith row of T -1 is a left eigenvector of P with eigenvalue i. In particular, the first row of T -1 is a left eigenvector of P with eigenvalue 1. In other words (3) P = where = (1, 2, …, n) So the rows of P are vectors which are solutions of the equation P = and which are probability vectors, i.e. the sum of the entries of equals 1. The entries of are called steady state probabilities. Another way to see that the rows of P satisfy (7) is to start with the relationship Pn+1 = PnP and let n . Since Pn P and Pn+1 P one gets P = PP. Since the ith row of PP is the ith row of P times P we see the rows of P satisfy (3). If the eigenvalue one is not repeated then there is only one such vector satisfying (3). In this case this is enough information to find it without computing Pn and taking the limit as n . Example 1 (continued). Find the steady state vector = (1, 2) in Example 1. Since P = we get 0.9 0.1 (1, 2) 0.6 0.4 = (1, 2) Therefore 0.91 + 0.62 = 1 0.11 + 0.42 = 2 Therefore - 0.11 + 0.62 = 0 0.11 - 0.62 = 0 Both of these equations are equivalent to 1 = 62. In order to find 1 and 2 we need to use the fact that 1 + 2 = 1. Combining this with 1 = 62 gives 62 + 2 = 1 or 72 = 1 or 2 = 1/7 and 1 = 6/7. Example 2. (see Problem 1 of section 2.1.1) At the start of each day an office copier is checked and its condition is classified as either good (1), poor (2) or broken (3). Suppose the transition matrix from one state to another in the course of day is 2.3.1 - 3 P = 0.8 0 0.32 0.06 0.4 0.08 0.14 0.6 0.6 In the previous section we saw that 0.55 + (0.52)(0.51) - (0.078)(0.28) P = 0.55 - (1.74)(0.51)n + (1.19)(0.28)n 0.55 - (0.33)(0.51)n - (0.22)(0.28)n n n n 0.1 - (0.05)(0.51)n - (0.05)(0.28)n 0.35 - (0.47)(0.51)n + (0.12)(0.28)n 0.1 + (0.18)(0.51)n + (0.72)(0.28)n 0.35 + (1.56)(0.51)n - (1.91)(0.28)n n n n n 0.1 + (0.03)(0.51) - (0.14)(0.28) 0.35 + (0.29)(0.51) + (0.36)(0.28) As n one has Pn P = 0.55 0.55 0.55 0.1 0.35 0.1 0.35 0.1 0.35 So if n is bigger than 10 or so Pr{ the copier is in good condition } 0.55 Pr{ the copier is in poor condition } 0.10 Pr{ the copier is broken } 0.35 So the vector of steady state probabilities is = (0.55, 0.10, 0,35). As in the previous example, this vector can be found by solving P = along with the fact that its components sum to one. The equation P = is 0.8 (1, 2, 3) 0 0.32 0.06 0.4 0.08 0.14 0.6 = (1, 2, 3) 0.6 Therefore 0.81 + 0.323 = 1 0.061 + 0.42 + 0.83 = 2 0.141 + 0.62 + 0.63 = 3 Therefore - 0.21 + 0.323 = 0 0.061 - 0.62 + 0.083 = 0 0.141 + 0.62 - 0.43 = 0 The third equation is just the negative of the sum of the first and the second, so we can solve the first two alone. Multiplying them by 100 gives 2.3.1 - 4 - 201 + 323 = 0 61 - 602 + 83 = 0 Solving the first equation for 1 gives 1 = 8 . Substituting into the second equation we get 5 3 48 - 602 + 83 = 0 5 1 22 8 22 . Since 1 + 2 + 3 = 1 we have 5 3 + 75 3 + 3 = 1. Solving for 3 gives 75 3 75 120 22 3 = 217. So 1 = 217 and 2 = 217. Thus Solving for 2 gives 2 = 120 22 75 = (1, 2, 3) = ( 217, 217, 217 ) (0.55, 0.10, 0.35) Problem 1. Let P = 1 - p p . Show that the steady state vector is = ( q , p ) q 1 - q p+q p+q 2.3.1 - 5