Page 1 Math 166-copyright Joe Kahlig, 10A Section G.2: Mixed Strategies In a strictly determined game, the optimal strategy is a pure strategy of selecting the row/column for the saddle point. When the payoff matrix does not have a saddle point, i.e. is not strictly determined, a mixed strategy is used by the players. Example: Here is the payoff matrix (in terms of the row player) for a game. Assume that the row player will chose strategy R-1 40% of the time and use R-2 the rest of the time. Assume that the column player will chose strategy C-1 15% of the time and C-2 the rest. If this game is repeated many times using these mixed strategies, find the expected value of the game. C-1 A = R-1 3 R-2 −3 C-2 −2 1 Expected value of a Game: Let P and Q be the matrices representing the mixed strategies of the row player, P, and the column player, Q, and let A be a payoff Matrix. P = h p1 p2 pm ··· i A= a11 a21 .. . a12 a22 .. . am1 am2 ··· ··· ··· ··· a1n a2n .. . amn Then the expected value is given by Example: Find the expected value of the game for these mixed strategies. C-1 A = R-1 3 R-2 −3 A) P = h B) P = h 0.75 0.5 C-2 −2 1 0.25 0.5 i i Q= Q= " " 0.3 0.7 0.6 0.4 # # Q= q1 q2 .. . qn Page 2 Math 166-copyright Joe Kahlig, 10A Definition: For every choice of strategy for the row player there is a best counterstrategy–that is a strategy for the column player that results in the least expected value for the game. An optimal mixed strategy for the row player is one for which the expected value against the column player’s best counterstrategy is as large as possible. Definition: For every choice of strategy for the column player there is a best counterstrategy–that is a strategy for the row player that results in the largest expected value for the game. An optimal mixed strategy for the column player is one for which the expected value against the column player’s best counterstrategy is as small as possible. Theorem: For a game with the payoff matrix given as A= " a c b d # Then the optimal mixed strategy for the row player is given by P = h p1 p2 i , where p1 = d−c and p2 = 1 − p1 . a+d−b−c The optimal mixed strategy for the column player is given by Q= " q1 q2 # , where q1 = d−b and q2 = 1 − q1 . a+d−b−c The value of the game(expected value) is E = P AQ = ad − bc a+d−b−c Example: Find the value of the game and the optimal strategy for the row and column players. A= " 3 2 −1 4 # Example: Find the value of the game and the optimal strategy for the row and column players. A= " 15 −5 10 20 # Page 3 Math 166-copyright Joe Kahlig, 10A Example: Which of these strategies should never be used by the row player? A= R-1 R-2 R-3 C-1 4 6 5 C-2 5 7 6 C-3 2 3 8 Example: Which of these strategies should never be used by the column player? A= R-1 R-2 R-3 C-1 4 6 5 C-2 5 7 6 C-3 2 3 8 Definition: Row i is said to dominate row j if every entry of row i is sponding entry of row j Column i is said to dominate column j if every entry of column i is ing entry of column j to the corre- to the correspond- Rows/columns that are dominated may be removed from the payoff matrix without affecting the analysis. Example: Find the optimal strategy for the row and the column player. A= R-1 R-2 R-3 C-1 4 6 5 C-2 5 7 6 C-3 2 3 8 Math 166-copyright Joe Kahlig, 10A Page 4 Example: Reduce the matrix by deleting rows or columns that are dominated by other rows or columns. 6 2 7 −3 6 4 2 1 A) B) 2 0 2 −8 2 1 4 3 −2 9 6 6 3 4 1 3 1 2 8 1 Example: Find the optional strategy for both players. 4 5 −3 4 0 2 1 2 −8 3 −2 7 −3 2 0 3