Document 10435135

advertisement
MATH 166 Spring 2016
c
Wen
Liu
G.2
G.2 Mixed Strategy Games
Expected Value: Suppose a game has a payoff matrix

a11 a12 · · ·

a22 · · ·
 a
A =  21
···

am1 am2 · · ·

a1n

a2n 


amn
and the row player uses the strategy p = [p1 p2 · · · pm ], where each pi is the probability of choosing
the ith strategy, and the column player uses the strategy q T = [q1 q2 · · · qn ], where each qj is the
probability of choosing the jth strategy, then the expected value for those two strategies is
E(p, q) = pAq

= [p1 p2
a11 a12

a22
 a
· · · pm ]  21

am1 am2
···
···
···
···

a1n

a2n  


amn
q1
q2
..
.





qn
Example: Consider the game with the payoff matrix
−1 1
A=
2 4
Find the expected values for the strategies below and determine which the row player should use.
(a) p = [0.2 0.8] and q T = [1 0].
(b) p = [0.7 0.3] and q T = [1 0].
Remark: From the example above we see that if a player uses different strategies that the payoffs
will change. There exists an optimal strategy for each player that we shall call p for the row player
and q for the column player. If both players use their optimal strategies, the expected outcome is
v = E (p, q), the value of the game. The goal of game theory is to find the optimal strategy for each
player.
Page 1 of 3
MATH 166 Spring 2016
c
Wen
Liu
G.2
Example: Find the optimal strategy for the row player and the value of the game given in previous
example.
Generalized 2 × 2 Game: If the payoff matrix A is given by
a11 a12
A=
a21 a22
Then the optimal row strategy is
p = [p1 p2 ]
a22 − a21
=
a11 + a22 − a12 − a21
1 − p1
and the optimal column strategy is
q = [q1 q2 ]
a22 − a12
=
a11 + a22 − a12 − a21
1 − q1
The optimal value of the game is
v = E (p, q)
= pAq
a11 a22 − a12 a21
=
a11 + a22 − a12 − a21
Page 2 of 3
MATH 166 Spring 2016
c
Wen
Liu
G.2
Example: Josh and Sandy play a card game where each player has two cards to show. Josh has the
three and four of hearts and Sandy has the two and five of diamonds. Both show a card at the same
time. If the sum is even, Josh wins the sum of the two cards in dollars. If the sum is odd, Sandy wins
the sum of the two cards in dollars. What is Josh’s optimal strategy and the value of this game?
Definition: Row i is said to dominate row j if every element in row i is greater than or equal to
the corresponding element in row j. Also column i is said to dominate column j if every element in
column i is less than or equal to the corresponding element in column j.
Example: Find the value and the optimal strategy for the row player for the game with the following
payoff matrix


−2 −1 0
B =  −1 0
1 
2
3
−4
Page 3 of 3
Download