Games of pure conflict two-person constant sum games Two person constant sum game • Sometimes called zero-sum game. • The sum of the players’ payoffs is the same, no matter what pair of actions they take. Maximin strategy • One way to play a game is to take a very cautious view. • Your payoff from any action depends on other’s actions. • In a two-player game, you might assume other player always does what is worst for you. • Given that assumption, you would choose the strategy such that gives you the best payoff if the other player always does what is worst for you given your strategy. Maximin in Constant Sum games In a two-person constant sum game, if the other player is doing his best response to your action, he is taking the action that is worst for you, given your action. A) Yes B) No C) Maybe Why so? • If the sum of your payoff and your opponent’s payoff is constant, then the larger is your opponent’s payoff, the smaller is yours. • So if he is choosing the highest possible payoff for himself, given your action, he is taking the action that is worst for you, given your action. In a two-player constant sum game, suppose that you know that the other player has a spy who will find out your strategy before he chooses his. Your best option is to use a maximin strategy. A) Yes B) No C) Maybe Simple hide and seek Player 2 (Seeker) Look Upstairs p upstairs Hide Player 1 (Hider) Hide downstairs Look Downstairs 0,1 1,0 Is this a constant sum game? A) Yes B) No 1,0 0, 1 Mixed strategies and maximin • Suppose you are hider, choosing a mixed strategy, and you believe that seeker will do what is worst for you, given your mixed strategy. • This is not a silly assumption in a two-player zero sum game, because what is worst for you is best for your opponent. • The maximin player will choose her best mixed strategy given that she believes opponent will respond with the strategy that is worst for her. Clicker question • Suppose that you are hider and you choose to randomize, hiding upstairs with probability .6. What strategy by seeker is worst for you? A) Look upstairs with probability .6 B) Look upstairs and downstairs with equal probability C) Look upstairs for sure D) Look upstairs with probability .4 Clicker question • If you are hider and hide upstairs with probability .6 and seeker uses the strategy that is worst for you, what is your expected payoff? (Remember: Your payoff is 0 if he finds you, 1 if he doesn’t.) A) .6 B) .4 C) .5 D) .35 More generally • If you are hider and you hide upstairs with probability p>1/2, what is the strategy for seeker that is worst for you? • What is your expected payoff if he does that? Where p is the probability you hide upstairs: If p>0, he will look upstairs always. He will not find you with probability 1-p. Your expected payoff is p x 0 + (1-p) x 1=1-p What if you hide upstairs with p<1/2? • What is worst thing that seeker can do to you? • (He’ll look downstairs for sure.) • What is your expected payoff? If p<1/2, Seeker stategy that is worst for hider is Look downstairs always. Payoff to hider is p x 1+(1-p) x 0=p Maximin for hide and seek The pessimist’s view Maximin strategy in hide and seek Randomize with p=1/2. There is a Nash equilibrium where both play p=1/2. If each player knew that the other had a spy who knew your strategy, both would choose to randomize with p=1/2. Battle of Sexes Bob Movie A Movie A Alice Movie B 3,2 1,1 0,0 2,3 Is this a constant sum game? A) Yes B) No C) Maybe Movie B Clicker Question In the Battle of the Sexes game, where Alice and Bob play only pure strategies, what is the strategy profile in which each plays a maximin strategy? A) Both go to A B) Both go to B C) Alice goes to A, Bob goes to B D) Alice goes to B, Bob goes to A E) There are two maximin strategy profiles. (Both go to A and both go to B) Maximin strategies Bob Movie A Movie A Alice Movie B Movie B 3,2 1,1 0,0 2,3 What would Bob do if he thinks Alice will always do what is worst for Bob? His worst payoff if he goes to A is 0. His worst payoff if he goes to B is 1. His maximin strategy is therefore go to B. What is Alice’s maximin strategy? If Alice and Bob both use maximin strategies in this game, is the outcome a Nash equilbrium? A) Yes B) No In the Movie Choice game between Alice and Bob, suppose that Alice knows that Bob has a a spy who will find out Alice’s strategy before he chooses his. Alice’s best option is to use her maximin strategy. A) Yes B) No What should Alice do if she knows that Bob will find out her strategy before she moves? A) Go to A B) Go to B C) Use a mixed strategy Working Some Problems Penalty Kick Goalkeeper Jump Left Jump Right Kick Left .3 , .7 .8, .2 Kick right .9, .1 .5, .5 Shooter Is this a constant sum game? A) Yes B) No The story • When Goalie guesses wrong, Kicker either scores or misses the goal. • When Goalie guesses right, Goalie sometimes but not always stops the ball. • Kicker kicks harder and more accurately to the right than to the left. • Both players know this. (Fairly realistic in professional soccer.) Finding N.E. goalie strategy • Suppose Goalie goes Left with probability p • Payoff for Shooter from shooting Left is .3p+.8(1-p)=.8-.5p Payoff for Shooter from shooting Right is .9p +.5(1-p)=.5+.4p Shooter will randomize if .8-.5p=.5+.4p Solve this equation to find p=1/3 Finding N.E. Shooter strategy If Shooter goes Left with probability q: Payoff to Goalie from jumping Left is .7q+.1(1-q)=.1+.6q Payoff to Goalie from jumping Right is .2q+.5(1-q)=.5-.3q Goalie will randomize if .1+.6q=.5-.3q which implies that q=4/9. Does this game have a Nash equilibrium in which Kicker mixes left and right but does not kick to center? • If there is a Nash equilibrium where kicker never kicks middle but mixes between left and right, Goalie will never play middle but will mix left and right (Why?) • If Goalie never plays middle but mixes left and right, Kicker will kick middle. (Why?) • So there can’t be a Nash equilibrium where Kicker never kicks Middle. (See why?) Problem 4: For what values of x is there a mixed strategy Nash equilibrium in which the victim might resist or not resist and the Mugger assigns zero probability to showing a gun? • In a Nash equilibrium where mugger mixes only between no gun and gun hide, mugger must be indifferent between these strategies. • If Victim mixes with probability p of resisting, Expected payoff to mugger of no gun is 2p+6(1-p). Expected payoff to mugger of gun-hide is 3p+5(1-p). These are equal when P=1/2. So in a mixed strategy equilibrium it must be That expected payoff to gunman is 4. This will be a N.E. only if x<4. Problem 7.7, Find mixed strategy Nash equilibria A mixed strategy N.E. strategy does not give positive probability To any strictly dominated strategy c dominates a and y dominates z Look at reduced game without these strategies In this reduced game, b dominates d. We are left with a game where each player has 2 Possible strategies—b and c for Player 1, x and y For Player 2. Find a mixed strategy N.E. for this game. Expected Utility Theory of Choice Under Uncertainty • Suppose that you face random outcomes. You assign a “utility” to each possible outcome in such a way that your choices among uncertain prospects are those that maximize “expected utility”. Expected utility Example: Utility of money • Suppose you have a lottery that will with probability 1/4 win 10 million dollars and with probability ¾ will be worthless. You get just one chance to sell your ticket. Would you sell it for 2.5 million dollars? A) Yes B) No Question • Suppose you have a lottery ticket that will with probability 1/4 win 10 million dollars and with probability ¾ will be worthless. What is the expected value of this lottery ticket? Expected utility Example: Utility of money • Suppose you have a lottery ticket that will with probability 1/4 win 10 million dollars and with probability ¾ will be worthless. You get just one chance to sell your ticket. Would you sell it for 1 million dollars? A) Yes B) No Expected utility Example: Utility of money • Suppose you have a lottery ticket that will with probability 1/4 win 10 million dollars and with probability ¾ will be worthless. You get just one chance to sell your ticket. Would you sell it for 500 thousand dollars? A) Yes B) No Expected utility Example: Utility of money • Suppose you have a lottery ticket that will with probability 1/4 win 10 million dollars and with probability ¾ will be worthless. You get just one chance to sell your ticket. Would you sell it for 100 thousand dollars? A) Yes B) No Expected utility Example: Utility of money • Suppose you have a lottery ticket that will with probability 1/4 win 10 million dollars and with probability ¾ will be worthless. You get just one chance to sell your ticket. Would you sell it for 10 thousand dollars? A) Yes B) No Construct a utility scale • Let u(10 million)=1 Let u(0)=0. • Then ask question. How much money X for sure would be just as good as having a ¼ chance of winning 10 million and ¾ chance of 0? • Then assign u(X)=(3/4)u(0)+(1/4)u(10,000,000)= (3/4)0+(1/4)1=1/4. Assigning utility to any income • Lets choose a scale where u(0)=0 and u(10 million)=1. • Take any number X. Find a probability p(X) so that you would just be willing to pay $X for a lottery ticket that pays 10 million with probability p(X) and 0 with probability 1-p(x). • Assign utility p(X) to having $X. Field Goal or Touchdown? • Field goal is worth 3 points. • Touchdown is worth 7 points. Which is better? Sure field goal or probability ½ of touchdown? Finding the coach’s von Neumann Morgenstern utilities • Set utility of touchdown u(T)=1 • Set utility no score u(0)=0 The utility of a gamble in which you get a touchdown with probability p and no score with probability 1-p is pu(T)+(1-p)u(0). What utility u(F) to assign to a sure field goal? Let p* be the probability such that the coach is indifferent between scoring a touchdown with probability p* (with no score with prob 1-p*) and having a sure field goal. Then u(F)=p*u(T)+(1-p*)u(0)=p*x1+(1-p*)x0=p*. So long, for now... Problem 10. Each of three players is deciding between the pure strategies go and stop. The payoff to go is 120, where m is the number of players that choose go, and the payoff to stop is 55 m (which is received regardless of what the other players do). Find all Nash equilibria in mixed strategies. Let’s find the “easy ones”. Are there any symmetric pure strategy equilibria? How about asymmetric pure strategy equilibria? How about symmetric mixed strategy equilibrium? Solve 40p^2+60*2p(1-p)+120(1-p)2=55 40p2-120p+65=0 What about equilibria where one guy is in for sure and other two enter with identical mixed strategies? For mixed strategy guys who both Enter with probability p, expected payoff from entering is (120/3)p+(120/2)(1-p). They are indifferent about entering or not if 40p+60(1-p)=55. This happens when p=1/4. This will be an equilibrium if when the other two guys enter with Probability ¼, the remaining guy is better off entering than not. Payoff to guy who enters for sure is: 40*(1/16)+60*(3/8)+120*(9/16)=92.5>55. Advanced Rock-Paper-Scissors Rock Paper Scissors Rock 0,0 -1,1 2,-2 Paper 1,-1 0,0 -1,1 Scissors -2,2 1,-1 0,0 Are there pure strategy Nash equilibria? Is there a symmetric mixed strategy Nash equilibrium? What is it? Finding Mixed Strategy Nash Equilibrium Rock Paper Scissors Rock 0,0 -1,1 2,-2 Paper 1,-1 0,0 -1,1 Scissors -2,2 1,-1 0,0 Let probabilities that column chooser chooses rock, paper, and scissors be r, p, and s=1-p-r Row chooser must be indifferent between rock and paper This tells us that -p+2(1-p-r)=r-(1-p-r) Row chooser must also be indifferent between rock and scissors. This tells us that –p+2(1-p-r)=-2r+p We have 2 linear equations in 2 unknowns. Let’s solve. They simplify to 4r+4p=3 and 4p=2. So we have p=1/2 and r=1/4. Then s=1-p-r=1/4.