Topic 4
Mixed Strategies
“I used to think I was indecisive
– but now I’m not so sure.”
- Anonymous
Review
Predicting likely outcome of a game
Sequential games
Look forward and reason back
Simultaneous games
Look for simultaneous best replies
Mike Shor
What if (seemingly) there are no equilibria?
2
Mike Shor
Employee Monitoring
Employees can work hard or shirk
Salary: $100K unless caught shirking
Cost of effort: $50K
Managers can monitor or not
Value of employee output: $200K
Profit if employee doesn’t work: $0
Cost of monitoring: $10K
3
Mike Shor
Employee Monitoring
Employee
Manager
Monitor No Monitor
Work
50 , 90 50 , 100
Shirk
0 , -10 100 , -100
Best replies do not correspond
No equilibrium in pure strategies
What do the players do?
4
Mike Shor
Employee Monitoring
John Nash proved:
Every finite game has a Nash equilibrium
So, if there is no equilibrium in pure strategies, we have to allow for mixing or randomization
5
Mike Shor
Mixed Strategies
Unreasonable predictors of one-time interaction
Reasonable predictors of long-term proportions
6
Mike Shor
Game Winning Goal
7
Mike Shor
K
I
C
K
E
R
Soccer Penalty Kicks
(Six Year Olds Version)
G O A L I E
L R
L
R
-1
1 ,
, 1
-1
1
-1 ,
, -1
1
8
Mike Shor
Soccer Penalty Kicks
There are no mutual best responses
Seemingly, no equilibria
How would you play this game?
What would you do if you know that the goalie jumps left 75% of the time?
9
Mike Shor
Probabilistic Soccer
Allow the goalie to randomize
Suppose that the goalie jumps left p proportion of the time
What is the kicker’s best response?
If p=1, goalie always jumps left
we should kick right
If p=0, goalie always jumps right
we should kick left
10
Mike Shor
Probabilistic Soccer (continued)
The kicker’s expected payoff is:
Kick left: -1 x p + 1 x (1-p) = 1 – 2p
Kick right: 1 x p - 1 x (1-p) = 2p – 1
should kick left if: p < ½
(1 – 2p > 2p – 1) should kick right if: p > ½ is indifferent if: p = ½
What value of p is best for the goalie?
11
Mike Shor
Probabilistic Soccer (continued)
Goalie’s p
L (p = 1)
R (p = 0) p = 0.75
p = 0.25
p = 0.55
p = 0.50
Kicker’s strategy
R
L
R
L
R
Either
Goalie’s
Payoff
-1.0
-1.0
-0.5
-0.5
-0.1
0
12
Mike Shor
Probabilistic Soccer (continued)
Mixed strategies:
If opponent knows what I will do, I will always lose!
Randomizing just right takes away any ability for the opponent to take advantage
If opponent has a preference for a particular action, that would mean that they had chosen the worst course from your perspective.
Make opponent indifferent between her strategies
13
Mike Shor
Mixed Strategies
Strange Implications
A player chooses his strategy so as to make her opponent indifferent
If done right, the other player earns the same payoff from either of her strategies
14
Mike Shor
Mixed Strategies
COMMANDMENT
Use the mixed strategy that keeps your opponents guessing.
15
Mike Shor
Employee Monitoring
Employee
Manager
Monitor No Monitor
Work 50 , 90 50 , 100
Shirk
0 , -10 100 , -100
Suppose:
Employee chooses (shirk, work) with probabilities (p,1-p)
Manager chooses (monitor, no monitor) with probabilities (q,1-q)
16
Mike Shor
Keeping Employees from Shirking
First, find employee’s expected payoff from each pure strategy
If employee works: receives 50
Profit(work) = 50
q + 50
(1-q)
= 50
If employee shirks: receives 0 or 100
Profit(shirk) = 0
q + 100
(1-q)
= 100 – 100q
17
Mike Shor
Employee’s Best Response
Next, calculate the best strategy for possible strategies of the opponent
For q<1/2 : SHIRK
Profit (shirk) = 100-100q > 50 = Profit(work)
For q>1/2 : WORK
Profit (shirk) = 100-100q < 50 = Profit(work)
For q=1/2 : INDIFFERENT
Profit(shirk) = 100-100q = 50 = Profit(work)
18
Mike Shor
Manager’s Equilibrium Strategy
Employees will shirk if q<1/2
To keep employees from shirking, must monitor at least half of the time
Note: Our monitoring strategy was obtained by using employees’ payoffs
19
Mike Shor
Manager’s Best Response
Monitor: 90
(1-p) - 10
p
No monitor: 100
(1-p) -100
p
For p<1/10 : NO MONITOR monitor = 90-100p < 100-200p = no monitor
For p>1/10 : MONITOR monitor = 90-100p > 100-200p = no monitor
For p=1/10 : INDIFFERENT monitor = 90-100p = 100-200p = no monitor
20
Cycles
1 shirk
Mike Shor p
1/10 work
0
0 no monitor
1/2 q
1 monitor
21
Mutual Best Responses
1 shirk
Mike Shor p
1/10 work
0
0 no monitor
1/2 q
1 monitor
22
Mike Shor
Equilibrium Payoffs
1/2
Monitor
1/2
No Monitor
9/10
Work
50 , 90 50 , 100
1/10
Shirk
0 , -10 100 , -100
23
Solving Mixed Strategies
Seeming random is too important to be left to chance!
Determine the probability mix for each player that makes the other player indifferent between her strategies
Mike Shor
Assign a probability to one strategy (e.g., p)
Assign remaining probability to other strategy
Calculate opponent’s expected payoff from each strategy
Set them equal
24
Mike Shor
New Scenario
What if cost of monitoring were 50?
Employee
Manager
Monitor No Monitor
Work
50 , 50 50 , 100
Shirk 0 , -50 100 , -100
25
Mike Shor
New Scenario
To make employee indifferent:
26
Mike Shor
Real Life?
Sports
Football
Tennis
Baseball
Law Enforcement
Traffic tickets
Price Discrimination
Airline stand-by policies
Policy compliance
Random drug testing
27
Mike Shor
IRS Audits
1997
Offshore evasion compliance study
Calibrated random audits
2002
IRS Commissioner Charles Rossotti:
Audits more expensive now than in ’97
Number of audits decreased slightly
Offshore evasion alone increased to $70 billion dollars!
Recommendation:
As audits get more expensive, need to increase budget to keep number of audits constant!
28
Mike Shor
Law Enforcement
Motivate compliance at lower monitoring cost
Audits
Drug Testing
Parking
Should punishment fit the crime?
29
Mike Shor
Football
You have a balanced offense
Offense
Run
Pass
Defense
Run
0 , 0
5 , -5
Pass
5 , -5
0 , 0
Equilibrium:
run half of the time
defend the run half of the time
30
Football
You have a balanced offense
Offense
Run
Pass
Defense
Run
1 , -1
5 , -5
Pass
8 , -8
0 , 0
Mike Shor
The run now works better than before
What is the equilibrium?
31
Mike Shor
Effects of Payoff Changes
Direct Effect:
The player benefitted should take the better action more often
Strategic Effect:
Opponent defends against my better strategy more often, so I should take the action less often
32
Mike Shor
Mixed Strategy Examples
Market entry
Stopping to help
All-pay auctions
33
Mike Shor
Market Entry
N potential entrants into market
Profit from staying out: 10
Profit from entry: 40 – 10 m
m is the number that enter
Symmetric mixed strategy equilibrium:
Earn 10 if stay out. Must earn 10 if enter!
34
Stopping to Help
Mike Shor
N people pass a stranded motorist
Cost of helping is 1
Benefit of helping is B > 1
i.e., if you are the only one who could help, you would, since net benefit is B-1 > 0
Symmetric Equilibria
p is the probability of stopping
Help: B-1
Don’t help: B x chance someone stops
35
Mike Shor
Stopping to Help (continued)
Don’t help:
B x chance someone stops
= B x ( 1 – chance no one stops )
= B x ( 1 – (1-p) N-1 )
Set help = Don’t help
B x ( 1 – (1-p) N-1 ) = B – 1
p = 1 - (1/B) 1/(N-1)
36
Mike Shor
Probability of Stopping
1
0.9
0.8
0.7
0.6
p 0.5
0.4
0.3
0.2
0.1
0
0 2 4 6
N
8 10 12
B=2
37
Mike Shor
Probability of Someone Stopping
38
Mike Shor
All-Pay Auctions
Players decide how much to spend
Expenditures are sunk
Biggest spender wins a prize worth V
How much would you spend?
39
Mike Shor
Pure Strategy Equilibria?
Suppose rival spends s < V
Then you should spend just a drop higher
Then rival will also spend a drop higher
Suppose rival spends s ≥ V
Then you should spend 0
Then rival should spend a drop over 0
No equilibrium in pure strategies
40
Mike Shor
Mixed Strategies
We need a probability of each amount
Use a distribution function F
F(s) is the probability of spending up to s
ε
Imagine I spend s
Profit: V x Pr{win} – s
= V x F(s) – s
41
Mixed Strategies
For an equilibrium, I must be indifferent between all of my strategies
V x F(s) – s = V x F(s’) – s’ for any s, s’
What about s=0?
Probability of winning = 0
So V x 0 – 0 = 0
Mike Shor
V x F(s) – s = 0
F(s) = s/V
42
Mike Shor
Mixed Strategies
F(s) = s/V on [0,V]
This implies that every amount between
0 and V is equally likely
Expected bid = V/2
Expected payment = V
There is no economic surplus to firms competing in this auction
43
All-Pay Auctions
Patent races
Political contests
Wars of attrition
Mike Shor
Lesson: With equally-matched opponents, all economic surplus is competed away
If running the competition: all-pay auctions are very attractive
44
Mike Shor
Mixed Strategies in Tennis
Study:
Ten grand slam tennis finals
Coded serves as left or right
Determined who won each point
Found:
All serves have equal probability of winning
But: serves are not temporally independent
45
Mike Shor
What Random Means
Study:
A fifteen percent chance of being stopped at an alcohol checkpoint will deter drinking and driving
Implementation
Set up checkpoints one day a week
(1 / 7 ≈ 14%)
How about Fridays?
46
Mike Shor
Exploitable Patterns
CAVEAT
Use the mixed strategy that keeps your opponents guessing.
BUT
Your probability of each action must be the same period to period.
47
Exploitable Patterns
Manager’s strategy of monitoring half of the time must mean that there is a 50% chance of being monitored every day !
Mike Shor
Cannot just monitor every other day.
Humans are very bad at this.
Exploit patterns!
48