Dixit and Skeath Chapter 4 (2004)

advertisement
Economics Exam notes
Chapter 4
Games are said to have simultaneous moves if players must move without the
knowledge of what their rivals have chosen to do. A game is also simultaneous when
players choose their actions in isolation, even if the choices are made at different
hours no player will have information about what the other have done or will do. Thus
simultaneous-move games have imperfect information.
Strategy is a complete plan of action. In a simultaneous game, each player has
at most one opportunity to act, whereas in a sequential they have multiple
opportunities. Therefore there is no distinction between a strategy and action in
simultaneous-move games.
Mixed strategies: A strategy can be a probabilistic choice from the basic
actions initially specified (for ex: in sports where players deliberately randomize their
choice of action to keep opponent guessing). Pure strategies are the basic initially
specified actions.
Simultaneous-move games with discrete strategies are most often depicted
with the use of a game table (aka game matrix or payoff table). The table is called
the normal form or the strategic form of the game.
Zero-sum or constant-sum game: are games where the interests of the two
sides are exactly the opposite of each other, therefore for each combination of the
player’s choices, the payoffs of one can be obtained by reversing the sign of the
payoffs to the other.
Each player wants to pick an action that yields her the highest payoff. Nash
equilibrium is a list of strategies, one for each player, such that no player can get a
better payoff by switching to some other strategy that is available to her while all the
other players stick to the strategies specified for them in the list.
The definition of Nash equilibrium does not require equilibrium choices to be
strictly better than other available choices. Figure 4.3: given column’s choice of
middle, row could not do any better than she does when choosing low so low, middle
qualifies for a Nash equilibrium.
More important, it does not have to be jointly for the players. In figure 4.1
(bottom, right) gives payoffs 9,7 which are better for both players then 5,4 of the nash
equilibrium. However playing independently, they cannot sustain (bottom, right).
Given that column plays right, row would want to deviate from bottom to lo and get
12 instead of 9. Getting the jointly better payoff of 9,7 would require cooperative
action that made such cheating possible.
Nash equilibrium can be found using cell-by-cell inspection, or enumeration.
People play simultaneous-move games and do make choices and do replaces the
actual knowledge or observation of others’ actions players can make blind guess. A
more systematic way to figure what others are doing is experience and observation for
instance- if players play this game or similar ones with similar players all the time
they may develop a pretty good idea of what others do. Another method is the logical
process of thinking through others’ thinking- as in out yourself in their position.
Even when all the rules of a game are known without any uncertainty external
to the game each player may be uncertain about what actions the others are taking at
the same time. Players choose in the face of this strategic uncertainty by forming
some subjective views or estimates about the others actions- that is what the notion of
belief captures. If a player does not know the actual choices of the others but has
beliefs about them, in Nash equilibrium those beliefs would have to be correct. Thus
Nash equilibrium is a set of strategies one for each player such that, 1) each player has
correct beliefs about the strategies of the others, 2) the strategy of each is the best for
herself given her beliefs about the strategies of the others.
The way of thinking about Nash equilibrium has two advantages, 1) concept
of best response is no longer logically flawed- each player is choosing her best
response, not to the as yet unobserved actions of the others but only to her own
already formed beliefs about their actions. 2) Where we allow mixed strategies the
randomness in one player’s strategy may be better interpreted as uncertainty in the
others players’ beliefs about this players’ action.
Some games have a special property that one strategy is uniformly better than or
worse than another. The well-known game of the prisoners’ dilemma illustrates this.
Each player has to choose between confessing and not confessing- they both know
that no confession leaves them each with a 3-year jail sentence for involvement of the
kidnapping. Also if one of them confesses, he or she will get a short sentence of 1
year for cooperating with the police, while the other will go to jail for a minimum of
25 years. If both confess, they figure that they can negotiate for jail terms of 10 years
each.
 Suppose he believes she will confess: then his best choice will be to confess
(payoff of 10 instead of 25)
 Suppose he believes she will deny: then his best choice is also to confess (payoff
of only 1 instead of 3)
Thus in this game confess is a dominant strategy or the strategy deny is a
dominated strategy because regardless of what the husband thinks the wife will do
the strategy confess will be better for him.
Prisoner’s dilemma game has three essential features:
1. Each player has two strategies- to cooperate or defect
2. Each player has a dominant strategy
3. The dominance solution equilibrium is worse for both players than the nonequilibrium situation in which each plays the dominated strategy.
Both players follow conventional wisdom in choosing their dominant strategy but the
resulting equilibrium outcome yields them payoffs that are lower than they could have
achieved if they had each chosen their dominated strategy- the problem is how could
they guarantee that someone will not cheat.
In some games only one player has a dominant strategy. In this case the other
rational player can assume that she will use this dominant strategy and so choose her
equilibrium action accordingly.
In larger games, some of a player’s strategies may be dominated even though
no single strategy dominates all of the other. In this type of game they may be able to
reach equilibrium by removing dominated strategies from consideration as possible
choices. This reduces the size of the game and the “new” game may have another
dominated strategy, which can be reduced until no further ones are available. Or the
new game may have a dominant strategy for one of the players. This is known as
successive or iterated elimination. If this process ends in unique outcome, then the
game is said to be dominance solvable.
Column
Left
Middle
Right
Top
3,1
2,3
10,2
Row
High
4,5
3,0
6,4
Low
2,2
5,4
12,3
Bottom
5,6
4,5
9,7
The only dominated strategy for row is high which is dominated by bottom; column’s
left is dominated by right. We must note that we could not have eliminated column’s
left unless we eliminated row’s high because against row’s high, column would get 5
from left but only 5 from right. Then within the remaining set of strategies (top low
and bottom for row, idle and right for column), row’s top and bottom are both
dominated by low. When row is left with only low, column chooses his best responsemiddle. The game is hence dominance solvable and the outcome is low middle- being
the Nash equilibrium.
Weakly dominated and strictly dominated strategy:
Top
3,1
2,3
10,2
Low
2,2
5,4
12,3
Bottom
5,6
5,5
9,7
Low dominated top, but the dominance of low over bottom is less clear. The 2
strategies give row equal payoffs against column’s middle, although low does row a
higher payoff than bottom when plays against column’s right. So from row’s
perspective low weakly dominates bottom and low strictly dominates top.
Column
Row
Left
0,0
1,1
Up
Down
Right
1,1
1,1
For row up is weakly dominated by down; if column plays left then row gets the same
payoff from her 2 strategies. For column right weakly dominates left, dominance
solvability then tells us that (down, right) is a Nash Equilibrium. That is true but down
left and up right are also Nash equilibrium. When row is playing down column cannot
improve by switching to right and when column is playing left rows best response is
down. Same thing applies up right. Therefore if you use weak dominance to eliminate
some strategies it is a good idea to make cell by cell check to make sure you haven’t
missed any other equilibria.
Many simultaneous-move games have no dominant or dominated strategies.
Others may have one or several dominated strategies but iterated elimination of
dominated strategies will not yield a unique outcome. In such cases best response
analysis is the tool- so we find each player’s best response strategy depending on the
other player’s available strategies. Best-response analysis is a comprehensive way of
locating all possible Nash equilibria of a game. In the below figure: if column chooses
left row’s best response would be bottom, for middle best respone is low… all
highlighted in green. For column, the best responses are circled in yellow; if row
chooses top- column will choose middle… etc. there will be some games for which
best-response analysis does not find a Nash equilibrium just as dominance solvability
sometimes fails. But when best-response of a discrete strategy game does not find a
Nash equilibrium then the game has no equilibrium in pure strategies.
Row
Top
High
Low
Bottom
Left
3,1
4,5
2,2
5,6
Column
Middle
2,3
3,0
5,4
4,5
Right
10,2
6,4
12,3
9,7
For zero-sum games where strict conflict exists, the minimax method is used
and only works for this game; it relies on a thought process that accounts for the fact
that outcomes that are good for one player are by definition bad for the other. In this
method each player asks herself “ is this the best choice for me, even if the other
player found out that I was playing it?” Then she must consider her opponents best
response to her chosen strategy. But in zero sum games, that best response is the
worst one for her; i.e. in 0-sum games each player believes that her opponent will
choose an action that yields her the worst possible consequences of each of her own
actions. Then acting on those beliefs she should choose the action that leads to the
least bad outcome.
Suppose row wants the outcome to be a cell with as high number as possible, column
would therefore want the opposite. Row figures that for each of her rows column will
choose the column with the lowest number in that row. Therefore row should choose
the row that gives her the highest amongst these lowest numbers.- the maximum
among the minimum—the maximin. Similarly column knows that for each of her
column, row will choose the row with the highest number in that column so column
should choose the column with the smallest number among the largest ones- the
minimum among the maxima—the minimax.
Defense
Offence
Run
Short pass
Medium pass
Long pass
Run
Pass
Blitz
2
6
6
10
5
5.6
4.5
3
13
10.5
1
-2
Begin by finding the lowest number in each row- the offence’s worst payoff from
each strategy- and the highest number in each column- the defense’s worst payoff
from each strategy. The offense’s worst payoffs are highlighted in yellow, and
defense’s are in green. The next step is to find the best of each player’s worst possible
outcomes. The largest of the row minima is 5.6, so that is the offence’s maximin. The
smallest of the column maxima is 5.6 so that’s the defense’s minimax. Thus the
offence’s maximin strategy is its best response to the defense’s minimax and vice
versa. In cases where the minimax method fails to find equilibrium in zero-sum
games we conclude that the game has no Nash equilibrium in pure strategies.
Three player games: suppose Emily is contemplating the possible outcomes
of the street garden game, there will be six possibilities- she can choose to either
contribute or not when both Nina and Talia contribute, or when neither contribute or
when just one of them do. For her, it’s best to take advantage of her neighbors that
contribute while she doesn’t (payoff of 6). She enjoys a bigger garden if she also
contributes but at the cost of her own contribution (payoff of 5). At the other end of
the spectrum are the outcomes that arise when neither Nina nor Talia contribute; here
Emily would again prefer not to contribute (payoff of 2 instead of 1 if she contributes
while they don’t). In between these cases are the situations in which either Nina or
Talia contributes but not both. In this case Emily prefers not to contribute (4) rather
then to contribute (3) and gain a larger garden. Because Nina and Talia have the same
views on the costs and benefits of contribution and garden size, each of them order
the different outcomes in the same way- the worst being the one in which each
contributes while the other 2 don’t. To find the Nash equilibrium we need a
game table:
Talia chooses
Contribute
Don’t contribute
Nina
Emily
Contribute
Don’t
Contribute
Don’t
5,5,5
6,3,3
3,6,3
4,4,1
Nina
Emily
Contribute
Don’t
Contribute
Don’t
3, 3, 6
4,1,4
1,4,4
2,2,2,
The first test should be to determine whether there are dominant strategies for any of
the players. For Emily we compare the two rows of both pages of the table and when
Talia contributes Emily has a dominant strategy not to, and when Talia doesn’t
Emily’s dominant strategy is also no to. Thus it is best for Emily not to contribute
regardless of what the other two players choose to do. The same applies to Nina. For
Talia, we must compare cells across pages of the table- the top left in first page with
that in second and so on. This indicates that Talia also has a dominant strategy not to
contribute. This game is another example of a Prisoner’s dilemma- there is a unique
Nash equilibrium where they all receive payoff of 2 but another outcome exists in the
game where all three yield higher payoffs of 5. Even though it is better for them to
pitch in and build the garden no one has the individual incentive to do so.
The NASH equilibrium of the game can also be found using cell-by-cell inspection.
For example consider cell (3,3,6) when Emily considers changing her strategy she can
only move to the next row but within the same column and page; Nina can change her
strategy to another column but in the same row and page; finally Talia can change
only the page position. While Emily and Nina improve their strategy from 3 to 4,
Talia become worse off from 6 to 5, but since at least one can become better off by
changing their strategy the cell that we examined is not Nash equilibrium.
We can also use best response strategy. Emily’s best response is highlighted yellowNina green- and Talia pink. The cell at bottom right has all three best responses and
therefore it gives us the Nash equilibrium.
Some games have no pure strategy Nash equilibrium and an example is that of
a single point in a tennis match. Given that down the line passing is stronger than a
cross-court shot and Evert is more likely to win the point when Navratilova moves to
cover the wrong side of the court a reasonable set of payoffs can be worked out. Evert
is successful with DL 80% of the time Navratilova covers CC, and 50% of the time if
N covers DL. Similarly, E is successful with her CC 90% if N covers DL and 20% of
the time when N covers CC. clearly this is a zero sum game because the fraction of
time that N wins this tennis point is just the difference between 100% and the fraction
of time that E wins.
Evert
DL
CC
Navratilova
DL
50
90
CC
80
20
From the above table, the rules for solving simultaneous-move games tells us to
look first for dominant or dominated strategies and then try to minimax or use
cell by cell inspection to find the Nash equilibrium. No dominant strategy exist
here- going on cell by cell we start with the choice of DL for both players. From
that outcome, E can improve her success from 50 to 90 by choosing CC. but then
N can hold E down to 20 by choosing CC. after this E can raise her success again
to 80 by making her shot DL and N in turn can do better with DL. In every cell
one player always wants to change her play and we cycle through the table
endlessly. What is important games with no Nash, is not what players should do
but what they should not do. Thus in this case players should not act
systematically, because their opponent can figure out the pattern of their actions
and decrease the chances of their success. In this case E for instance should
randomize her actions and this is known as mixing strategies.
Download