Uploaded by Filip gt3

Refinements Lecture

advertisement
Week 2: Refinements
September 7, 2022
In this reader we introduce and discuss a few refinements of Nash equilirium for extensive
form games. We consider the notions of subgame perfect equilibrium (SPE), perfect Bayesian
equilbrium (PBE), and sequential equilibrium (SE). In the specific context of signaling games
we also discuss the intuitive criterion (IC).
We will not give formal definitions, but rather try to develop an intuition for these refinements
in specific examples, and explain how to compute them.
1 SPE and PBE
Consider the following two person game in extensive form.
1
I
A
1
8
1
1
B
C
2
[α]
l
0
0
II
L
[1 − α]
R
2
r
l
r
a
8
1
0
0
8
1
0
0
2
b
6
2
c
2
6
d
0
0
The dotted line indicates a non-singleton information set for player 2. Thus, a (pure) strategy
profile for this game is a vector ((X, Y, Z), (u, v, w)), where X ∈ {I, II}, Y ∈ {A, B, C}, Z ∈
{L, R}, u ∈ {l, r} v ∈ {a, b}, and w ∈ {c, d}.
1
Refinements
2
NASH EQUILIBIUM.
The profile ((X, Y, Z), (u, v, w)) is a Nash equilibrium of this game when
(X, Y, Z) is a best response for player 1 against (u, v, w), and conversely (u, v, w) is a best
response for player 2 against (X, Y, Z).
Claim 1.
Exercise 1.
The profile ((I, A, L), (l, a, d)) is a Nash equilibrium.
Prove claim 1.
SUBGAME PERFECTION.
A subgame of an extensive form game is a part of the game tree
that starts in a singleton information set, and does not have an information set that contains
decision nodes both within and outside the subgame. For example, the above game tree has
5 subgames, namely the game itself (starting at the root), the subgame starting at the node
following action I, the subgame starting at the node following action II, the subgame starting
at the node following action L, and the subgame starting at the node following action R.
Note that for example the subtree starting at the node following action B is not a subgame,
because that node is in the same information set as the node following action C, which is a
node outside the subtree under discussion.
The profile ((X, Y, Z), (u, v, w)) is a subgame perfect equilibrium (an SPE) of the above game
when, for each subgame, the restriction of the profile to that subgame is a Nash equilibrium
of that subgame.
Exercise 2.
Claim 2.
Exercise 3.
1
Show that the Nash equilibrium ((I, A, L), (l, a, d)) is not subgame perfect.
The profile ((II, A, L), (l, b, c)) is an SPE.
Prove claim 2.
PERFECT BAYESIAN EQUILIBRIUM.
Given a general extensive form game, consider a pair
(σ, β), where σ is a profile of mixed behavioral strategies in an extensive form game, and β is
a profile of probability distributions over information sets. Such a pair (σ, β) is often called an
assessment, and the β component of an assessment is usually called a belief.
An assessment (σ, β) is a perfect Bayesian equilibrium (a PBE) if
[1] Given the profile σ, the belief β is consistent with Bayesian updating, and
[2] given the belief β, the profile σ maximizes expected payoffs at each information set.
A few remarks are in order.
1 Just to clarify, in this example, the strategy profile (Z, (v, w)) is the restriction of the profile
((X, Y, Z), (u, v, w)) to the subgame that starts after action II.
Refinements
3
[a] The first requirement is called consistency (CONS), while the second requirement is called
sequential rationality (SR).
[b] Note that requirement [1] (Bayesian updating) can only be applied to those information
sets that are reached with positive probability (in some subgame). In those information
sets in which [1] cannot be applied, the choice for β is free, as long as it is in line with
requirement [2]. We say that, in those cases, CONS is automatically satisfied.
[c] Formally, a PBE is an assessment (σ, β), consisting of a strategy profile and a belief.
However, often only the strategy component σ of the assessment is called a PBE as well
in that case, and the supporting belief is omitted.
We argue that the SPE ((II, A, L), (l, b, c)) in the above extensive form game is not a PBE.
Note that the non-singleton information set for player 2 is reached with probability zero, even
in the subgame following action I (since player 1 chooses A at the root of that subgame). So,
CONS is automatically satisfied.
So, take any belief [α, 1−α] at that information set. We check SR. The best response of player 2
given this belief is to play r, with an expected payoff of 1 (versus an expected payoff of 0 for the
action l). Thus, the choice of l in the strategy (l, b, c) of player 2 does not maximize expected
payoff given the belief [α, 1 − α], and SR does not hold. Hence, the SPE ((II, A, L), (l, b, c)) is
not a PBE.
Claim 3.
Exercise 4.
For any λ ∈ [0, 1], the profile ((I, λB + (1 − λC, L), (r, b, c)) is a PBE.
Prove claim 3.
Refinements
Exercise 5.
4
(SPE) Consider the following game.
1
a
b
2
c
5
4
d
1
e
f
g
e
C
1
2
5
4
f
g
C
1
2
7
2
1
9
7
2
1
2
5
4
1
2
3
6
8
3
3
6
(a) Find the unique SPE of this game.
(b) Now revise the game by eliminating player 1’s move “e” at his second information set. Find
the unique SPE of the revised game.
Refinements
5
2 PBE and SE
Consider the following two-person game in extensive form. The C indicates a chance move.
C
2
3
2
2
1−p
1
3
I
I
p
1−q
2
8
q
II
[α]
[1 − α]
z
1−z
z
1−z
3
1
0
2
3
2
0
1
We first compute all perfect Bayesian equilibria in behavioral strategies
((p, 1 − p), (q, 1 − q), (z, 1 − z))
for this game. We also compute the corresponding beliefs [α, 1 − α] for player II for each PBE.
Notice that for player 2, with belief [α, 1 − α],
l⪰r
A.
⇔
2−α≥1+α
⇔
α ≤ 12 .
If α < 21 . Then z = 1 by SR. Then, the unique best response of player 1 is p = q = 1.
Then however by CONS,
·p
2
1
= > .
1
3
2
·p+ 3 ·q
So, in this case we do not find any PBE.
α=
B.
2
3
2
3
If α > 21 . Then z = 0 by SR. Then p = q = 0. So, the info set of player 2 is reached with
zero probability, so CONS is automatic.
C.
If α = 12 . If p = q = 0. Then the info set is not reached, and CONS is automatic. Also,
any z ∈ [0, 1] satisfies SR. For p = q = 0 to be a best response, we need that 2 ≥ 3z, which
yields z ≤ 23 . All these profiles are PBE.
If however the info set is reached with strictly positive probability. Then by CONS,
1
2
=α=
2
3 ·p
2
1
3 ·p+ 3 ·q
,
Refinements
6
which can be rewritten to 2p = q. So, 0 < p < 1. Then player 1 mixes between playing In and
Out in the left decision node. Then necessarily 2 = 3z, so that z = 32 .
In total we find 3 line segments of PBE.
p
0
0
λ
q
0
0
2λ
z
0
z
2
3
Range parameters
1
2 ≤α ≤1
0 ≤ z ≤ 23
0 ≤ λ ≤ 12
α
α
1
2
1
2
Next, we compute a refinement of PBE, namely sequential equilibrium (SE) for this game. An
assessment (σ, β) is a sequential equilibrium (an SE) if
[1] there exists a completely mixed sequence (σ k )k∈N of strategy profiles with σ k → σ as
k → ∞, and β k → β as k → ∞ for the beliefs β k induced by σ k via Bayesian updating.
[2] given the belief β, the profile σ maximizes expected payoffs at each information set.
Again, a few remarks are in order.
[1] Since each σ k is assumed to be a completely mixed strategy profile, the belief β k is
uniquely defined since all information sets are reached with strictly positive probability.
[2] Note that [1] in fact is a strong version of CONS (confusingly usually also called CONS).
Thus, SE is indeed a refinement of PBE.
For
1
2
≤ α ≤ 1, we check that the PBE (p, q, z, α) = (0, 0, 0, α) is an SE. First we consider the
case where α < 1. Take a completely mixed sequence (pk , q k , z k ) that converges to (p, q, z).
We try to achieve that αk = α for all k. Then, by CONS, necessarily
α=
k
2
3 ·p
2
1 k
k
3 ·p + 3 ·q
,
so that α · q k = (2 − 2α) · pk . Choose
(pk , q k , z k ) =
(α
k,
2−2α 1
k , k
)
.
The indeed this sequence is completely mixed. Moreover,
2
3
·
2
3
pk
· pk
2pk
α
=
=
= α,
1
k
k
k
2p
+
q
2α
+
2 − 2α
+ 3 ·q
so that
(pk , q k , z k , αk ) → (p, q, z, α)
Refinements
7
as k → ∞. So, indeed, the strong form of CONS is satisfied. Finally, SR is immediate, because
we started with a PBE.
For the case α = 1, we need to be a bit more careful, to guarantee that the sequence is
completely mixed. Take for example αk =
k
k+1 .
we get that
(pk , q k , z k ) =
(
Then, using the same computations as above,
1
k
2
k(k+1) , k , k+1
)
.
We can now check that all requirements are indeed satisfied. In the same way, we can check
that all PBE for this game are in fact SE.
The following example shows that a perfect Bayesian equilibrium does not have to be sequential.
2
a
0
I
0
b
c
0
2
0
[1 − α]
d
II
[α]
d
II
e
0
3
0
e
[β]
f
0
2
0
[1 − β]
III
III
g
g
0
0
1
f
1
0
1
0
0
0
The strategy profile (a, e, f ) with beliefs α = 1 and β = 0 is a PBE. However, this assessment
is not an SE, because in every SE we need to have that α = β.
Exercise 6.
(PBE and SE) Consider the following two-person game in extensive form. The
C indicates a chance move.
Refinements
8
C
2
3
2
1
1−p
1
3
I
I
p
1−q
2
1
q
II
[α]
[1 − α]
z
1−z
z
1−z
5
3
0
2
4
2
0
3
(a) Compute all perfect Bayesian equilibria in behavioral strategies
((p, 1 − p), (q, 1 − q), (z, 1 − z))
for this game. Don’t forget to report the corresponding beliefs [α, 1 − α] for player II for
each PBE.
(b) Compute all sequential equilibria of this game. Don’t forget to report the completely mixed
strategy profiles for each equilibrium.
Refinements
9
Exercise 7.
(SE) Consider the following two-person game in extensive form. The C indicates
a chance move.
C
2
3
5
6
1−p
1
3
I
I
p
1−q
2
6
q
II
[α]
[1 − α]
z
1−z
z
1−z
9
5
0
3
9
0
0
3
(a) Compute all sequential equilibria in behavioral strategies
((p, 1 − p), (q, 1 − q), (z, 1 − z))
for this game.
(b) Compute all sequential equilibria of the game that results when the payoffs (2, 6) when
player I opts out in the right hand branch are replaced by (7, 6).
Refinements
10
3 PBE in signaling games
Consider the following signaling game we analysed last week.
u−1, 0
0, 1 u
r
1−r
0, 0 u
u
[α]
u
L
t
u
p
d
r
1−r
u
0, 1
1−p
uChance
u
d
u
[1 − α]
q
L
u
t′
s
1−s
u 1, 1
II
u 1, 1
.5
u
d
[β]
.5
II
2, 0 u
R
u
u
1−q
u
R [1 − β]
d
s
1−s
u 1, 0
We found that the pure strategy profiles (p, q, r, s) = (1, 1, 1, 1) and (p, q, r, s) = (0, 0, 0, 0) are
Nash equilibria in behavioral strategies. We check whether these profiles are PBE.
A.
For (p, q, r, s) = (1, 1, 1, 1). Consider the left information set. Since player 1 moves to the
left in both the upper and the lower node, each decision node for player 2 in the left info set is
reached with probability 12 . So, by CONS, α = 12 . Then, since the left info set is reached with
strictly positive probability (and hence the belief in the left information set is identical to the
probabilities generated by the strategy of player 1), SR is automatically satisfied.
Now consider the right info set. Since this info set is reached with zero probability, Bayesian
updating does not apply, and CONS is automatically satisfied. We check SR. Given belief
[β, 1 − β], the expected payoff of u is 1 − β, and the expected payoff of d is β. Thus,
u⪰d
⇔
1−β ≥β
⇔
β ≤ 12 .
It follows that any
(p, q, r, s, α, β) = (1, 1, 1, 1, 12 , β) with 0 ≤ β ≤
1
2
is a PBE.
B.
For (p, q, r, s) = (0, 0, 0, 0). A similar computation yields that any
(p, q, r, s, α, β) = (0, 0, 0, 0, α, 12 ) with 0 ≤ α ≤
is a PBE.
1
2
Refinements
Exercise 8.
Exercise 9.
11
Verify the claim in part B above.
(PBE) For the signaling games in Exercises 1 and 2 of the previous week,
compute all pure strategy perfect Bayesian equilibria of those games.
Download