Document 13878524

advertisement
DEFINABLE AND CONTRACTIBLE CONTRACTS
MICHAEL PETERS AND BALÁZS SZENTES
Abstrat.
This paper analyzes Bayesian normal form games in whih players write ontrats
that ondition their ations on the ontrats of the other players. These ontrats are required
to be representable in a formal language. This is aomplished by onstruting ontrats whih
are denable funtions of the Godel ode of every other player's ontrat. We provide a omplete
haraterization of the set of alloations supportable as pure strategy Bayesian equilibrium of this
ontrating game. When information is omplete, this haraterization provides a folk theorem.
In general, the set of supportable alloations is smaller than the set supportable by a entralized
mehanism designer.
1.
Self Referential Strategies and Reiproity in Stati Games
In this paper we haraterize the alloation rules attainable by players in a Bayesian game when
they have the ability to ommit themselves by writing ontrats that ondition their ommitments
on other players' ontrats.
The idea that ontrats might ondition on other ontrats is not new in eonomis. The best
known expression of this idea is well known in the industrial organization literature (e.g. [11℄) as
the 'meet the ompetition' lause in whih one rm ommits itself to lower its prie when any of its
ompetitors does. A similar idea appears in trade theory as the priniple of
reiproity
([2℄). This
takes the form of trade agreements like GATT that require ountries to math tari uts by other
ountries. Finally, tax treaties sometimes have this avor - for example, out of state residents who
work in Pennsylvania are exempt from Pennsylvania tax as long as they live in a state that has a
'reiproal' agreement that exempts out of state residents (presumably from Pennsylvania) from
1
state taxes.
All of these approahes are used to support ooperative outomes in stati games. We extend this
approah to games with inomplete information. We provide a full haraterization of alloations
supportable as ontrat equilibrium. In partiular, we show the limits of the 'ontrats on ontrats'
approah by providing alloations supportable by a mehanism designer whih annot be supported
as equilibrium with ontratible ontrats. We also use our Theorem to provide something that
looks like a folk theorem for a restrited environment.
The diulty with extending the older literature is that the oneptual and tehnial tools
developed there an only be used in any but the simplest problems. The meet the ompetition
Version - January 22, 2009.
1http://www.revenue.state.pa.us/revenue/wp/view.asp?A=238&Q=244681
1
2
MICHAEL PETERS AND BALÁZS SZENTES
argument, for example, is extremely stylized. The Stakleberg leader, all it rm A, oers to sell at
a very high prie provided its ompetitor, rm B, also oers that high prie in the seond round. If
B
in the seond round oers any prie below the highest prie,
ost. If
B
A ommits itself
to sell at marginal
believes this ommitment, then one best reply is to set the highest prie.
If the rms move simultaneously, then the logi of the argument beomes louded.
ertainly write a ontrat that ommits it to a high prie if
suppose that
to
A's
B
A
ould
sets the same high prie. However
B 's strategy is simply to set this high prie and that for some reason this is a best reply
ontrat. Then
outome, rm
B
A
should deviate and simply underut rm
would have to oer a ontrat similar to
naive argument would suggest that
B
A's
B.
To support the high prie
in order to prevent
should simply oer the same ontrat as
A,
A's
deviation. A
a high prie if
A
sets a high prie, and marginal ost otherwise. Casually, two outomes seem onsistent with these
ontrats - both rms prie at marginal ost or both rms set the high prie. This seems to violate
a fairly fundamental property of game theory whih is that for eah pair of ations (ontrats
in this ase), there is a unique payo to every player.
atually say what
A
would do if
B
2
More to the point,
A's
ontrat doesn't
oers a ontrat that promises to set a high prie unless
A
sets
a lower prie, et. The speiation of the problem itself seems to be ambiguous about payos.
Generally ontrats that reat to
ations
of other players simply don't make sense. They may
not lead to unambiguous outomes as in the example above. More generally, it is possible that
suh ontrats are simply ontraditory. For example, two rms might write ontrat that ommit
both of them to set a prie that is stritly lower than the other rms prie (or two eonomists
demand ontrats that guarantee that they will both earn more money than anyone else in the
department). To resolve ambiguities and ontraditions in suh ontrat, an outside mediator is
needed to hoose an outome.
This defeats the purpose of using ontrats to deentralize the
underlying alloation problem.
The reiproal tax agreement problem is better behaved, and provides the basis for the argument
we extend below. State
residents of state
A
A wants to exempt residents of state B from state taxes provided B exempts
from taxes. To write the law
'reiproal' agreement with state
A.
A
exempts residents from any state that has a
The question is what exatly is a 'reiproal' agreement. It
is lear enough what the intention is - reate a situation in whih both states take the mutually
beneial ation of exempting one another in a way that eliminates any inentive for either of them
to deviate. As mentioned above, it isn't enough to assume that state
residents of state
State
2
B
A
from tax beause
A
B
unonditionally exempts
would not longer have any inentive to exempt state
has to have a law like the law in state
A,
B.
in other words, a reiproal agreement.
One paper that allows multiple payos to be assoiated with eah array of ations is [12℄ who use this approah
to support equilibrium when it might not otherwise exist.
DEFINABLE AND CONTRACTIBLE CONTRACTS
3
It seems that to resolve this kind of problem one needs to dene the term 'reiproal ontrat'
as follows:
reiproal ontrat
≡

exempt
don't
if the other state oers a reiproal ontrat,
otherwise
This kind of denition is familiar from the Bellman equation in dynami programming where the
value funtion is dened in a self referential way. It is tempting to model this in the following naive
way: start by dening a olletion of ontrats that seem eonomially sensible. For example, it is
reasonable that a state ould write a ontrat that simply xes any tax rate independent of what
the other states do. Let
C
be the set of ontrats that simply x some unonditional tax rate.
Append to this set of feasible ontrats the reiproal ontrat, all it
the set of feasible ontrats as
C ∪ {r}.
r,
dened above. Now model
The reiproal ontrat above is just
r,
while 'otherwise'
means any ontrat with a xed tax rate. Dene a normal form game in whih the strategies are
C ∪ {r}
and delare the outome if both states oer
r
to be
(exempt, exempt).
Then there is an
equilibrium in whih the states mutually exempt (assuming they jointly want to).
We would argue that this is unsatisfatory for a number of reasons. First, it is undesirable to
restrit the set of feasible ontrats in order to support the outome you are looking for.
approah desribed above amounts to little more than saying that
then laiming it is an equilibrium for both states to oer
r.
r
The
is the only feasible ontrat,
A more satisfatory approah is
to dene a set of ations that seem eonomially meaningful, then to allow the broadest set of
ontrats possible. In the same manner that the value funtion emerges endogenously from the
eonomi environment, the reiproal ontrat should be derived from eonomi fundamentals.
Seond, the approah desribed above misses the essene of reiproity whih is the innite
regress involved in self referential objets. A ontrat that makes formal sense is the following:
C
=

exempt
don't
if other State exempts any State who exempts any State who exempts. . .
otherwise
where the statement in the top line is repeated ad innitum.
Arguably, the ontrat
C
is a
reiproal ontrat sine it would exempt any State oering a reiproal ontrat. Yet it simply
isn't feasible under the naive desription given above.
Finally, the ad ho approah desribed above simply isn't exible enough to handle omplex
environments and inomplete information. For example, making the game asymmetri requires ad
ho extension of the approah above. If State
A
is supposed to exempt, while state
B
is supposed
to take some other ation, say 'partly exempt', then to support the right outome, the ontrats
should look something like the following:
reiproal ontratA
≡

exempt
don't
if other State oers reiproal ontratB
exempt otherwise
4
MICHAEL PETERS AND BALÁZS SZENTES
and
reiproal ontratB
≡

partially
don't
exempt if other State oers reiproal ontratA
exempt otherwise
Now the ontrats are not diretly self referential, as is the Bellman equation, instead they are ross
referential. A single self referential or reiproal ontrat simply doesn't go far enough. Furthermore, the ontrats above dene only a single ooperative ation, and use a blanket punishment
for deviations. Desirable or interesting equilibrium alloations may not look like this. For example,
in a general Bayesian game, the most desirable ooperative ation for both players might depend
on information that only one of them has. So the ation that State A wants to take might depend
on the ontrat that B oers. Alternatively, the most eetive punishment for A to impose on B
might depend on ations that other states are taking.
As the number of possibilities inreases,
so does the number of speial words we need to add to our ontrating language to support the
outomes we want.
Our approah avoids these problems. We x a language and require ontrats to be written in
this language. We then show this language already ontains all the speial terms like 'reiproal
ontrat' that we need, even in very rih eonomi environments where simple notions like 'ooperation' do not adequately desribe the alloations we are interested in. The ontrating language
that we desribe is universal in this sense.
It is universal in a seond way.
Allowing ontrats to speify ations that depend on other
ontrats means that ations might depend on whether other players' ontrats depend on the
way you make your ation depend on their ontrats, the way you make your ation depend
on how their ontrats depend on the way you make your ontrat depend on their ontrats,
and so on.
In simple prisoner's dilemma problems like the tax problem disussed above, this
problem is relatively straightforward sine 'dependene' simply means whether or not the other
player ooperates. However, in riher environments, 'dependene' is more subtle sine there are
many dierent ways that players an ondition their ations at eah round in the hierarhy of
dependenies desribed above. The method we desribe below provides a ompat way of dealing
with this.
Finally, the Bellman equation style representation of a reiproal ontrat illustrates that the
notion of reiproity depends on the ontrating environment beause the word 'ooperate' appears
in the denition of a reiproal ontrat. It isn't obvious how to extend the argument to problems
where a single 'ooperative' ation doesn't exist. The set of ontrats that we use, on the other
hand, is independent of the underlying game that is being played. Contrats need to map into
feasible ations, but the way that these ations depend on other ontrats doesn't depend on what
these ations are. Nor does it depend on whether or not players have private information. In this
sense, our ontrats are
universal
in the sense that they an be applied to any strategi situation.
DEFINABLE AND CONTRACTIBLE CONTRACTS
1.0.1.
How Denability Works.
5
Return again to the main purpose of denability. Instead of re-
ating speial terms like reiproal ontrat in an ad ho way to support ooperative outomes in
speial situations, we want to provide a ontrating environment in whih we an show that the
speial terms we need to write the ontrats that players need to enfore their ollusive agreement
will always exist within the language. We do it here to illustrate the method for the very simple
ase, then generalize the approah in the setions below.
Suppose there are
number of ations.
m
players in a normal form game in whih eah player has a ountable
Endow players with a formal language ontaining a ountable number of
words or haraters that they an use to write ontrats. Feasible ontrats are nite sequenes of
haraters in this formal language. As we mentioned above, the set nite subsets of a ountable
set is ountable, so there are bijetions mapping eah nite text into
alled the
Godel Coding
N.
. Provided the language inludes all the natural numbers and the usual
arithmeti operations, it is possible for players to write ontrats that are
NN −1
One suh a mapping is
denable
funtions from
into that player's ation spae. Sine denable funtions an be written as nite sequenes of
haraters in the language, they have Godel odes assoiated with them. Hene we ould interpret
the denable funtions as ontrats that make the players ation depend on the Godel ode of the
other player's ontrat.
To make the argument easier to relate to onventional ontrat theory, we assume below that
the ontrat spae for eah player is the set of denable funtions from
of the player's ation spaes
.
and onversely if the integer
NN −1
into the subsets
Every denable funtion an be assoiated with a unique integer,
n
is assoiated with a denable funtion, then it is assoiated with
a unique text. Now for eah array of funtions hosen by the players, ompute the Godel Code
of eah suh funtion. Fit the odes of the other players' strategies into eah player's strategy to
determine a unique subset of ations for every player. Then, players simultaneously take ations
from these subsets.
Our objetive is to try to haraterize the set of equilibria of this game. To see how it works,
we might as well restrit attention to a two player prisoner's dilemma. As we illustrated above, we
don't really need our formalism to understand this game. However, it provides a simple illustration
of the methods we use in the general ase. Call the players
the usual payo struture in whih
o if they both play
funtion from
N
to
C
D
strategy that hooses ation
[c]
D.
D
1
is
as the 'enoding' of
c1 ([c2 ])
and the ations
C
and
D
with
A strategy
c
for a player is a denable
no matter what the Godel ode of the other player's strategy.
c.
[c]
denote the Godel ode of the strategy
c
and
Sine the Godel oding is an injetion from the set of denable
strategies to the set of integers. For any pair of strategies
player
2,
One obvious equilibrium of this game ours when both players use a
Every denable funtion has a Godel ode. Let
refer to
and
is a dominant strategy and both players are stritly better
than they are if they both play
{C, D}.
1
c1
and
c2 ,
the ation (C or
D)
taken by
and similarly for player 2. Sine every pair of ations determines a payo, this
proedure assoiates a unique payo with every pair of strategies.
6
MICHAEL PETERS AND BALÁZS SZENTES
There are many things that aren't denable strategies that also have Godel odes. We want to
make use of some of these other things. In partiular, we want to use denable strategies with
variables.
free
For example, there is a sublass of denable strategies for player 1 dened parametrially
by
γ x (n) =

C
n = x,
D
otherwise.
free variable x
This is simply a denable strategy with a
, where
x
is the target ode of the other
player's strategy that will trigger the ooperative ation. Denable strategies with free variables
are also denable, and so they too have Godel odes. The strategy with free variable that we want
is a slight modiation of the one above, in partiular
cx (n) =
(1.1)
The mapping
< x >(x) is
a text with one free variable, then
n.
Hene, if
n
i
h
n = hxi(x) ,
D
otherwise.
the omposition of two funtions. First, the funtion
operation to the Godel oding. That is,
to be

C
φ
(n)
<n>
is the text whose Godel ode is
hxi
n.
is the inverse
Seond, if
φ
is
is the same text where the value of the free variable is set
is a Godel ode of a denable strategy with one free variable, then
is itself a denable strategy (without a free variable).
h
i
(n)
hni
this denable strategy happens to be. Notie that in this ase,
< n >(n)
is just the Godel ode of whatever
[hxi (x)]
won't be equal to
x sine a
denable strategy must have a dierent Godel ode from a denable strategy with one free variable
beause of the fat that the Godel oding is injetive.
We want to dene a strategy by xing a value for
are interested in is
[cx ].
Sine
[cx ]
x
in (1.1). In partiular, the value of
x
we
is the Godel ode of a strategy with a free variable, the right
hand side of (1.1) requires that we deode
[cx ]
Putting all this together gives
c[cx ] (n) =
So
c[cx ] ([c2 ]) =
to get

C
cx ,
then x
x
at
[cx ]
to get the ontrat
c[cx ] .
n = c[cx ]
D
otherwise

C
[c2 ] = c[cx ]
D
otherwise
is a the 'reiproal' or self-referential ontrat mentioned above. Now we simply need to verify
what happens when both players use strategy
If player 2 uses strategy
by player 1.
c[cx ] ,
then
c[cx ] .
[c2 ] = c[cx ]
, whih evidently triggers the ooperative ation
The same argument applies for player 2.
denable strategy
c′
Player 2 an deviate to any alternative
that she likes. Sine every denable strategy has a Godel ode, the reation of
player 1, and onsequently both players payos are well dened. As the Godel oding is injetive,
DEFINABLE AND CONTRACTIBLE CONTRACTS
c′ 6= c[cx ]
implies the Godel ode of
respond by swithing from
C
to
c′
is not equal to
D.
c[cx ]
7
, and the deviation by 2 indues 1 to
Notie that this argument makes use of an enoding of the strategy with free variable
isn't a denable strategy.
cx ,
whih
One might have expeted the target ode number to be assoiated
with a strategy instead of a strategy with a free variable. For example, it seems that to enfore
ooperation there needs to be a denable strategy
c∗ =
Of ourse, for arbitrary
n∗

C
c∗
with enoding
[c∗ ] = n∗
suh that
[c2 ] = n∗
D
otherwise
[cn∗ ] = n∗ .
it will be false that
This leads to a xed point problem
that, in fat, does not have a solution in general. More generally, one ould try to onstrut a
self-referential ontrat by nding a xed point of the the following problem. For eah
cn ([c2 ]) =
g
where

C
D
if
[c2 ] = g (n) ,
n∗
suh that
[cn∗ ] = g (n∗ ),
a self-referential ontrat. Indeed, what we did above is that we hose
showed that
n = [cx ]
reiproal ontrat
≡
and its reursive ounterpart
C
=
don't
c[cx ]
to be
cn∗
is obviously
[< n >(n) ]
and

exempt
don't
cx
works, reall the reiproal tax agreement
other State oers reiproal ontrat
exempt
otherwise
if other State exempts any State who exempts any State who exempts. . .
otherwise
The 'reiproal ontrat' is
g (n)
then
is a orresponding xed point.
To see how the strategy with free variable

exempt
onsider
otherwise,
is a denable funtion. If there exists an
∗
n,
c[cx ]
and the statement other state oers reiproal ontrat is
[c2 ] =
.
State
A
wants to exempt any state whose law fullls a ondition. For example, if the ondition
it is looking for is that the other state simply exempts State S, then it would ompute the Godel
ode
n0 = [C∀n]
then use the strategy
cn0 =

C
D
[c2 ] = n0
otherwise
If it does that, then it an't be an equilibrium as explained above.
So what it needs to do is
to exempt any State whose law fullls a ondition that exempts any state whose law fullls a
ondition. For example, if it wanted to exempt State
A
if and only if State
and so on.
A
B
unonditionally exempts state
if and only if State
B,
B 's
law exempts state
then it would adopt the strategy
c[cn ] ,
0
8
MICHAEL PETERS AND BALÁZS SZENTES
cx omes
i
h
(x)
,
n = hxi
This is where the partiular struture of the ontrat
cx (n) =

C
D
into play. Reall that
otherwise.
It speies exemption if and only if a ondition is fullled, but it doesn't seem to speify what the
ondition is. However, it does require that whatever the ondition
x
is, if
x
in turn depends on
a ondition, then the ondition that it depends on must be the same as the ondition itself. To
see if
x
depends on a ondition, we rst deode it and nd the statement
hxi
that the integer
orresponds to. Then if it depends on some ondition, we require that that ondition be
whih is the meaning of
exempts state
B 's
B
law must be
(x)
hxi
. So now we an do the innite regress. State
if and only if the Godel ode of State
c[cx ] ,
or that
i.e., the same ondition that
B
A
exempt
A
B 's
law is
c[cx ] .
A
itself,
adopts a law that
This means that state
if and only if the Godel ode of State
requires.
x
x
A's
law is
c[cx ]
,
Every olletion of denable ontrats uniquely determines a set of ommitments for eah of the
players. Any sensible desription of the set of feasible ontrats should unambiguously determine
players' ommitments in this way. We aomplish this by making the ontrats arithmeti. The
set of denable funtions is the largest set of arithmeti funtions that an be desribed using
a nite text in a rst order language.
In this sense, the lass of ontrats that we desribe is
universal in that any 'sensible' model of ontrats on ontrats should involve a ontrat spae
that is embedded in the one we desribe.
2.
Literature
As we mentioned in the introdution, our paper is not the rst to show how ontratual devies
an be used to support ooperative play.
Muh of the literature in this area follows an idea
developed in [5℄ in whih ations are delegated to an agent who is given the appropriate inentives
to arry out ations that might not otherwise be part of a non-ooperative equilibrium. This idea
was developed by [8℄ who used it to prove a 'folk theorem' for a very speialized environment.
The idea that the agent might be used to report deviations, thereby allowing prinipals to ommit
themselves to punish a deviator, is developed in [4℄.
This idea provides the basis for the menu
theorems in ommon ageny, like [9℄, [10℄ and [6℄ whih illustrate how ooperative outomes an
be supported by having the agent trigger punishments. Reently [14℄ provides a very general folk
theorem for multiple ageny games in whih prinipals an ommit to follow the reommendations
of (potentially interested) agents.
Though this literature shows how ontratual devies an be used to support ooperative behavior, it relies on the delegation of deision making power to an agent. In this paper, there is
no agent, beause ontrats depend diretly on one another. This approah is losely related to
ideas in the omputer siene literature. One paper that uses this approah is [13℄. He has players
writing programs that determine their ations. Using an idea due to von Neumann, he allows these
DEFINABLE AND CONTRACTIBLE CONTRACTS
9
programs to use other programs as data, whih has the eet of making the output of eah player's
program depend on the other players' programs. The result extends the reiproal ontrat idea
presented above to a more general
n
player game. We have illustrated the basi priniple with our
'ross-referential' example above. To support any array of ations, Tennenholz eetively writes
out expliitly a sequene of programming statements resembling the verbal statements we made
above. These dene the keywords that are needed to support any array of ations in whih eah
player's payo is at least his or her minmax value.
The paper by [7℄ shows how to extend the set of supportable alloations in two player games.
They onstrut a set of ommitment devies whih an be used to support
orrelated
strategies in
whih all players payos exeed their minmax payos. Speially, in some games their devies
support outomes in whih all players reeive payos that exeed their best payos with Tennenholz's programs. This is aomplished by onstruting ommitment devies that allow players to
orrelated their ations while using independent randomizing devies.
On the most basi level, our paper diers sine we are interested in problems with inomplete
information. However, the more important dierene is that our approah is dedutive rather than
onstrutive. We x a set of ommitment devies, then use this same set of devies to support
individually rational outomes in all nite games. The advantage of this is that we are able to
give a omplete haraterization of the set of all supportable alloations. Tennenholz theorem does
not rule out, for example, the possibility that there might be program equilibrium in whih some
players reeive less than their minmax payo. With omplete information, this possibility is not
ritial. For example, [7℄ rule it out by allowing players to reserve the right to pik their ations
in an unonstrained way ex post. They interpret this as giving players the right not to partiipate
in the ontrating proess. With inomplete information, there is no simple analog to the minmax
payo, so there is no simple trik like non-partiipation that an be used to show whih alloations
annot
be supported as equilibria. By providing a more omplete desription of the set of feasible
ontrats we are able to overome this diulty.
It is preisely the ability to pin down alloations that annot be supported as equilibrium that
is ritial to our objetive of showing the limits to whih ontrats an be used to deentralize
the mehanism designer's problem. We use our haraterization to onstrut examples of alloation rules that an be supported by a mehanism designer, but annot be supported as ontrat
equilibrium.
We emphasize that the ontribution here is not intended to be a ontribution to the omputer
siene literature.
In fat, we view the paper as a very traditional ontrating model in whih
players have aess to a legal system whih an be used to provide redress when ontrats are
not arried out. Yet redress is all we want. Our purpose is to dene a ontrating language suh
that players an write any ontrat that they like in this language.
One all the players have
written their ontrats, they should be able to dedue on their own what ations they need to
take in order to fulll their ontrats. With omplete information, it isn't ompletely surprising
10
MICHAEL PETERS AND BALÁZS SZENTES
that many alloations an be supported in suh an environment. It is in inomplete information
environments where the set of supportable alloations has not been haraterized, and this is how
we view our ontribution here.
Finally, the use the Godel oding is simply for onveniene. One one sees that the olletion
of all nite texts onstitutes a ountable set, any denable bijetion from nite texts to integers
an be used to do the analysis we do. Any denable bijetion an be expliitly written into the
ontrats we allow, so that a judge (or player for that matter) who doesn't know what it is an
expliitly alulate it.
3.
The Language and the Gödel Coding
We onsider a formal language, whih is suiently rih to allow its user to state propositions
in arithmeti. Furthermore, the set of statements in this language is losed under the nite appliations of the Boolean operations:
q, ∨,
and
∧.
This implies that one an express, for example,
the following statement:
∀n, x, y, z {[(n ≥ 3) ∨ (x 6= 0) ∨ (y 6= 0) ∨ (z 6= 0)] → (xn + y n 6= z n )]} .
In addition, one an also express statements in the language that involve any nite number of free
variables. For example, x is a prime number is a statement in the language. The symbol
a free variable in the statement.
x
< 4.
Let
L
is
Another example for a prediate that has one free variable is
One an substitute any integer into
x = 0, 1, 2, 3
partiular one is true if
x
x
and then the prediate is either true or false. This
and false otherwise.
be the set of all formulas of the formal language. Eah of its element is a nite string
of symbols. It is well known that one an onstrut a one-to-one funtion
value of this funtion at
ϕ ∈ L,
and all it the Gödel Code of the text
L → N.
Let
[ϕ]
be the
ϕ.
In what follows, we dene a lass of funtions whih an be represented represented by nitely
many haraters in our formal language.
Denition 1.
φ
in
k+1
The funtion
f : Nk → 2 N
free variables suh that
In the denition, the mapping
singleton for all
we refer to
[φ]
Example.
n∈N
k
, then
f
is said to be
b ∈ f (a1 , ..., ak )
f
denable
if and only if
is a orrespondene from
is a funtion. If the funtion
as the Godel enoding of
f.
(
f
φ (a1 , ..., ak , b)
Nk
to
N.
is true.
Of ourse, if
f (n)
is denable by the prediate
φ
is a
then
We illustrate the previous denition with an example.
Consider the following funtion dened on
f (a) =
if there exists a rst-order prediate
N:
0
if
a
is an even number,
1
if
a
is an odd number.
We show that this funtion is denable by onstruting the orresponding prediate
φ (x, y) ≡ {{y = 1} ∧ {y = 0}} ∨ {∃z : 2z = y + x} .
φ.
DEFINABLE AND CONTRACTIBLE CONTRACTS
φ indeed has two free variables.
Notie that
φ
front of it.) The rst part of
that
φ (a, b)
f (a) = 0
or one.) If
beause
b=0
a+b
φ (a, 0)
then
Suppose there are
payo of Player
is not free beause there is a quantier
is either one or zero. The seond part says that
φ (a, 0)
if and only if
is true. To see this, rst notie
φ
(This is beause the rst part of
is indeed true. If
b = 1,
x+y
b
requires
then the seond part of
φ
to be zero
beomes false
Complete information Contrating Game
m
players. Player
i is ui (a1 , . . . , am ).
i if he takes ation ai
ontrat,
submits a
z
is an odd number.
4.
to player
b∈
/ {0, 1}.
is false whenever
y
states that
is divisible by two. Notie that
(The variable
11
i
Ai .
has a nite ation spae
Let
We use the onventional notation that
while the other players take ation
N
whih is a denable orrespondene from
m
a−i .
to
A
denote
×m
i=1 Ai .
The
ui (ai , a−i ) is the payo
Eah player simultaneously
N
2 ,
where `denable' is to be
understood in the sense of Denition 1. At stage two, players take ations simultaneously from
subsets of their ations spaes. These subsets are determined by the rst-stage ontrats. If at
stage one player
two if
j
submitted ontrat cj (j
[ai ] ∈ ci ([c1 ] , ..., [cm ]).
= 1, ..., m), then player i an only take ation ai at stage
We restrit attention to pure-strategy subgame perfet equilibria of
this game.
The pure strategy minmax value for player
ui =
Let
aj
be any one of the ations that
for player i, suh that,
i
min
is
max ui (ai , a−i ) ,
a−i ∈A−i ai ∈Ai
j
uses to attain his minmax payo. Let us x an ation
aj1 , ..., ajm ∈ arg min uj aj , a−j .
aji i
a−i
j
That is, ai is the ation that player
all
i
uses to punish player
j.
For onveniene, dene
ajj = aj
for
j ∈ {1, ..., m}.
Theorem 1.
The ation prole
a∗ = (a∗1 , ..., a∗m ) ∈ A
outome in the ontrating game if and only if
is supportable as a pure-strategy SPNE
∗
ui (a ) ≥ ui
for eah i.
Before we proeed with the proof of the theorem, we reall two piees notations from the
introdution.
[< n >] = n.
First, if
n ∈ N
Seond, for any text
stands for a free variable in
is
x1 < x2 , n1 = 1,
variable:
< xi >
and
ϕ
then
n2 = 2
(x1 ,...,xk )
< n >
then
ϕ,
xi
then
, where
let
ϕ
denotes the text whose Gödel ode is
(n1 ,...,nk )
ϕ
i ≤ k.
is
1 < 2.
ni
in
ϕ(n1 ,...,nk )
is dened for all
Mathematial Logi, that if
ϕ
ϕ
for
i = 1, ..., n.
For example, if
Consider now the following text in
One an evaluate this statement at any
vetor of integers. Sine the Godel oding was a bijetion
addition,
and
That is,
denote the statement where if the letter
is evaluated at
(n1 ,n2 )
n.
(n1 , ..., nk ).
< ni >
k
xi
ϕ
free
k -dimensional
is a text for eah
n i ∈ N.
In
In addition, it is a well-known result in
f (n1 , ..., nk ) = < ni >(n1 ,...,nk ) ,
then
f
is a denable funtion.
12
Proof.
MICHAEL PETERS AND BALÁZS SZENTES
First, we prove the only if part. Fix an equilibrium in the ontrating game. Let
j (j = 1, ..., m)
the equilibrium ontrat of player
i
Notie, that player
he an oer
c:N
m
oering
is,
ej
A
3
c instead of ci .
suh that
c (n1 , ..., nm ) = N
ui < ui ,
We show that if
Let
e
cj = cj
if
is the ation spae of player
Also notie that
ui
denote player
i's
denote
equilibrium payo.
an always oer a ontrat that does not restrit his ation spae. That is,
→ 2N ,
obviously denable.
and let
cj
fi = Ai .
A
j
player
j 6= i and e
ci = c.
for all
i
(n1 , ..., nm ) ∈ Nm .
The ontrat
i
ej = {aj : [aj ] ∈ e
A
cj ([e
c1 ] , ..., [e
cm ])}.
min
The inequality follows from
max ui (ai , a−i ) ≥
ej ⊆ Aj
A
for all
minmax value by oering the ontrat
j.
That
(e
c1 , ..., e
cm ).
in any pure strategy equilibrium of this subgame
is weakly larger than
e−i ai ∈Ai
a−i ∈A
is
an protably deviate at the rst stage by
Let
in the subgame generated by the ontrat prole
The payo of player
c
min
max ui (ai , a−i ) .
a−i ∈A−i ai ∈Ai
Therefore, player
i
an always ahieve his pure
c.
For the if part, onsider the following ontrat of Player i,
m
cixi ,x−i ,
in
m
free variables:
cix1 ,...,xm ([cj ])j=1 =

 [a∗ ]
h ii
 aji
(4.1)
| k : < xk >(x1 ,...,xm ) 6= [ck ] | =
6 1,
(x1 ,...,xn )
if
k : < xk >
6= [ck ] = {j}
if
The expression (4.1) is not a ontrat, but rather a ontrat with free variables.
Eah suh
o
n
i
cγ ,...,γm
The funtions
have no free
expression has a Godel ode, so let
i
om
n 1
i
variables, so they onstitute a set of ontrats. We will now show that cγ ,...,γ
onstitutes
1
m i=1
1
m
. First observe what
an equilibrium prole of ontrats whih support the outome ak , . . . , ak
1
m
γ i = cix1 ,...,xm .
happens when all players use ontrat
Player
player
i
ciγ 1 ,...,γm .

 [a∗ ]
m
h ii
ciγ 1 ,...,γm ([cj ])j=1 =
 aji
if
if
Notie that
| k : < γ k >(γ 1 ,...,γm ) 6= [ck ] | =
6 1,
(γ 1 ,...,γ m )
6= [ck ] = {j} .
k : < γk >
needs to hek whether the Godel ode of
k 's ontrat, ck .
γk
is equal to the Godel ode of
is the Godel ode of the ontrat with free variable
ckx1 ,...,xm .
γ 1 , ..., γ m
k
(whih gives the ontrat cγ ,...,γ ), then evaluate its Godel ode. This is what is to be ompared
1
m
Player
i's
The integer
< γ k >(γ 1 ,...,γm )
ontrat says to take this ontrat with free variable, x the free variables at
with the Godel ode of the ontrat oered by
k.
Of ourse, these are the same in equilibrium
k
beause ck = cγ ,...,γ . Sine this is the ase for all m − 1 of the other players, player i ends up
1
m
∗
taking ation ai . So these ontrats support the outome we want if everyone uses them.
3
For example, the prediate
{x1 = x1 } ∧ ... ∧ {xm = xm } ∧ {y = y}
denes c. That is, for all y ∈ N the prediate is true no matter how the free variables are evaluated.
DEFINABLE AND CONTRACTIBLE CONTRACTS
Nm
13
2N . o
However, any suh ontrat
n
j
will have a dierent Godel ode, and so will indue the punishment ai
from the other players.
i6=j
n o
j
Reall that ai
is the ation prole that players other than player j use to minmax player j .
Player
Sine
j
an deviate to any denable ontrat mapping
uj (a) ≥ uj
into
i6=j
any deviation will be unprotable.
One might argue that restriting the spae of ontrats to be denable funtions of Godel odes
is both arbitrary and unnatural. Indeed, there is no reason for a judge to interpret a ontrat as
a desription of a mapping from the Godel odes of the ontrats oered by the other players to
the ations spae of the player. For that matter, the judge might not even know about the Godel
oding. It is important to note that the salient feature of denable ontrats is that they an be
written as texts that use a nite number of words in a formal language. The set of nite texts
seems a very natural desription of the set of feasible ontrats. In fat, from this perspetive it
seems that
any
reasonable desription of the set of feasible ontrats should allow any suh text.
The ompliation with suh a broad desription of the set of ontrats is that to properly dene
a game, one must fully desribe the mappings from proles of texts into payos.
Many texts
will be omplete nonsense and some modelling deision has to be taken about how these would
translate into ations and payos. The ontrats that we speify above are denable texts that
have two advantages in this regard. First, sine every nite text has a Godel ode, they tie down
the ation of the player who oers suh a ontrat even if the other players in the game oer
ontrats involving texts that make no eonomi sense. Furthermore, if all players oer ontrats
from the set we speify, an outome for every player is uniquely determined.
Finally, sine the Godel oding itself is denable, the oding an be embedded diretly into the
ontrat. So players don't need to agree to use the Godel ode of other ontrats. They an use
the Godel ode unilaterally, and the impliations of the ontrat will be understood by the others
provide they agree on the underlying language in whih ontrats are written.
Generalizations.
Everything about this theorem involves pure strategies. This imposes limits
on its appliation. Next, we disuss how to extend our result to the ase when players an mix
over their restrited ation spae at the seond stage of the game but annot randomize over the
ontrats they oer at the rst stage.
Allowing suh mixing expands the set of payo proles
that an be supported by equilibria for two reasons. First, sine players an randomize ertain
onvex ombinations of payo proles an now be supported. Seond, players an use mixing when
punishing a deviator, and hene the minmax value of the players will be smaller.
Formally, for all
S = ×i Si , Si ⊂ Ai ,
and the payo funtion of player
i
dene a game,
is the restrition of
GS ,
ui
where the ation spae of player
on
S.
Let
E (S)
∗
equilibria in GS . Dene the minmax value of player i, ui , as
u∗i =
min
max
min
S−i ⊂A−i Si ⊂Ai σ∈E(S−i ×A)
S−i =×j6=i Sj
Z
ui (a) dσ (a) .
i
is
Si ,
denote the set of mixed
14
MICHAEL PETERS AND BALÁZS SZENTES
The idea is that in the ontrating game, players an restrit their ation spaes arbitrarily, hene,
when they punish player
i
they an hoose
S−i
arbitrarily. On the other hand, their seond-stage
ations must be best responses, and that is why we have to onsider equilibrium payos in the
restrited game. An argument idential to the proof of Theorem 1 shows that the random alloation
σ ∈ ∆ (A)
an be supported as an equilibrium if
∃Si ⊂ Ai for all i, suh that σ ∈ E (×i Si ),
R
ui (a) dσ (a) ≥ u∗i for all i.
(ii)
(i)
and
What happens if players are allowed to randomize over the ontrats they oer? It is possible
to show that part (i) an be ompletely relaxed. That is, the distribution over the outomes does
not have to be an equilibrium in
GS ,
and it does not even have to be generated by independent
randomizations of the players. The onstrution of mixed equilibria in our ontrating game that
supports orrelated outomes is entirely based on Kalai et.al. (2008). The authors onsider twoperson games where players submit ommitment devies instead of taking ations. A devie then
determines the ation of the player as funtion of the other devie. The authors onstrut a set
of devies suh that any individually rational orrelated outome an be implemented as a mixed
equilibrium in the game. That is, although the players mix independently over their devies, the
distribution over the ations proles will be orrelated. It is easy to show that these results extend
to
n-person omplete information games, and in addition,
the the equilibrium ommitment devies
onstruted by Kalai et.al. (2008) are denable funtions as long as the probabilities involved in
eah mixing are all rational numbers.
Theorem 2.
Suppose that
σ ∈ ∆ (A),
and
σ (a) ∈ Q
for all
a ∈ A.
The distribution
σ
an
be supported as a mixed-strategy equilibrium outome in the ontrating game if and only if
R
ui (a) dσ (a) ≥ u∗i
for all
i ∈ {1, ..., m}.
Another question is why we use denable funtions as opposed to programs or Turing mahines.
One might want to require that the ontrats must be omputable and assume that the set of
available ontrats is the set (or a subset) of Turing mahines.
(i = 1, 2) hooses mahine τ i , then τ i
runs on the desription of
In suh a model, if player
i
τ j , and the output will be a subset
of the ation spae of player i. It is well-known, that one an onstrut self- and ross-referential
4
ontrats (mahines) in this spae too.
In fat, this onstrution is essentially idential to our
onstrution of ross-referential denable funtions. Most importantly, the equilibrium ontrats
we onstrut to support individually rational alloations are, in fat, reursive funtions, and hene
they are omputable by Turing mahines. Therefore, if the reader insists on omputability, he an
restrit attention to the spae of Turing mahines.
There are, however, several advantages of our approah over modelling ontrats with Turing
mahines. First, Turing mahines do not always halt, and therefore, it is not lear how one an
4
Suh mahines were onstruted even in the ontext of Game Theory, see Anderlini 1990 and Canning 1992.
DEFINABLE AND CONTRACTIBLE CONTRACTS
15
dene the restrition on the ation spae of a player, if his mahine does not halt.
A way to
handle the halting problem is to restrit the spae of Turing mahines to be the set of mahines
that always halts. We nd suh restritions arbitrary. Instead of restriting the spae of reursive
funtions, we expanded it to be the set of denable funtions and avoided the halting problem that
way. Seond, another problem with Turing mahines is that they an only ondition on the atual
desription of the mahines submitted by the other players but annot ondition on the funtions
what the mahines ompute. Take the example of the prisoner dilemma. It is possible to onstrut
a Turing mahine,
τ,
suh that
τ ([τ 2 ]) =
(
C
D
if
[τ 2 ] = [τ ]
otherwise.
The problem is that if player 2 submits a mahine, say
with
τ,
τ ′,
whih is omputationally equivalent
but has a dierent desription, then player 1 would defet.
In fat, it is not possible
to onstrut a mahine whih does not suer from this problem. We avoid suh problems with
denable funtions. Indeed, it is possible to express ontrats that do not ondition on the atual
way the other ontrat is written, but on the funtion itself that the other ontrat desribes.
Consider
c1 ([c2 ]) =
The ontrat
cγ
(
c∗2 ⇔ c2 ,
C
if
D
otherwise.
is obviously denable, but does not ondition on the atual form of
c2 .
As long
∗
as c2 represents the same funtion as c2 , ooperation is presribed.
5.
Contrating in a Bayesian Environment
In the previous setion, we showed how ontratible ontrats an be used to support any
alloation for whih every player's payo is at least his minmax value. Assuming non-partiipation
is always an option, this is the set of alloations that is supportable by a entralized mehanism
designer. In this sense, ontratible ontrats ompletely deentralize the alloation problem. In
this setion, we show that the same result is not true in the Bayesian ase. We do this by proving a
theorem that ompletely haraterizes the set of alloation rules that an be supported as Bayesian
equilibrium in the ontrating game. We then onstrut alloations that an be supported by a
entralized mehanism designer, but whih annot be supported as equilibria.
We also show, however, ontratible ontrats make it possible for one player's ation in a
ontrat equilibrium to depend on another player's type. The reason is that ontrats expliitly
ondition ations on other player's ontrats, whih, along the equilibrium path, an depend on
their types. We exploit the reiproal ontrating idea desribed above to enfore type ontingent
agreements. The idea is that a ontrat will speify a number of 'target' Godel odes, one for eah
of the ontrats the other player's dierent types are supposed to oer along the equilibrium path.
16
MICHAEL PETERS AND BALÁZS SZENTES
As long as other players oer a ontrat whose ode is equal to one of these targets, the ontrat
responds with a 'ooperative' ation. If any of the others deviate, the ontrat will respond with
some kind of punishment.
When information is omplete, a punishment is simply an array of ommitments that nondeviators make.
These ommitments are suh that they make all deviations unprotable.
We
want to extend this idea to the Bayesian ase. There are a number of diulties assoiated with
this. First, ontrat oers depend on player types, so they reveal information. Potential 'deviators'
an ondition their play on non-deviators' ontrats, so they an ondition their ommitments on
the non-deviators' type. Players may not want to reveal their type information at the ontrating
stage for this reason.
Nonetheless, they will want at on this type information ex post.
So
ontrats will typially bind players to subsets of their ations, leaving some disretion for them
to vary their ations with their types ex post.
So it is important to think of ontrat oers as
ommitment orrespondenes, rather than simple ommitments to ations as would sue with
omplete information.
Seondly, non-deviators have the ability to respond to deviations ontratually. They an make
their punishments more severe by exploiting residual unertainty that the deviator has about
their type ex post. They would do this, again, by ommitting to a subset of ations instead of a
single ation. The punishment for a deviation onsists of two parts for this reason: a punishment
orrespondene, that restrits the non-deviator's ex post hoie to a subset of his ations, and a
seond stage strategy that depends on the non-deviator's type (onstrained by the information the
non-deviator reveals about his type with his on path ontrat).
Perhaps the most omplex part of ontrating equilibrium is that non-deviators' responses depend on the deviator's ontrat.
sophistiated way.
As suh they ould, in priniple, depend on the deviation in
The simple logi that we developed for the reiproal ontrating argument
above relied on the idea that non-deviators ould
punish
deviators by minmaxing them.
The
minmax ation doesn't depend on exatly how the non-ooperative player hooses to be nonooperative. With inomplete information, there is no natural analog for the minmax punishment.
The ability to ontrat on other ontrats suggests the possibility that non-deviators ould reat to
deviations in a way that depends on exatly what the deviation is thereby holding on path payos
to something below the orresponding min max payo.
A remarkable property of our theorem is to show the sense in whih punishments an be understood to be invariant to the manner in whih a player hooses be non-ooperative. We show
that any ontrat equilibrium an be supported by having non-deviators respond to a deviation
with a ontratual ommitment that is independent of what the deviator hooses to do. Ex post
hoies that non-deviators make from their ontratual ommitment orrespondenes will typially
depend on the deviation.
trats.
However, these ex post hoies aren't part of the non-deviator's on-
This result allows us to extend the reiproal ontrating idea to Bayesian games.
If a
player oers a ontrat that is onsistent with equilibrium play by one of his types, then the other
DEFINABLE AND CONTRACTIBLE CONTRACTS
17
players respond ooperatively. Otherwise, there is a single punishment orrespondene that the
non-deviators impose, just as in the omplete information ase.
This property of ontrat equilibrium is a onsequene of restriting players to denable ontrats. It is the part of our theorem that allows us to show the limits of ontratible ontrats,
sine we an show the kinds of alloations that an't be supported as equilibrium.
This is the
part of our theorem that we use to provide examples of alloations supportable by a mehanism
designer but not by ontratable ontrats. This is the advantage we derive from speifying the
ontrat spae very preisely. An abstrat ommitment spae suh as the one provided in [7℄, or a
spae of ontrats that is onstruted to have desirable properties, suh as programs in [13℄, an be
used to show that a large set of alloations an be supported as equilibrium alloations. However,
in the Bayesian ase, a omplete haraterization requires a demonstration that ertain alloations
annot be supported.
To provide this, a omplete desription of the set of feasible ontrats is
required. Our Lemma illustrates that denability provides just suh a omplete desription.
The model is the same as in the previous setion, with the addition of player types. There are
m
players. Player i's ations spae is a nite set denoted by
from a nite set
T
i
Ai .
Eah player
i
has a type ti drawn
. The joint distribution types is ommon knowledge. The payo of player
ui (ai , a−i , t) where t ∈ T1 × · · · × Tm .
Notie that a strategy rule for player
i in the
i
is
Bayesian game
|Ti |
.
the players might otherwise be involved in is an element of Ai
Our haraterization of the set of alloations that an be supported as ontrat equilibrium
hinges on the information that equilibrium play reveals about players' types. Our argument refers
repeatedly to the information that is revealed through equilibrium play. Most things that happen
o the equilibrium path depend on this information.
use the
information partition
orrespondene
type
ti .
τ i : Ti → 2
Ti
A natural way to inorporate this is to
indued by these ontrats.
Fix an equilibrium, and dene the
to mean the set of types of player
One other players see the ontrat oered by player
believe that
i's
type lies in the set
τ i (ti ).
i
i
who oer the same ontrat as
of type
The orrespondene
τi
ti ,
is an
they should ommonly
information partition.
Similarly, the orrespondene
τ −i (t−i ) =
Y
τ j (tj )
j6=i
desribes the information available to player
i
about the types of the other players.
Eah ontrat speies a set of ations from whih players subsequently hoose. In this sense,
equilibrium ontrats support a ommitment
orrespondene
for eah player. As ontrats depend
on other ontrats, whih in turn depend on other players' types, this ommitment orrespondene
an be written as a mapping
ri : T → 2Ai .
Sine the set from whih
i
hooses his ation an only
depend on some other player's type to the extent that the other player's ontrat varies with his
type,
ri
should be measurable with respet to the information partition
τ −i .
Contrats speify sets of feasible ations. Ultimately, payos are determined by players' hoies
from these sets in the nal stage of the game.
Let
si : T → Ai
denote the outome funtion
18
MICHAEL PETERS AND BALÁZS SZENTES
assoiated with the seond stage strategies.
that
si (t)
lies in the set
depend on
t
instead of
ri (t)
ti .
5
These outome funtions must have the property
for eah t. It might seem strange that this outome funtion should
The reason that player
other players is twofold. First, player
i's
equilibrium ations depend on the types of
i gets to see the ontrats oered by eah of the other players.
His beliefs vary as the other players' ontrats vary, so his ations in the seond stage will vary
with the other players types. Seondly, his own ommitments depend on the ontrats, and thus
the types of the other players. Evidently, player
i
only observes types imperfetly by observing
the ontrats that are oered. This is aptured simply by observing that this indued outome
funtion must be measurable with respet to the information partition
τ −i .
Eah o equilibrium ontrat oered by a deviator speies a ommitment for eah array of
ontrats oered by the other players. Sine the ontrats the other players oer depend on their
types, a deviation implies a ommitment orrespondene
fi : T−i → 2Ai .
Sine these types are
revealed only through the ontrats that the others oer, this orrespondene should be measurable
with respet to the information partition
τ −i
that aptures this information. Let
all ommitment orrespondenes available to the deviator, i.e.,
mappings from
T−i
into
2
Ai
Fi
is the set of all
Fi
be the set of
τ −i
measurable
.
In a ontrat equilibrium, a deviation leads to two sorts of 'punishments'. First, sine the other
players' ontrats speially ondition on the ontrat oered by the deviator, the non-deviators
will hange their ommitments. As mentioned above, we are going to show that the punishment
orrespondene assoiated with this hange in ommitments an be taken to be independent of the
deviation
fi .
However it is possible that the way that the non-deviators hoose from sets to whih
they have ommitted themselves will depend on
i
p =
j
Q
fi .
i deviates as pij : T−i → 2Ai and
i
j6=i pj . In a ontrat equilibrium, this punishment is the onsequene of the ontrat that
Write the 'punishment' that player
j
imposes when player
has written, so the punishment an only vary with
j 's
type to the extent that
j 's
ontrat does.
As a onsequene, this punishment will be measurable with respet to the information partition
τ −i .
Finally, we need to desribe the non-deviators' behavior in the ex post stage, let
T−i → Aj .
equilibrium
The non-deviator
j
sij : Fi ×
an no longer ondition his behavior on information revealed by i's
behavior. However, he does observe i's ommitment orrespondene in the sense that
he observes the deviator's ontrat. He also observes the on equilibrium ontrats of the others.
This is aptured, as always, by requiring that his behavior be measurable with respet to
5
τ −ij .
The
To simplify the argument slightly, we fous on pure strategy outomes here. It is ompletely trivial to extend
this argument to outomes that involve randomization at the seond stage by having the outome funtions be
mappings from T into △ (Ai ), restriting the supports of these mappings to lie in ri (t), then letting ui (s (t) , t) be
the expeted utility assoiated with the randomization. With this notation, the inequalities that haraterize the
equilibrium remain unhanged.
DEFINABLE AND CONTRACTIBLE CONTRACTS
ation
sij (fi , t−i ) should be ontained in pij (t−i ) for every t−i ∈ T−i
be the outome funtion assoiated with a deviation. for eah
and
19
fi ∈ Fi .6
Let
si =
i = 1, . . . m,
Q
j6=i
sij
The theorem is based on a pair of inequalities that are based on the objets dened above. The
rst aptures the idea that no player wishes to mimi the equilibrium behavior of another of his
own types. For eah
i = 1, . . . m,
and eah
ti
and
t′i
Et−i (ui (s (t) , t) : ti )
(5.1)
≥ Et−i
max
ai ∈ri (t′i ,t−i )
Et′−i
The ommitment orrespondene
ri
ui ai , s−i t′i , t′−i
,
ti , t′−i
:
t′−i
∈ τ −i (t−i ) : ti
!
results in a olletion of ations from whih player
.
i is ommit-
ted to hoose in the seond stage. This hoie set an depend on the types of the others beause
max
their ontrats do. The
operator on the right hand side requires the player to hoose a best
reply from this set given posterior beliefs.
Taken together, these onstraints for all the players
requires that play in the seond stage onstitutes a Bayesian equilibrium of the game in whih
eah player hooses an ation from the set of ations to whih he is ommitted, given posterior
beliefs about players' types.
To deal with deviations at the ontrating stage of the game, we require that for eah ti
∈T
Et−i (ui (s (t) , t) : ti ) ≥
(5.2)
max Et−i
fi ∈Fi
max Et′−i ui a, si fi , t′−i , ti , t′−i , t : t′−i ∈ τ −i (t−i ) : ti .
a∈fi (t−i )
A deviation implies a ommitment orrespondene
fi .
The inequality says that even if the deviator
hooses a best reply from the set of ations to whih he is ommitted, he annot gain by deviating.
So far we have imposed no restritions on seond stage punishment behavior. In appliations it
is natural to want behavior in the seond stage to be onsistent with some renement like
perfet
Bayesian equilibrium. Our theorem works with most standard renements but does not depend
on them. To illustrate, let
T̃−i
be a subset of
T−i .
Ã1 , . . . , Ãm
be a olletion of subsets of the players ation sets and let
while it is ommon belief that the non-deviators' types lie in
the subset of ations in
Ã−i
ned) Bayesian equilibrium,
have
i
h
Ri Ã1 , . . . , Ãm , T̃−i
6
Ri
has deviated from
the seond stage,
T̃−i .
Let
Ri Ã1 , . . . , Ãm , T̃−i
i
be
that are onsistent with some renement. For example in a (unrei
h
Ri Ã1 , . . . , Ãm , T̃−i = Ã−i . A slightly stronger renement might
ruling out ations that are stritly dominated given that players are
onstrained to hoose ations in
equilibrium,
i
Ãihin
The interpretation of these objets is that some player
an equilibrium. Contrats onstrain the players to hoose from the sets
Ã1 , . . . , Ãm .
Finally, onsistent with the idea of perfet Bayesian
might ontain only ations that are onsistent with Bayesian equilibrium in the
If mixing is allowed in the last stage, then the support of sij (fi , t−j ) should be ontained in pij (t−i ).
20
MICHAEL PETERS AND BALÁZS SZENTES
game dened by subsets
some
Ã1 , . . . , Ãm
, ommon belief that the non-deviators' types lie in
belief about the deviator i. We will all a Bayesian equilibrium in ontrats an
T̃−i
and
R-equilibrium
if following eah array of ontrat oers for whih player i's ontrat is inonsistent with his equilibrium strategy and all other players ontrat oers are onsistent with the equilibrium strategies
of players whose types lie in
T̃−i ,
ontinuation play lies in
i
h
Ri Ã1 , . . . , Ãm , T̃−i
where the
Ãj
are
the ontratual ommitments of eah of the players. We an now state our main theorem.
Theorem 3.
An alloation rule
s : T → A an be supported as an R-equilibrium in ontrats if and
r,
only if there is a ommitment orrespondene
i
punishments
p i=1,...,m , and
Q
τ i , eah pi is
respet to τ =
in
r (t)
for eah
a olletion of information partitions
outome funtions
i
s i=1,...,m
suh that
r
{τ i }i=1,...,m ,
is measurable with
τ −i , the support of s (t) is ontained
t, s (fi , t−i ) ∈ Ri fi (t−i ) , p (t−i ) , τ −i (t−i ) for eah fi and t−i , and (5.1) and
i
(5.2) are satised.
measurable with respet to
i
One of the key properties of this theorem is to show that when ontrats are denable, equilibrium must support a single punishment orrespondene of exatly the kind we have desribed.
Speially, it is a punishment orrespondene that is independent of
f.
This is entral to the
reiproity idea that we developed at the beginning of the paper. Unooperative behavior by one
player provokes a punishing ontratual response from the others that doesn't depend on exatly
how the deviator goes about being unooperative. We want to show that it is the seond stage
ommitments that apture this property of reiproity.
7
This property of ontrat equilibrium is
also very surprising. It might seem that ontrat equilibrium ould be supported with very ompliated ontrats that punish deviators in a way that is sensitive to exatly how they deviate. We
show that this is not the ase.
Finally, the theorem is written assuming that players use pure strategies at every stage. The
primary reason we assume away mixed strategies is so that we an use an information partition to
represent players knowledge at the seond stage instead of a more omplex measure of information.
We assume pure strategies in the seond stage simply for onsisteny. It is ompletely trivial to
extend the theorem to allow randomization at the seond stage. This involves nothing more than
assuming that
ui (s, t)
sii (fi , t−i ) are mixtures on Aj
whose support is ontained in
to be expeted utility assoiated with the mixture
s
when
pjj (t−i ), then redening
s ∈ △ (A).
The theorem and
proof then proeed verbatim.
6.
We write the proof in three parts.
Proof of Theorem 3
The rst part shows the 'if ' part of the theorem.
generalization of the reiproal ontrating idea presented above.
It is a
Before going on to the more
diult 'only if ' part, we prove the Lemma that is interesting for its own sake, and whih forms
the basis of the seond part of our proof. Finally, we give the proof of the only if part.
7
Care here is needed to observe that seond stage
behavior
does depend on the deviation in the rst stage.
DEFINABLE AND CONTRACTIBLE CONTRACTS
6.1.
21
If Part:
Proof.
Suppose that
s, τ , r, pi , si
satisfy (5.1) and (5.2). We onstrut a Bayesian equilib-
t
x denote xjj
rium in the ontrating game whih implements the alloation s. Let
where
t
xjj
j∈{1,...,m}, tj ∈Tj
denotes a free variable. Consider the following ontrat in
=
|T |
,
free variables:
ctxi ([c1 ] , ..., [cm ])

tk
tk

< xtkk >(x) = [ck ] ,
if ∀k∃xk ∈ xk : tk ∈ Tk
s.t.
 ri (t)

pji (t−j ) if k : ∄xtkk ∈ xtkk : tk ∈ Tk s.t. < xtkk >(x) = [ck ] = j,



Ai
otherwise and if k + 1 > k if k ∈ {j : τ (tj ) = τ (ti )} ,
The last statement is in the third line is always true. Suh a statement, however, makes it possible
that a player with two dierent types oers two dierent but omputationally equivalent ontrats.
Let
γ tii
denote the Godel Code of this ontrat and let
oered by player
i
t
with type ti will be: cγi . Then
=
Notie that
γ = γ tii
i,ti
.
The equilibrium ontrat
ctγi ([c1 ] , ..., [cm ])


if ∀k∃tk ∈ Tk s.t.
< γ tkk >(γ) = [ck ] ,

 ri (t)
pji (t−j ) if k : ∄tk ∈ Tk s.t. < γ tkk >(γ) = [ck ] = j,



otherwise and if k + 1 > k if k ∈ H (ti ) ,
Ai
t
t
< γ qq >(γ) = cγq .
Therefore, the previous ontrat an be rewritten as
ctγi ([c1 ] , ..., [cm ])

t 
if ∀k∃tk ∈ Tk s.t.
cγk = [ck ] ,

 ri (t)
t j
=
pi (t−j ) if k : ∄tk ∈ Tk s.t. cγk = [ck ] = j,



otherwise and if k + 1 > k if k ∈ H (t ) ,
A
(6.1)
i
i
Next, we speify the strategies of the players at the seond stage. If for all
suh that player
j
tj
oers a ontrat cγ , then Player
player deviated, say Player
Dene
fk : T−k → 2
Ak
k,
takes ation
and he oered a ontrat
ck ,
si (t).
and player
j
there is a
tj ∈ T j
Suppose now that one
oered
t
cγj
for all
j 6= k .
as follows:
fk (t−k ) = ck
(6.2)
i
j
h i
t
cγj
ck , ctγj j6=k ,
denotes the vetor of the Godel odes of players other than k . Dene player i's
j6=k
k
strategy as si (fk , t−k ). Notie that by (6.1) these seond-stage strategies are onsistent with the
k
restritions imposed by the ontrats and the renement Rk , that is, si (t) ∈ ri (t) and s (fk , t−k ) ∈
Rk fk (t−k ) , pk (t−k ) , τ −k (t−k ) . (We do not have to speify the strategies if more than one
where
players deviate at the ontrating stage.)
We shall argue that the strategies desribed above onstitute an
R−equilibrium in the ontrat-
m
ing game. First, we show that the strategies {si }i=1 are optimal in the seond stage. Consider
22
MICHAEL PETERS AND BALÁZS SZENTES
onstraint (5.1) with
t′i . This onstraint requires
ti =
of the other players.
to be a best response to the strategies
It remained to show that players do not have inentive to deviate at the
k
ontrating stage. Suppose that player
ctγk .
si (t)
We shall onsider two ases.
tk oers a ontrat ck whih is dierent from
t′k
cγ but τ k (tk ) 6= τ k (t′k ). Then, by (5.1), this
with type
Case 1:
ck =
deviation is not protable no matter what the strategy of player
t′
ck 6= cγk
for all
t′k ∈ Tk .
Suh a deviation indues player
i
k
is at the seond stage. Case 2:
with type
ti
to take ation
ski (fk , t−k ).
Hene, by (5.2) suh a deviation annot be protable.
6.2.
Invariant punishment orrespondene.
of the punishment orrespondene
denes
fi .
p
i
The point of this setion is to show the existene
. A deviator ontemplates dierent ommitment orrespon-
What this Lemma shows is that there has to existene some xed punishment orre-
spondene
pi (t−i )
suh that no matter whih ommitment orrespondene the deviator want to
implement, there must be a way for him to write his ontrat in suh a way that the response of the
non-deviators is exatly the same, and is given by this orrespondene
pi .
This is a onsequene
of the fat that ontrats are required to be denable funtions.
Let
ctii
denote the ontrat of Player
Lemma 4.
For any array
i
funtions, pk
(t−i )
for all
ctii
k 6= i,
i
with type
i=1,...,m, ti ∈T i
ti .
Dene
o
n
t′
τ (t) = t′ ∈ T : ∀i ctii = cii .
of ontrats and every
suh that for any
τ −i
i,
there are
measurable funtion
τ −i
measurable
fi : T−i → 2Ai
, there
∗
is a ontrat ci suh that
h i
t
c∗i [c∗i ] , cjj
(6.3)
and for all
k 6= i
ctkk
(6.4)
h i
t
[c∗i ] , cjj
= f (t−i )
= pik (t−i ) .
j6=i
j6=i
t
cjj .
Eah
alternative ontrat that he oers against this array indues a ommitment orrespondene
f (t−j )
In a ontrat equilibrium, a player expets his opponents to oer the ontrats
and eliits some kind of response. The Lemma shows that provided the ontrats the others oer
are all denable funtions, there must exist some olletion of punishment orrespondenes
suh that for any ommitment orrespondene that player
i
wants, he an write his own ontrat
in suh a way that the others respond with exatly the same punishment
First, we reformulate the statement of the lemma. Let
sional vetor of subsets of
|T |
(Ai )τ −i
For all
n
τ
=
τ (t )
Ai −i −i
A−i−i
t−i
t−i ∈T−i
τ
⊂ (Ai )
: A−i−i
(t−i )
|T−i |
(Ai )τ
:
t
Ai−i
dene
⊂ A−i , ∃ci
S
∈
t
Ai and Ai−i
τ (t )
Ai −i −i
s. t.
=
t−i
t′
Ai−i if
pik
.
denote the set of
whih are measurable with respet to
|T−i |
t−i
(t−i )
t
Ai−i
Ai
i
pk
τ −i ,
|T−i |
dimen-
that is,
τ −i (t−i ) =
τ −i t′−i
.
as follows:
o
i
h
i
h
τ (t )
t−i
t−i
τ (t )
t−i
= A−i−i −i .
[ci ] , c−i
= Ai −i −i , c−i
ci [ci ] , c−i
DEFINABLE AND CONTRACTIBLE CONTRACTS
τ −i (tt−i )
A−i
Ai −i
n
o
τ (t )
S
Ai −i −i
∈S
τ
(t−i )
23
. By the denition of S ,
t−i
t−i
there exists a ontrat, ci , available for player i suh that if the type prole of the other players
τ −i (t−i )
is t−i , then if player i oers ci then his restrited ation spae will be Ai
and players −i's
τ −i (t−i )
. We laim that the statement of the lemma is equivalent
restrited ations spae will be A−i
Let us explain what it means that
to
∩ τ −i (t−i ) ff
Ai
(6.5)
t−i
t−i
6= {∅} .
To see this, suppose rst that the previous displayed statement is true, and
an element of the intersetion.
Dene
pki (t−i )
τ −i (t−i )
to be A−i
for all k
6= i
τ (t )
A−i−i −i
and
pki
t−i
is
t−i ∈ T−i .
(t−i ) t−i ∈
τ −i measurable funtion fi : T−i → Ãi , onsider S (fi (t−i ))t−i . Sine
S (fi (t−i ))t−i , and by the denition of S , there exists a c∗i suh that (6.3) and (6.4) are satised.
k
τ measurable funtions
Conversely, suppose that (6.5) is not true. Then, for all pi (t−i )
k6=i −i
o
n
τ −i (t−i )
|T |
∈ (Ai ) −i , suh that
there exists Ai
For a
t−i
pii (t−i ) = pki (t−i )
n
o
τ −i (t−i )
p−i
(t
)
∈
/
S
A
−i t−i
i
i
t−i
. Then if fi (t−i ) is dened to be
k6=i
∗
does not exist a ontrat ci suh that (6.4) is satised.
where
Proof.
Suppose by ontradition that ∩ τ (t ) ff
Ai −i −i
o
|T−i |
τ (t )
⊂ A−i
A−i−i −i
t−i
n
o τ −i (t−i )
S
Ai
. Let us
n
t−i
o
n
τ (t )
∀ A−i−i −i
Let
cx
fτ −i (t−i )
there exists an
x a funtion
t−i
denote the projetion of
f
τ
Ai −i
(t−i )
t−i
⊂ Ai
o
n
τ (t )
: A−i−i −i
t−i
orresponding to
t−i
|T−i |
i |T−i |
−i |T−i |
→ 2(A )
f : 2(A )
−i |T−i |
⊂ A
,
n
o
τ (t )
S
Ai −i −i
o
n
τ (t )
Ai −i −i
t−i
suh that
t−i
Sine
f
and
cτ −i (t−i )
are denable funtions,
cx
o
n
τ (t )
A−i−i −i
t−i
. Dene
c = cτ −i (t′ ) ,
−i
otherwise.
is a denable funtion in one free variable. Let
denote its Godel ode. Then

 f
cτ −i (t−i ) ([cγ ]) t
τ −i (t′−i )
−i
cγ (c) =

Ai
∈
/
.
−i
st.
there
Then, for all
f = fτ −i (t−i ) t
That is,
∃t′−i ∈ T −i
t−i ∈ T−i ,
= {∅}.
n
o
t−i
∈
/S f
A−i
t−i .
if
suh that
as follows:

 f ′ c
t−i
τ −i (t−i ) ([< x > (x)]) t−i
cx (c) =

Ai
for all
if
∃t′−i ∈ T −i
st.
c = cτ −i (t′ ) ,
−i
.
otherwise.
γ
24
MICHAEL PETERS AND BALÁZS SZENTES
Notie that
cτ −i (t−i ) ([cγ ]) t
(6.6)
−i
S . On the other hand,
=
cγ cτ −i (t−i )
t
by the denition of
=
cτ −i (t−i ) ([cγ ]) t
−i
f.
cγ
cτ −i (t−i )
t−i
−i
and therefore,
by the denition of
n
o cτ −i (t′ ) ([cγ ]) ′
ft−i
−i
t−i
t−i
,
f cτ −i (t−i ) ([cγ ]) t
−i
(6.7)
∈S
∈
/S
cγ
cτ −i (t−i )
t−i
Notie that (6.6) and (6.7) ontradit to eah others, and hene the (6.5)
holds.
6.3.
Only if part of the proof of Theorem 3.
Proof.
m
i m
,
Fix an equilibrium in the ontrating game. We shall onstrut the objets τ , s, {ri }i=1 , s
i=1
i m
and p
suh that the onstraints (5.1) and (5.2) are satised. Denote the equilibrium ontrat
i=1
ti
of Player i with type ti by ci . Dene the partition, τ , as follows:
o
n
t′
τ (t) = t′ ∈ T : ∀i ctii = cii .
m
Next, we onstrut the funtions
{ri }i=1 .
(6.8)
ri (t) = ctii
for all
i ∈ {1, ..., m}.
the denition of
τ.
Notie that
Let
ri (t) ∈ 2Ai .
h t i
ctii , cjj
j6=i
In addition,
ri
,
is measurable with respet to
stage strategy of Player
i
qiti
h i
t
cjj
j6=i
ti . Observe that
h i ti h tj i
tj
ti
ti
∈ ci
ci , cj
cj
qi
with type
(6.9)
j6=i
j6=i
and (6.9). Let
s (t)
is obviously measurable with respet to
denote
τ −i .
denote the seond
must be satised aording to the rules of the ontrating game. Dene
si (t)
by
The seond-stage strategies depend on the ontrats oered at the rst stage.
First, we deal with strategies on the equilibrium path. Let
The funtion
τ −i
si (t) to be qiti
In addition,
h i
t
cjj
j6=i
si (t) ∈ ri (t)
.
by (6.8)
(s1 (t) , ..., sm (t)).
(τ , {r
} , s) satisfy
(5.1). First, onsider this onstraint with
ih
i t
t
cjj
requires qi i
to be a best-response of player i to the
We are ready to show that the triple
t′i = ti .
Then, this onstraint
strategies of the other players. Sine
and hene, (5.1) is indeed satised.
requires player
i
j6=i
qiti
was an equilibrium strategy, it has to be a best response
Seond, onsider (5.1) with
t′i 6= ti .
ti
with type ti to prefer to oer ontrat ci instead of
t′
ci i .
Then, this onstraint
Indeed, the left-hand-side
is just his equilibrium payo and the right-hand-side is the maximum payo of player
i
with type
DEFINABLE AND CONTRACTIBLE CONTRACTS
ti
t′i
ci
if he oered
. Sine,
ctii
was an equilibrium ontrat, suh a deviation annot be protable
and hene, (5.1) is satised.
It remains to onstrut
for all
k 6= i
and for all
m
{pi }i=1
and
i ∈ {1, ..., m}
denote the ontrat of Player
i
k 6= i
ctkk
i
Let qk
h i
t
fi , cjj
j6=i
ally deviates to ontrat
i m
s i=1
and show that (5.2) is also satised. Dene
aording to the statement of Lemma 4. In addition, let
h i h i
t
cfi i , cjj
j6=i
h
i h i
t
cfi i , cjj
j6=i
measurable with respet to
cfi i
= fi (t−i )
= pki (t−i ) .
denote the o-equilibrium strategy of player
cfi i .
pik (t−i )
suh that
cfi i
and for all
25
Dene
sik (fi , t−i )
τ −ik (t−ik ).
to be
h i
t
i
qk fi , cjj
k when
j6=i
.
player
i
unilater-
The funtion
Given these notations, (5.2) requires that player
i
sik
is
an-
fi
not protably deviate by oering an o-equilibrium ontrat in the form of ci , and hene, this
i
onstraint is satised. Finally, sine qk is an o equilibrium strategy in an R-equilibrium, so
h i
t
qki fi , cjj
j6=i
Ri
cfi i
h i h i
t
cfi i , cjj
j6=i
t−i
, c−i
∈
k6=i
h i h i
t
cfi i , cjj
j6=i
, τ −i (t−i ) =
Ri [fi (t−i ) , pi (t−i ) , τ (t−i )]
and the renement ondition is satised.
6.4.
Example 1 - making ations depend on types.
The following examples illustrate the
properties of the ontrating equilibrium in the Bayesian ase. The rst example, illustrates how
ontrating equilibrium an be used to make one player's ation depend on another player's type.
This is something that annot be aomplished in the Bayesian equilibrium of the original game.
In this example, the row player is privately informed and has one of two equally likely types,
and
t2 .
player,
Eah player has two possible ations in the default Bayesian game,
{b1 , b2 }
{a1 , a2 }
t1
for the row
for the olumn player. The payos for eah of the row player's types are given in
the following tables:
b1
b2
b1
a1
3, 3 −1, 4 a1
a2
0,0
0, 0
a2
0,0
b2
0,0 .
−1, 4 3, 3
This is a relatively simple oordination problem, save two things - the way the players want to
oordinate depends on the row player's type, and if the olumn player learns the row player's type,
his weakly dominate ation is inonsistent with the oordinated outome. The unique Bayesian
equilibrium has player 1 using ation
a1
if his type is t1 and ation
a2
if his type is t2 . The olumn
26
MICHAEL PETERS AND BALÁZS SZENTES
player randomizes with equal probability between his two ations. The expeted payos to the
3
olumn player are 2 , the expeted payo to the row player is 1 for eah of his types.
s (t1 ) = (a1 , b1 )
A mehanism designer an implement the oordinated outome
(a2 , b2 )
and
s (t2 ) =
by simply asking the informed agent his type, then instruting the uniformed agent whih
of his ations to take. If either player refuses to partiipate, then they simply play the Bayesian
equilibrium desribed above. The alloation is inentive ompatible and individually rational from
the mehanism designer's perspetive.
To show that the alloation rule
mitment orrespondene
s
is implementable as a ontrat equilibrium, dene the om-
r1 (t1 ) = {a1 }, r1 (t2 ) = {a2 }, r2 (t1 ) = {b1 }, r2 (t2 ) = {b2 }.
This
ommitment orrespondene is measurable with respet to full information and implements the
alloation
s
sine players never have any hoies to make ex post. It is inentive ompatible so
it will be implementable if there is a type ontingent punishment that the row player an impose
8
that makes it unprotable for the olumn player to try to exploit this type information.
evidently the punishment
p1 (t1 ) = {a2 }
and
p1 (t2 ) = {a1 },
This is
sine this holds the olumn player's
payo to zero no matter what he does.
It might help at this point to desribe the way the ontrat equilibrium works in this example.
The informed player writes a dierent 'reiproal' ontrat for eah of his possible types. These
ontrats both speify the same target Godel ode, say
the Godel ode of the uninformed player's ontrat is
ation
a1 .
n
∗
n∗ .
The ontrat for type
t1
says that if
, then the informed player will ommit to
If the Godel ode of the uninformed player's ontrat is anything else, then the informed
player of type
t1
will ommit to ation
a2 .
The ontrat for t2 is similar with the ations reversed.
Enoding these ontrats gives a pair of Godel odes, say
m1
and
m2 ,
orresponding to eah of
the informed player's possible ontrats. The uninformed player writes a ontrat that says that
if the Godel ode of the informed player's ontrat is
ode of the informed player's ontrat is
to
{b1 , b2 }
integers
6.5.
m2 ,
m1 ,
then he will ommit to
then he will ommit to
b2 ,
b1 ,
if the Godel
otherwise he will ommit
and hoose among them ex post. The theorem above shows that there is a triple of
∗
(n , r1 , r2 )
suh that the Godel ode of the uninformed player's ontrat is
n∗ .
Example 2: ontrat equilibrium doesn't do as well as a mehanism designer.
In the example just desribed, the ontrat equilibrium supports everything that a mehanism
designer might want to implement. However, as we mentioned in the introdution to this setion,
ontrat equilibrium imposes a restrition on feasible alloations. When a player deides to deviate,
he knows that he will learn something about the types of the other players when he sees their
ontrats. In addition, sine he an ondition his behavior on ontrats, he an make his deviation
depend on this type information.
To illustrate the limitations that this imposes, onsider the following variant of the example
given above.
8
There are again two players eah with two possible ations.
The row player has
Of ourse, the uninformed player must also speify a punishment. For simpliity, we speify it below.
DEFINABLE AND CONTRACTIBLE CONTRACTS
two possible types, either
t1
or
t2 ,
whih are equally likely.
27
The olumn player has no private
information. The payos for eah of the informed player's possible types are given in the following
tables:
b1
b2
b1
a1
3, 3 −1, 4
a2
0, 4 2, −1
and
b2
a1
2, −1 0, 4
a2
−1, 4 3, 3
.
The Bayesian equilibrium of this default game has eah player randomizing with equal probability over eah of his ations no matter what his information. The informed (row) player has payo
5
1 in this equilibrium no matter what his type, while the uninformed olumn player has payo 2 .
The Myerson mehanism designer has no problem implementing the alloation
s (t2 ) = (a2 , b2 ).
s (t1 ) = (a1 , b1 ) and
He does this by inviting the players to partiipate in a mehanism in whih he
asks the row player to report his type. If he reports t1 then he instruts the players to use ations
a1
and
b1 ,
and similarly when type
t2
is reported. By agreeing to partiipate, the players ommit
themselves to follow the mehanism designer's instrution. This is inentive ompatible beause
the row player's payo falls from
3
to
2
if he misreports his type. The alloation is individually
rational in the usual mehanism design sense as long as a refusal to partiipate by either player
results in both players playing the (unique) Bayesian equilibrium of the original game.
This alloation rule annot be implemented as a ontrat equilibrium. Aording to Theorem 3,
to implement it, there must be a type ontingent ommitment that the row player an make, and
some speiation of the row player's ations ex post that hold the olumn players payo below
3
when he simply ommits to hoose his from his possible ations
there is no suh punishment, observe that if
ation
a1
π
b1
and
b2
ex post. To see that
is the probability with whih the row player uses
in the ex post game, then the olumn player's payo when the row player has type 1 is
max [π3 + (1 − π) 4, π4 − (1 − π)] =
max [4 − π, 5π − 1] ≥
19
.
6
The argument is idential when the row player has type 2. No suh punishment exists. Thus by
Theorem 3, there is no ontrat equilibrium that supports this outome.
As before, if there were a ontrat equilibrium that ould support this outome, then the olumn
player has to take an ation that depends on the row player's type. In priniple, he an do this
beause he an ommit himself to an ation that depends on the row player's ontrat, whih in
turn depends on the row player's type. However if the olumn player knows that the ontrat will
reveal the row player's type, then a deviation to a ontrat that simply allows the olumn player
to take his ation ex post has to be protable.
6.6. Payos lie between the Bayesian equilibrium and those implementable by a mehanism designer. This nal example is intended to illustrate a number of things. First, it shows
that the ontrat equilibrium implements stritly more than the Bayesian equilibrium of the default
28
MICHAEL PETERS AND BALÁZS SZENTES
game, but less than what is implementable by a mehanism designer. Seond, it has non-degenerate
ommitment and punishment orrespondenes, both of whih are type dependent.
The example also illustrates how randomization an be inorporated into the nal stage of the
ontrating proess when players hoose ations from their ommitment orrespondenes. We have
ignored randomization in the statement of our main theorem to simplify. This example illustrates
how the extension works.
The row player has three equally likely types supporting payos given in the following tables:
t1
b1
b2
t2
b1
b2
t3
a1
3, 3 −1, 4
a1
2, −1 0, 4
a1
a2
0, 4 2, −1
a2
−1, 4 3, 3
a2
b1
b2
−2, −2 4, −1
0, 4
2, 11
4
The payos in the rst two boxes are the same as they were in the seond example disussed
above.
The unique Bayesian equilibrium for this game has the uninformed player randomizing
equally between
b1
and
b2 .
The informed player randomizes equally when his types are
t2 , but hooses a1 with probability 59 when his type is
t3 .
and
The payo to the informed player is
no matter what his type, while the payo to the uninformed player is
designer an implement the alloation rule
t1
2.
1
A Myerson mehanism
s (t1 ) = (a1 , b1 ), s (t2 ) = s (t3 ) = (a2 , b2 )
exatly as he
does in the rst example, by asking the informed player his type, then telling both players what
ations to take. This alloation an't be supported as a ontrat equilibrium. The argument is
exatly as in the seond example above, sine the ontrat equilibrium has to enfore dierent
ations when the row players types are
t1
and
t2 .
However, the alloation in whih both players randomize equally between their ations when
the row player has type 1 or type 2, while the ations
a2
b2
and
are taken when the row player
has type 3 an be supported as a ontrat equilibrium. This alloation is measurable with respet
to the information partition
{{t1 , t2 } , {t3 }}
so the ommitment and punishment orrespondenes
an depend on the row player's type. In partiular, the ommitment orrespondene we want is
r1 (t1 ) = r1 (t2 ) = {a1 , b1 }, r1 (t3 ) = {a2 },
while
r2 (t1 ) = r2 (t2 ) = {b1 , b2 } and r2 (t3 ) = {b2 }.
punishment orrespondene for the row player is again multi-valued
while
p1 (t3 ) = {a1 }.
The olumn player punishes with
{b1 , b2 }
The
p1 (t1 ) = p1 (t2 ) = {a1 , a2 },
for all deviations.
The behavior to be supported involves randomization among the hoies in the ommitment
orrespondene. This an be inorporated in a straightforward way by requiring that the mappings
s (t)
si (t−i )
and
have their range in the set of mixtures over ations whose support lies within the
appropriate ommitment orrespondene.
si (t3 ) = {0, 1}
for
i = c, r.
So in the example,
eah
f,
and
s (f, t3 ) = {1, 0}
{{t1 , t2 } , {t3 }}.
1
2, 2
= si (t2 )
Let
sc (f, t1 ) = sc (f, t2 ) =
as is required by the punishment orrespondene.
is a type ontingent ommitment
partition
1
while
The behavior during the punishment phase is dened similarly.
Consider the ase where the olumn player deviates.
c
si (t1 ) =
f
1
1
2, 2
for
A deviation
that has to be measurable with respet to the information
As an example, take
f (t1 ) = {b1 , b2 } = f (t3 )
while
f (t3 ) = {b1 }.
The
DEFINABLE AND CONTRACTIBLE CONTRACTS
29
punishment has to make this and all other measurable type ontingent ommitments unprotable
given the olumn player's interim beliefs. It is straightforward to hek that it aomplishes this.
One way to implement this in a ontrat equilibrium is to have the row players types
both oer the same ontrat whih ommits them to
oers. When the row player has type
t3
{a1 , a2 } whatever ontrat the
he oers a ontrat that ommits him to
n∗c ,
a2
t1
and
t2
olumn player
if the olumn
a1 against
∗
any other Godel ode. The Godel ode of this ontrat is, say n3 . The olumn player oers a
∗
ontrat that ommits to b2 if the Godel ode of the row player's ontrat is n3 , but ommits to
player oers a ontrat whose Godel ode is equal to some target
{b1 , b2 }
but ommits to
against any other ontrat.
7.
Independent Private Values
An environment that is of some interest in appliations is the independent private value environment. The lassi rst or seond prie aution models are typial examples. However, the
tratability of suh models makes them popular. For the independent private value environment
we an use our theorem to provide something that looks like a folk theorem for Bayesian equilibrium. For our purposes, this 'folk theorem' is interesting beause it suggests an environment where
ontrats an be used to fully deentralize the mehanism designer's problem.
Players have
private values
if
ui (a, (ti , t−i )) = ui a, ti , t′−i
(players' payos don't depend on other players' types).
E (f (t−i ) : ti ) = E (f (t−i ) :
for all
a ∈ A, t−i , t′−i ∈ T−i
Types are independently distributed if
t′i ) for every i, ti , t′i and every integrable funtion
f.
Theorem 2. Let s : T → A be an alloation rule that is implementable by a entralized mehanism
designer in an independent private value environment. Then the alloation s an be supported as
a Bayesian equilibrium in ontratable ontrats.
Proof.
An alloation rule is implementable by a mehanism designer if there is a olletion of
punishments
si : T−i → A−i ,
where
sij
is the punishment partiipants will impose on player
i
if he
′
hooses not to partiipate suh that for eah ti and ti
Et−i
and
Et−i (ui (s (t) , t) : ti ) ≥
ui ai , s−i t′i , t′−i , ti , t′−i : ti ;
Et−i (ui (s (t) , t) : ti ) ≥
max Et−i ui ai , si (t−i ) , (ti , t−i ) : ti .
ai ∈Ai
We prove the theorem by onstruting the various omponents required by Theorem 3.
Begin with the punishment
si (·).
We have from the private value and independene assumption
max Et−i ui
ai ∈Ai
ai , si (t−i ) , (ti , t−i ) : ti =
max Et−i ui
ai ∈Ai
ai , si (t−i ) , ti .
30
MICHAEL PETERS AND BALÁZS SZENTES
Let
g̃
i
be the distribution on
A−i
indued by the funtion
si
and the distribution of t−i and dene
the punishment
s̃i (f, t−i ) = g̃ i
for eah
t−i
and every orrespondene
f : T−i → Ãi
that is measurable with respet to full
information (a set whih ontains all orrespondenes whih are measurable with respet to any
information struture). Then we have
Et−i (ui (s (t) , t) : ti ) ≥
max Et−i ui ai , s̃i (fai , t−i ) , ti =
ai ∈Ai
max E ui
ai ∈Ai
where
fai (t−i ) = {ai } ∀t−i ∈ T−i .
depend on
t−i .
ai , g̃ i , ti
The important aspet of this punishment is that it does not
f
Let τ i be the full information partition of
Ti
with
τf =
Q
i
τ fi .
Dene the
i
ommitment orrespondene ri (t) = {si (t)} and the punishment orrespondene pj (t) = Aj .
f
These orrespondenes are both trivially measurable with respet to τ . Furthermore, (5.1) holds
trivially sine the alloation must be inentive ompatible, and
any ommitment orrespondene
Et−i
f
ri
is always a singleton. Now for
measurable with respet to the full information partition
max Et′−i ui a, s̃
a∈fi (t−i )
i
fi , t′−i
max E ui
a∈Ai
, ti :
t′−i
a, g̃ i , ti
∈ τ −i (t−i ) : ti
τ f−i ,
≤
≤
Et−i (ui (s (t) , t) : ti ) .
So (5.2) is satised. Then by Theorem 3, the alloation
s is supportable as a ontrat equilibrium.
From the proof above, it should be apparent that what makes the theorem work is the fat that
the punishment that the mehanism designer uses to enfore partiipation has the same impat on
the non-partiipant no matter what he learns about the partiipants' types. This is a onsequene
of the private value assumption. Some interdependent value problems will also have this property.
For example, in a trading problem, not being able to trade may be worse for a player than trading
no matter what he learns about the types of the others. Similar folk theorems are possible in suh
environments.
8.
Conlusion
This paper shows how the ontrats on ontrats approah an be extended to environments
with inomplete information by restriting players to use denable ontrats. Denable ontrats
onstitute the largest lass of arithmeti ontrats whih an be written as a nite text in a rst
order language. In this sense denable ontrats embed most other interesting lasses of feasible
ontrats as subsets.
DEFINABLE AND CONTRACTIBLE CONTRACTS
31
In ontrast to the omplete information ase, we show that the 'folk theorem' doesn't generally
hold.
A entralized mehanism designer an implement alloations that an't be supported as
equilibrium with ontratible ontrats. This limitation is not a onsequene of the set of feasible
ontrats, but rather of the fat that publi ontrats reveal information about non-deviators'
type.
The restrition to denable ontrats allows us to provide a omplete haraterization of
equilibrium and to prove this result. One of the results we provide as part of our main theorem
illustrates the role that punishments play in a stati ontrating environment.
Referenes
[1℄ Anderlini, L: Some Notes on Churh's Thesis and on the Theory of Games.Theory
and Deision, 29, 19-52,
1990.
[2℄ Kyle Bagwell and Robert W. Staiger. Reiproity, non-disrimination and preferential agreements in the multilateral trading system.
European Journal of Politial Eonomy, 17(2):281325, June 2001.
[3℄ Canning, D: Rationality, Computability, and Nash Equilibrium.Eonometria, 60(4), pp. 877-888, 1992.
[4℄ L. Epstein and M. Peters. A revelation priniple for ompeting mehanisms.
88(1):119161, September 1999.
[5℄ C Fershtman and K.L. Judd. Equilibrium inentives in oligopoly.
Journal of Eonomi Theory,
Amerian Eonomi Review,
77:927940,
1987.
[6℄ Seungjin Han. Menu theorems for bilateral ontrating. Journal of Eonomi Theory, 127(1):157178, November
2006. available at http://ideas.repe.org/a/eee/jetheo/v131y2006i1p157-178.html.
[7℄ Kalai, A.T., Kalai, E., Lehrer, E., and D. Samet. A Commitment Folk Theorem. manusript, Tel-Aviv University, April, 2007.
[8℄ Mihael Katz. Observable ontrats as ommitments: Interdependent ontrats and moral hazard.
Eonomis and Management Strategy, 15(3):685706, September 2006.
Journal of
[9℄ David Martimort and Lars Stole. Communiations spaes, equilibria sets and the revelation priniple under
ommon ageny. University of Chiago unublished manusript, 1998.
[10℄ Mihael Peters. Common ageny and the revelation priniple. Eonometria, 69(5):13491372, September 2001.
Pratises that Credibly Failitate Ologopoly Coordination. Cambridge, MIT Press, 1986.
Leo Simon and W. Zame. Disontinuous games and endogenous sharing rules. Eonometria, 58(4):861872,
[11℄ S.C. Salop.
[12℄
1990.
[13℄ Moshe Tennenholz. Program Equilibrium.
Games and Eonomi Behavior,49(2):363373,2004.
[14℄ Takuro Yamashita. A revelation priniple and a folk theorem without repetition in games with multiple priniples and agents. manusript, Stanford University, February 2007.
Download