Copyright (C) 2011 David K. Levine This document is an open textbook; you can redistribute it and/or modify it under the terms of version 1 of the open text license amendment to version 2 of the GNU General Public License. The open text license amendment is published by Michele Boldrin et al at http://levine.sscnet.ucla.edu/general/gpl.htm; the GPL is published by the Free Software Foundation at http://www.gnu.org/copyleft/gpl.html. If you prefer you may use the Creative Commons attribution license http://creativecommons.org/licenses/by/2.0/ Period Length in LR-SR Games payoff matrix Player 2 L Player 1 +1 - x ,0 R 1,1 -1 - x ,0 2,-1 player 2 plays L in every Nash equilibrium player 1’s static Nash equilibrium payoff is - x also the minmax payoff for player 1 1 end of each stage game, public signal z Î ¡ depends only on action taken by player 1; probability distribution of the public signal is F (z | a1 ) F either differentiable and strictly increasing, or corresponds to a discrete random variable f (z | a1 ) denote the density function monotone likelihood ratio condition f (z | a1 = + 1) / f (z | a1 = - 1) nondecreasing in z player 2’s action is publicly observed availability of a public randomization device 2 t is length of the period t player 1 a long-run player with discount factor d = 1 - r t faces infinite series of short-run opponents most favorable perfect public equilibrium for LR characterized by the largest value v that satisfies the incentive constraints v = (1 - d)1 + dò w(z ) f (z | a1 = + 1)dz v = (1 - d)2 + dò w(z ) f (z | a1 = - 1)dz v ³ w (z ) ³ - x or v = 0 if no solution exists second incentive constraint must hold with equality, since otherwise it would be possible to increase the punishment payoff w while maintaining incentive compatibility 3 from the monotone likelihood ratio condition, iif there is a solution, w(z ) it must have cut-point form ìï v z ³ z * . w(z ) = ïí ïï w * z < z * î 4 Proof: Any w that is part of a solution to maximizing v subject to the incentive constraints must be a solution, for some constant c to max ò w(z ) f (z | a1 = + 1)dz subject to max ò w(z ) f (z | a1 = - 1)dz £ c Let l be the Lagrange multiplier for this constraint; the first order condition is f (z | a1 = + 1) = l f (z | a1 = - 1) , or f (z | a1 = + 1) = l f (z | a1 = - 1) The monotone likelihood ratio condition implies any solution to this has the cutoff form. 5 Define p= z* ò- ¥ f (z | a1 = + 1)dz , q = z* ò- ¥ f (z | a1 = - 1)dz . enables us to characterize most favorable perfect public equilibrium for LR as largest value v that satisfies incentive constraints v = (1 - d)1 + d(v - p(v - w )) v = (1 - d)2 + d(v - q(v - w )) v ³ w³ -x or v = 0 if no solution exists. linear programming problem characterizing the most favorable equilibrium has solution if and only if (*) r £ 1 t + ((1 + x ) [(q - p) / t ] - [ p / t ])- 1 in which case the solution is given by 6 (**) v * = 1 - p / (q - p) . 7 The Poisson Case [Abreu, Milgrom and Pearce, 1991] signal generated by underlying Poisson process in continuous time Poisson arrival rate l p if the action taken by LR is +1 and l q > l action taken by LR is –1 p if the Poisson signal is being “bad news” the number of signals received during the previous interval of length t . cutoff point is how many “bad signals” received before punishment v - w is triggered period length t is short, can ignore possibility of more than one signal, so the cutoff is necessarily a single signal approximation: the probability triggering punishment p = l p t , q = l q t . 8 in (**) v* = lp lq - l . p in (*) r £ (1 + x )(l q - l p ) - l p. independent of the period length if satisfied, best equilibrium payoff is v * , independent of period length and worst possible punishment x . 9 The Diffusion Case [Faingold and Sanikov [2005] signaled generated by underlying diffusion process in continuous time drift in process given by the long-run player’s action (a1 = +1 or –1) and the instantaneous variance of the diffusion s 2 random variable z is increment to the diffusion process during the previous interval of length t if the long-run player’s action is a1 signal is normally distributed with mean a1t and variance s 2 t . 10 allow the variance of the signal z to be given by s 2 t 2a where a < 1 diffusion case corresponds to a = 1/ 2 if F is the standard normal cumulative distribution æz * - t p = Fç ç è st a æz * + t q = Fç ç è st a ö ÷ ÷ ø ö ÷ ÷ ø 11 fix x and consider best equilibrium for player 1 as t ® 0 Proposition 1: For any a < 1 there exists t > 0 such that for 0 < t < t (*) has no solution. 12 Proof: z * ( t ) cutoff when period length is t work with normalized cutoff z( t ) = t - z * (t ) st a So: p = F (- z( t ) ) æ2t 1q = Fç ç ç è s a ö. ÷ - z( t ) ÷ ÷ ø if not (*) not violate then (q - p) / t and p / t both remain bounded away from zero as t ® 0 show that if (q - p) / t remains bounded away from zero, then necessarily p / t ® ¥ 13 compute (q - p) / t using the mean value theorem to observe that for each t there is a number f (t ) , 0 £ f ( t ) £ 1 , such that q- p = t = æ2t 1Fç ç ç è s 1 st a ( ) ö - z( t ) ÷ - F (- z( t )) ÷ ÷ ø t 1- a ö æ 2 t ÷ f ç - z( t ) + f ( t ) ÷ ç ÷ ç s ø è a Let c(t ) = (q - p) / t , we can invert this relationship to find 14 1æ 2 t f ç - z( t ) + f ( t ) ç ç s è (***) a ö ÷ = c( t )st a ÷ ÷ ø 11 1æ 2 t exp(- ç - z( t ) + f ( t ) ç 2ç s 2p è 1æ 2 t ç - z( t ) + f ( t ) ç ç s è z( t ) = - a ö2 ÷ ) = c( t )st a ÷ ÷ ø a ö2 ÷ = - 2 log( 2pc( t )s ) - 2a log( t ) ÷ ÷ ø 2t 1- 2 log( 2pc( t )s ) - 2a log( t ) + f ( t ) s a 15 want to show p / t ® ¥ c( t ) is bounded away from zero so z(t ) £ - log b - 2a log t + a t 1- a and p / t ³ F(- - log b - 2a log t - at 1- a ) / t by mean value theorem for some t ' Î [0, t ] 16 F (- - log b - 2a log t - a t 1- a ) / t æ ö a - a÷ 1- a = ç a (1 a )( t ') f ( log b 2 a log t ' a ( t ') ) ÷ ç ÷ ç èt ' - log b - 2a log t ') ø ö 1 æ a - a÷ ç = a (1 a )( t ') ÷ ÷ ç 2p ç èt ' - log b - 2a log t ') ø ´ exp(- 1/ 2) (- log b - 2a log t '+ 2a( t ')1- a - log b - 2a log t ' + a 2( t ')2- 2a ) ö b1/ 2 æ a - a÷ ç = a (1 a )( t ') ÷ ÷ ç 2p ç èt ' - log b - 2a log t ') ø ´ ( t ')a exp(- a( t ')1- a - log b - 2a log t ') exp(- 1/ 2) (a 2( t ')2- 2a ) a- 1 ö ab1/ 2 æ ( t ') - 2a log t ' ÷ ç ® exp( a ) ÷ ç a- 1 ÷ 2p ç ( t ') è - 2a log t ') ø 17 apply L’Hopital’s rule to show that t a- 1 t a- 2 ® (1 - a ) = 1 - a )t a - 1 ® ¥ 1/ t - log t . 18 recall that v * = 1 - p / (q - p) consider for fixed t taking very small cutoff z * ® - ¥ as in Mirlees [????] causes likelihood ratio q / p ® ¥ so p / (q - p) ® 0 ; that is v * ® 1, the pure commitment value, and the best PPE payoff for the most the long-run player can get when there is no noise in the signal this solution has p, q ® 0 , which means that for fixed x and t and z * sufficiently negative, (*) must be violated however, for any choice of z *, r , t , there is always an x sufficiently large that (*) holds the diffusion case the worst punishment determines the best equilibrium going far enough into the tail of the normal, arbitrarily reliable information is available and can be used to create incentives, provided sufficiently harsh punishment is available 19