Computational Approaches to Economic Valuation & Strategy Choice Colin Camerer Antonio Rangel Caltech Outline • • • • Brief history of the role of computation in Economics Models of valuation and simple choice (Rangel) Models of strategic choice and learning in games (Camerer) Computational issues at different levels: individuals, firms, markets (Camerer) • Future directions of research I Brief history of the role of computation in Economics Computation is at the heart of economic problems Consider some typical problems: •Individual: What snack should I pick out of the buffet table? •Individual: Optimal investment portfolio? •Firm: Price setting and production selection problem •Market system: price formation Thus, one would expect computational based models of decisionmaking to be common in Economics This is not the case: Traditional deliberate ignorance of computational detail • The triumph of “as if” modelling (economic behaviorism) -- Pareto (1987): “Pure political economy has therefore a great interest in relying as little as possible on the domain of psychology” -- Friedman (1953) Test predictions of theory rather than realism of assumptionsà can ignore computational detail • Fictional stand-ins for computation - Walrasian auctioneer - equilibrium in games • Underlying computational processes are modeled in REDUCED FORM Traditional view (cte.) • Axioms are considered primitives (logic vs biology as constraint on choices) • A developed preference for general mathematical proof over simulation – Study of “procedural rationality” algorithms (Simon) did not gain traction – Distrust of complicated many-equation macroeconomic models & simulations – Little post-1990 taste for SFI agent-based modeling Neuroeconomics: neurobiologically based computational models of decision-making Goals of Neuroeconomics: 1. What computations are carried out by the brain to make different types of economics decisions? 2. How are these computations implemented by the brain? 3. What are the implications of this knowledge for economics, finance, education, AI, marketing, … ? Computation is at the core of Neuroeconomics BUSINESS APPLICATIONS JUGDMENT & DM THERAPEUTIC APPLICATIONS ECONOMIC APPLICATIONS COMPUTAT. MODELS NEUROSCIENCE A.I. PSYCHOLOGY II Neuroeconomic Models of Valuation and Simple Choice Example I: Reward Prediction Learning 5 8 9 10 11 Reward wait 7 … Cue 6 Reward wait Reward 3 4 trial 3 Cue Time: 1 2 trial 2 wait Cue Event: trial 1 12 …. • Brain’s problem: learn to predict size & timing of rewards that follow each type of cue • Temporal-difference learning algorithms have been designed in CS to solve this problem (Sutton & Barto (1998)) How can the brain learn the reward function? Notation: - True value of state s: mean r(s) - pt(s) = computed predicted value at beginning of triat t (= brain’s best guess about the state’s true value) - t(s) = r(s) - pt(s) = error signal in trial t This error term is extremely important: it serves as THE teaching signal! Learning Algorithm Step 1. Arbitrarily initialize the decision values p1(s) for all s Step 2. Every trial t: -- begin with pt(s) -- measure actual reward -- Compute error (t) -- Update the DV for a and c active in trial as follows: pt+1(s) = pt(s) + (t) where -> (0,1) is a learning rate • Under very general conditions, E(pt(a|c)) -> E(r(s)) for all s How well do TD algorithms describe brain’s reward learning? Cue TD-Errors Before Learning TD-Errors During Learning TD-Errors After Learning if Unexpected Omission of reward Reward Can we find evidence of TD-error signals in monkeys’ brains? Single unit recordings from VTA dopamine neurons revealed that these neurons produce responses consistent with TD - learning: Schultz [1998] What brain areas show activation that correlates w/ TD-error signals in humans? p<<0.001 +3 R -3 -6 -30 +54 From O’Doherty et.al. [2003] at time of CS +6 CS+ trials Example II: Role of Visual Attention in Simple Choice ? Model: Three Parallel Processes • e=time elapsed since beginning of choice trial Visual attention DVs computation Comparator g(e)= L,R dL(e), dR(e) Choose L,R,or wait Visual attention switch g(e) g(e) DV Computation Comparator dL(e), dR(e) choose g(e) or switch Visual attention process First fixation: Stochastic bottom-up process •P0 = Prob first fixation to L •Exponential latency: Pr(First fixation begins at t)= 1- B.e- t Subsequent fixations: top-down process •Follow the commands of the comparator process Value construction process v+ 0 vt Comparator process: • During each fixation, the comparator either chooses g(e) or sends a signal to the visual system to switch gaze • Length of each fixation stochastic: -- d = duration of current fixation -- Pr(comparator evaluates at d)= 1- A.e- d • Decision made as follows: -- rx(e) = d(tx(e)) - d(ty(e)) • -- Choose g(e) with probability -- Wait (and switch fixation) with prob • Always switch after first fixation Model predictions • Behavioral: S-shaped choice probabilities • Process: RTs and #saccades increase with choice difficulty • Performance: - Importance of first fixation: P(choice=best|fist-fixation=best)> P(choice=best|fistfixation=worse) - First look bias: for items with similar value P(choice=L|fist-fixation=L)> P(choice=L|fist-fixation=R) -… Test Enforce 2000 ms fixation + Present until a choice is made 1000 ms + + Collect eye-fixations @ 50 Hz Results QuickTime™ and a TIFF (PackBits) decompressor are needed to see this picture. QuickTime™ and a TIFF (LZW) decompressor are needed to see this picture. QuickTime™ and a TIFF (LZW) decompressor are needed to see this picture. QuickTime™ and a TIFF (LZW) decompressor are needed to see this picture. Summary • Computation is at the core of the nascent field of Neuroeconomics • Goal is to (1) describe the computation and processes that the brain uses to make decisions and (2) establish their neural basis • Test the computational processes directly using modern neuroscience and psychology tools -- from fMRI to eye tracking • Feasibility of the research agenda has already been proven • Novel insights into DM are already being generated by this class of models. III Models of Strategic Choice & Learning in Games Some theoretical interest in computational models • Finite-state automata (Rubinstein, Neyman, et. al.) • Computational complexity (Gilboa-Zemel on NP-hard games) • Not linked to data or practical problems Cognitive hierarchy models of limited strategic thinking • Selten (1998): – “The natural way of looking at game situations…is not based on circular concepts, but rather on a step-by-step reasoning procedure” • Cognitive hierarchy – “Level 0” use a heuristic (e.g. randomize) – “Level k” best-respond to choices of level 0-(k-1) – Axiom f(k)/f(k-1) 1/k (k-th step increasingly difficult) f(k)=e-ttk/k! (Poisson) – Limit as t often converges to equilibrium – Simpler than equilibrium in some ways easier to compute predictions no problem of multiple equilibria Limited planning ahead in bargaining (Science, 03) 3-stage bargaining 1: $5 p1 offers 2: $2.50 p2 offers 3: $1.25 p1 offers (0,0) if rejected E.g. “P-beauty contest” (Ho et al AER 98) pick x in [0,100], x closest to (2/3) of average wins 1 0.8 equilibrium=0 0.7 data 0.6 CH prediction (τ=1.5) 0.5 0.4 0.3 0.2 0.1 number choice 95 85 75 65 55 45 35 25 15 0 5 relative frequency 0.9 “Choosing” computations are different than “belief formation” computations Bhatt-Camerer GEB 2005 Field application: “Cold opening” of movies (unavailable to critics for Friday review) Studios do not let worst movies get reviewed… ”cold” opening increases box office EWA learning in games: Generalized reinforcement • Reinforcement, fictitious play linked (Econometrica 99) • Update attractions to strategy j from payoff A ij (t) - A ij (t-1) = [*π(s ij,s-i (t)) -A ij (t-1)]/(ϕN(t-1)+1) = prediction error/increasing weight is “imagination” of counterfactual payoffs ϕ is recency weight Typical values: N(0)=1, ϕ=.8, weights go from .56 .20 • Can replace , ϕ with “self-tuning” functions (JET ’07) • Can add “sophistication”– players know others are learning (JET 02, GEB 06) Example: Price matching with loyalty rewards (Capra, Goeree, Gomez, Holt AER ‘99) • Players 1, 2 pick prices [80,200] ¢ Price is P=min(P1,,P2) Low price firm earns P+50 High price firm earns P-50 • What happens? – Theory: competition drives prices to 80 191~200 181~190 171~180 161~170 151~160 141~150 131~140 121~130 111~120 101~110 91~100 81~90 80 1 3 5 Period 7 9 5 9 191~200 181~190 171~180 161~170 151~160 141~150 131~140 121~130 111~120 101~110 91~100 81~90 1 80 3 Period 7 Empirical Frequency 0.9 0.8 0.6 0.7 0.5 0.4 Prob 0.3 0.2 0 0.1 Thinking fEWA Strategy 0.9 0.8 0.7 0.6 0.5 Prob 0.4 0.3 0.2 0.1 0 Strategy IV Computational Issues at different levels: individuals, firms, markets Levels of computational modelling in economics • • Individuals (what you’ve seen) Firms – Firms as hierarchies of imperfectly informed individuals (Radner-Van Zandt) Optimal hierarchies for aggregating formation • Mechanism design – Computability as an individual rationality constraint (Ledyard) • Markets – Markets as computational mechanisms • Computing equilibria (Judd, Kearns et al) – Smart markets: Hybrids of bids and optimal combination (e.g. combinatorial “package auctions” e.g. PCS spectrum) – Information aggregation • Markets ‘compute’ probabilities of events (e.g. prediction markets) Prediction markets • • • • • • • • • • Began with basic research: 20 yrs to wide use Plott and Sunder (1982 Econometrica): Markets for “contingent claims” Pay $1 if an event occurs. Prices reveal probabilities Markets are $-weighted opinion polls of self-selected respondents Iowa Political Markets 1988 (http://www.biz.uiowa.edu/iem/) Markets for political events predict surprisingly accurately Tradesports 2002 (http://www.tradesports.com/) et al Used by some companies, policy markets See Wolfers & Zitzewitz J Econ Perspectives 04 Six hours earlier (9pm EST Oct 26 ‘06): Guess about Karl Rove nonindictment appears in Intrade price drop…36 hrs before Oct 28 Libby indictment ROVE.INDICTED.31DEC ASK BID Price Price 1 25.1 29.9 1 10 25.0 30.0 3 1 24.6 31.9 1 1 24.4 32.0 2 1 24.2 32.7 2 1 23.7 33.0 10 1 23.6 34.9 4 3 23.4 35.0 20 11 23.3 39.0 5 27 23.2 40.0 11 16 23.0 50.0 10 13 22.5 68.8 5 10 22.0 70.0 99 20 21.0 72.0 4 74.9 1 Qty 11 20.0 Qty Google news at 1:46am EST Oct 27: Will Karl Rove be indicted? •Rove critics again turn up the volume New York Daily News - Oct 27 1:18 AM With rampant rumors of a soon-to-drop indictment in Special Counsel Patrick Fitzgerald 's CIA leak investigation, the Karl Rove literary business is booming. •Rove's Last Campaign Washington Post - Oct 26 11:31 AM Will Karl Rove, architect of President Bush's improbable political career, snatch one last victory from the jaws of defeat? (Or at least avoid getting indicted?) Something appears to have provoked special prosecutor Patrick J. Fitzgerald into a last-minute flurry of activity centered............ •Leak Counsel Is Said to Press on Rove's Role New York Times - Oct 25 7:25 PM Three days before the grand jury is set to expire, Patrick Fitzgerald appeared to be trying to determine Karl Rove's role in the outing of a C.I.A.'s officer's identity. •Libby, Rove Await Indictment Decisions By Martin Sieff, UPI Senior News Analyst Washington DC (UPI) Oct 25, 2005 Space War - Oct 26 9:53 PM Washington seethed with rumors and speculation Tuesday night on the eve of the expected announcement of possible indictments in the Valerie Plame CIA leak probe. Current (3/15) “prices” of Scooter Libby pardon Legal - I. Lewis Libby Contract Bid Ask Last Vol Chge I. Lewis (Scooter) Libby Pardon LIBBY.DEC 07.PARDO N I.Lewis Libby to be pardoned by 31 Dec 2007 M 10.0 17.0 9.5 1012 0 LIBBY.EOT .PARDON I.Lewis Libby to be pardoned by the end Mar 16 - 3:18AM GMT of President Bush's term in office M 62.0 63.2 63.2 948 -0.2 V Future of computational models of decision in Economics @ the Individual level • It will look like theoretical neuroscience • Focus on modeling the neural and psychological processes involved in decision-making • Modeling constraints provided by neural, psychological, and behavioral data • Models will be tested with techniques such: - fMRI - electrophysiology - TMS - eyetracing - behavioral predictions @ the firm and market levels • Will build on the properties of the individual level models • Model the interactions of many agents • Goal will be to improve our understanding of: - auctions - price formation in markets - financial markets dynamics - macroeconomic performance and policy