Slides - Theoretical REsearch in Neuroeconomic Decision

advertisement
Theoretical REsearch in Neuroeconomic Decision-making
(www.neuroeconomictheory.org)
From perception to action:
an economic model of brain processes
Isabelle Brocas
USC and CEPR
Juan D. Carrillo
USC and CEPR
What is “Neuroeconomic Theory”?
Use evidence from neuroscience to revisit economic theories of
decision-making
Neuroscience evidence includes:
Multiple systems in
the brain
Interactions between
systems
Physiological
constraints
What is “Neuroeconomic Theory”?
Revisiting theories of decision-making includes:
• Revisit the individual decision-making paradigm:
not decision theory but game-theory approach
• Provide “micro-microfoundations” for preferences, i.e. elements
traditionally considered as exogenous (discounting, risk-aversion, etc.)
• Provide foundations for processes traditionally taken for granted
(learning, information processing, etc.)
• Understand intra-personal conflicts and behavioral “biases”
This paper
Build a brain-based model of information processing using evidence from
Neurobiology:
i.
Neurons carry information from sensory circuitry to decision-making
circuitry through a cell firing process
ii. There is stochastic variability in neuronal cell firing
iii. Thresholds of neuronal activity trigger actions (economical
information processing technology)
iv. Thresholds can be modified (learning, adaptation)
A “simple” problem of information processing!
An illustration
• Today is a dangerous day (state A) or a safe day (state B)
• Individual collects some imperfect information before making decision:
he takes a look out of the cave
• Individual decides to stay in the cave (action a) or go hunting (action b)
• Individual obtains a payoff (catches animal, killed by predator, starves)
sensory system  decision system  motor system
OVERVIEW
•
•
•
•
•
A two-actions model
More complex environments
Behavior implications
Relation to neuroscience theories
Conclusions
A two-actions model
(most interesting for neuroeconomics, least interesting for economic theory)
The sensory system
There are three main elements: environment, preferences, information
1. Environment.
- Set of states of the world: S={A,B} with element s
- Set of possible actions:
={a,b} with element 
2. Preferences over outcomes.
Represented by loss/utility function l(|s).
Payoff maximized if a in A and if b in B.
The complete description is: L = {l(a|A),l(b|A),l(a|B),l(b|B)}
3. Information structure. It consists of:
- Prior belief : assessment of likelihood of states without information
(memories, encoded values in cells).
Pr(A) = p0 and Pr(B) = 1 - p0
- Information from outside world: encoded and translated into
neuronal activity c  [0,1] which is an indicator of the state from
c = 0 (no perceived danger) to c = 1 (highest perceived danger)
There is stochastic variability in neuronal cell firing (cell activity
varies, competition between neurons, metabolic costs, noise,
circumstances)
Perception is correlated with true state
Pr(c | A)  f A (c) and Pr(c | B)  f B (c) with
f A (c)  f B (1  c) (symmetry)
Pr(c)
fB(c)
(stochastic) low
cell firing if S=B
0
Example:
d  f B (c ) 

  0 (MLRP)
dc  f A (c) 
fA(c)
1/2
(stochastic) high
cell firing if S=A
1
c
c = proportion of neurons detecting a danger
c is high in “dangerous” days, c is low in “safe” days
The decision system
A decision is a mapping from (L, p0, c) into 
Objective of the brain: conditional on L and p0, determine under which
circumstances c should trigger action a or action b.
Premise of the model: process must be economical and compatible with
existing evidence of neuronal functioning.
• Changes in beliefs and magnitude of payoffs are correlated with activity
and choice  support for maximization of expected payoff
• Neurons perform statistical inferences closely based on information
received  support for Bayesian updating
• Neuronal activity: neuronal thresholds (high or low), synaptic
connections (weak or strong). Information is interpreted to trigger
decision  support for decision-threshold, i.e. a mechanism like
threshold x such that one action if c < x and another action if c > x
Note. This process is “economical” (some information is filtered out)
• Thresholds are modulated  support for (as if) optimization process
(see paper for discussion of neuroscience and neurobiology literature)
Summary of timing
Two-alternative
task (6)
prior p0
updated p1
S  {A,B}
threshold
c >
<x
action
nature
x
cell firing
  {a,b}
l (|S)
payoff
Assumption: decision threshold is set optimally
Decision threshold x maximizes expected payoff given Bayesian
updating and neuronal constraints (stochastic variability + inability to
process all information)
Optimal threshold
• At stage 2, the posterior is pH1  Pr(A | p0, c > x) or pL1  Pr(A | p0, c < x)
• Given Bayesian updating, it is immediate that for any x :
pH1 (x) > p0 > pL1 (x)
that is, c > x is evidence of state A and c < x is evidence of state B
• Therefore, optimal threshold x* such that  = a if c > x and  = b if c < x.
It solves:
max V(x) = Pr(c > x )U(a, pH1(x)) + Pr(c < x )U(b, pL1(x))
• Let l(a|A) = A, l(b|B) = B, l(a|B) = l(b|A) = 0, and  = A / B
Proposition 1. dx*/dp0 < 0 and dx*/d < 0
dx*/dp0 < 0
Tradeoff between likelihood and impact of information
Suppose that prior strongly favors state B (p0 is “small”):
 Only strong evidence of A convinces subject to choose  = a.
 Strong evidence means a high threshold must be surpassed.
 A high threshold means a low prob. (whether true state is A or B).
(there is an analogous result in Theory of Organizations literature)
dx*/d < 0
Tradeoff between likelihood and impact of mistakes
Suppose that correct choice in B has higher potential (B is large):
 Only strong evidence of A convinces subject to choose  = a.
… same logic as before
Property of the optimal threshold
Let p(̃ c) = Pr(A|c). The optimal threshold x* is such that:
U(a, p(̃ x*)) = U(b, p(̃ x*))
Therefore: U(a, p(̃ c)) > U(b, p(̃ c)) for all c > x*
U(a, p(̃ c)) < U(b, p(̃ c)) for all c < x*
 For the purpose of choice, it is equivalent to observe exact c or
only whether c >
< x*
 Threshold is economical and fully efficient when there are only
two possible actions (not robust to more complex situations)
More complex environments
(robustness analysis)
Definitions
Note: A decision threshold discriminates between two actions.
We make a distinction between
• Cognitive processes: discriminates between all relevant actions
(involves several thresholds if more than two actions)
• Affective processes : neglects some relevant actions (involves less
thresholds than necessary)
Continuum of actions
Loss function l(s-1) if S=A and l(s) if S=B
Example: A = danger, B = no danger
Choice = hunt at distance 1-s from the cave
a. l(z) linear or convex. Optimal solution is corner: * = 0 or * =1
 identical analysis as before
b. l(z) quadratic. Departures are increasingly costly. There is one optimal
action *(p1 ) for each posterior.
 Cognitive process: requires a continuum of thresholds.
 Affective process setting one threshold: x affects posteriors (pH1, pL1).
 Optimal x* minimizes expected error.
Proposition 2. Under regularity conditions:
(i) x* has same qualitative properties: dx*/dp0 < 0 and dx*/d < 0
(ii) There is a utility loss of not observing the exact cell firing c.
Dynamic information acquisition
Individual takes a second look before selecting an action:
- Neuronal activation threshold x
Brain learns if x is surpassed or not. Beliefs updated to p1
- Neuronal activation threshold y
Brain learns if y is surpassed or not. Beliefs updated to p2
Proposition 3. Under regularity conditions, beliefs are again more likely
to be reinforced. For linear and symmetric densities functions:
y*(p) < x*(p) <1/2 if p > 1/2
1/2 < x*(p) < y*(p) if p < 1/2
Intuition.
At stage 1, information acquisition is more important than knowing if p1 is
greater or smaller than 1/2. Thus, weaker 1st period modulation
 Snowball effect: threshold modulation exacerbated in dynamic settings
 There is a utility loss of not observing the exact cell firing c.
Continuum of states
State S  [0,1] with Pr(S) = pS
2 actions,   {a,b}
Payoff is l(1 - S) if s=a and l(- S) if s=b
Example:
S = intensity of danger
a = stay , b = hunt
(assume continuous version of MLRP)
Proposition 4. Same qualitative properties as before:
if weight on dangerous (high) states increases, threshold decreases;
undesirable outcomes are likely to be avoided.
Conclusions regarding threshold modulation are robust to:
• Dynamic choices
• Increased action space
• Increased state space
However, there is a utility loss in using this economical mechanism
Behavioral implications
Implication 1. Belief anchoring and first impressions
Existing beliefs are more likely to be endorsed and less likely to be
refuted than opposite beliefs.
If state A becomes more likely (p0 increases), threshold x* decreases, so
more likely to be surpassed.
The sequence in which signals are received affects beliefs and actions
Rational Stubbornness:
- Why people develop habits that are difficult to change
- Why individual are less likely to change their mind with age
Implication 2. Polarization of opinions
Individuals with different priors who are exposed to the same evidence and
are subject to the same cell firing may update beliefs in opposite directions
Suppose pi0 > pj0. By proposition 1, xi*< xj*. If c  [xi*, xj*], then
pi1 > pi0 and pj1 < pj0.
The individual with stronger conviction that predators are present will
interpret a mixed signal as evidence of danger whereas the other will
interpret the same signal as evidence of no danger.
Implication 3. Preferences shape beliefs
Individuals with identical priors, exposed to the same evidence, and
subject to the same cell firing may end up with different posteriors due
exclusively to differences in their marginal preferences over outcomes
Suppose iA / iB > jA / jB. By proposition 1, xi*< xj*. If c  [xi*, xj*],
then (i*=a, j*=b) and pi1 > p0 > pj1
The individual with higher utility of catching a prey may go hunting while
the other stays in the cave. In that case, the former will also believe that
there is less danger than the latter.
Implication 4. Probability functions
• An optimal decision threshold mechanism generates payoff dependent
posterior beliefs.
• The individual is best represented as one entity with utility function
∑ Ps(πA, πB) πsl(γ|s)
Implication 5. Elimination
• The number of relevant actions determine the number of thresholds
necessary to achieve efficient decision making
• Affective and cognitive channels:
- 3 states: S  {A,B,C}, and 3 actions:   {a,b,c}
- Payoff 1 if a when A or if b when B or if c when C
Affective process: sets one threshold x
Cognitive process: sets two thresholds x1 and x2
The affective channel optimally discriminates between the two actions
that are most likely to occur and ignores the third one. The loss is
therefore smallest when one state is highly unlikely
Relation to neuroscience theories
The Somatic Marker Hypothesis
(Damasio ’94)
a. “Pre-existing somatic states influence the threshold of neuronal cell
firing [...] Pre-existing positive states reinforce positive states, but
they may impede negative ones” (Bechara-Damasio, GEB 2005)
Emotions regulate neuronal activity by affecting thresholds in a
precise way: existing beliefs are likely to be supported
b. “These somatic states are indeed beneficial, because they consciously
or non-consciously bias the decision in an advantageous manner”
This threshold modulation improves decision-making
SMH provides no explanation why such a “bias” is “advantageous”
Our paper formalizes and proves their claim: if emotions are responsible
for threshold modulation in that particular way, then it is true that
emotions help decision-making.
Cognitive control
(Miller and Cohen ’01)
•
•
•
The function of the PFC is to override automatic responses to shape
behavior on the basis of plans or intentions (e.g. take best action
given preferences and information)
The flow of neural activity is guided along pathways that establish
correct mappings between inputs, internal states and eventual actions
(e.g. represented by decision threshold)
Some features are retained, others are neglected (e.g. threshold acts
as a filter of information).
Conclusions
•
A theory of the brain as a (constrained) optimal processor of information
(needless to say, it is an “as if” approach).
•
The theory relates the ability to make correct choices to primitives. It
determines why and when this economical process implies an inefficiency.
•
The theory has behavioral implications:
The existence of a confirmatory bias and belief anchoring mechanism
The possibility to generate polarization of opinions
The active role of preferences on formation of beliefs (agree to
disagree)
The relationship to non expected utility theory
The fact that options in large choice sets are eliminated
•
Implications of the theory are consistent with findings in neuroscience.
Some testable predictions can help design new experiments.
Download