utility-weighted sampling in decisions from experience · utility-weighted sampling in decisions...

Utility-weighted sampling in decisions from experience

Falk Lieder, Tom Griffiths, Ming HsuUC Berkeley

Extreme potential outcomes influence people as if they were far more likely than they really are.

38% of Americans say they are less likely to travel overseas because of 9/11.

Extreme potential outcomes influence people as if they were far more likely than they really are.

38% of Americans say they are less likely to travel overseas because of 9/11.

Expected Utility Theory

Take action

utility ofoutcome O

expected value

Take action

utility ofoutcome O

expected value

Take action

utility ofoutcome O

expected value

Intractable!

Take action

utility ofoutcome O

expected value

Intractable!

Violated!

EU can be Approximated by Sampling

simulated outcomes

EU estimates

simulated outcomes

EU estimates decision

simulated outcomes

EU estimates decision

finite time finitely many simulated outcomes

Representative sampling is dangerous

variance

biasvariance

Utility estimation by importance samplingfu

Utility estimation by importance sampling

simulatedoutcomes

o1, o2, o3 q~

simulatedoutcomes EU estimates

o1, o2, o3 q~

w1=p(o1)/ q(o1)p

o1, o2, o3 q~

w1=p(o1)/ q(o1)

w3=p(o3)/ q(o3)…

simulatedoutcomes EU estimates decision

o1, o2, o3 q~

w1=p(o1)/ q(o1)

w3=p(o3)/ q(o3)…

Which distribution should the brain sample from?

simulatedoutcomes EU estimates decision

o1, o2, o3 q~

w1=p(o1)/ q(o1)

w3=p(o3)/ q(o3)…

Answer: Utility-Weighted Sampling (UWS)

simulationfrequency

probability

extremity

Answer: Utility-Weighted Sampling (UWS)

simulationfrequency

probability

extremity

Lieder, Hsu, Griffiths (2014)

Decisions from Experience (Ludvig, et al., 2014)

Decisions from Experience (Ludvig, et al., 2008)

Inconsistent Risk Preferences Emerge from Learning

+40/0 vs. +20

-40/ 0 vs. -20

+45/5 vs. +25

-45/-5 vs. -25

UWS Can Emerge from Reward-Modulated Associative Learning

Actions:

Outcomes: -40 -20 +20 +40

Actions:

Outcomes: -40 -20 +20 +40

Actions:

Outcomes: -40 -20 +20 +40

prediction error

Actions:

Outcomes: -40 -20 +20 +40

learning rate

prediction error

Actions:

Outcomes: -40 -20 +20 +40

learning rate

prediction error

Actions:

Outcomes: -40 -20 +20 +40

For all o≠ot:

forgetting rate

learning rate

prediction error

Learning Rule Convergences to Utility-Weighted Sampling

Utility-weighted learning converges to

Learning Rule Convergences to Utility-Weighted Sampling

Utility-weighted learning converges to

with activation function the networklearns to perform utility-weighted sampling.

Efficient coding (Summerfield & Tsetsos, 2015)

Model fitting

Maximum-Likelihood-Estimation offrom block-by-block choice frequencies inExperiments 1-4 by Ludvig et al. (2014).

A single set of parameters fits all experiments.

UWS captures that people learn to overweight extreme outcomes

+40/0 vs. +20

-40/ 0 vs. -20

+45/5 vs. +25

-45/-5 vs. -25

UWS captures that people learn to overweight extreme outcomes

+40/0 vs. +20

-40/ 0 vs. -20

+45/5 vs. +25

-45/-5 vs. -25

Utility-Weighted Sampling Captures Memory Biases (Madan et al. 2014)

Which outcome comes to mind first?

Utility-Weighted Sampling Captures Frequency Estimation Bias (Madan et al. 2014)

How often did this door lead to each outcome?

+40: ___ %+20: ___ %

0: ___ %

+40: ___ %+20: ___ %

0: ___ %

+40: ___ %+20: ___ %

0: ___ %

Biased Beliefs Predict Risk Seeking

rpeople= +0.16; p<0.05

Judged Freq. of High Gain (%)

Biased Beliefs Predict Risk SeekingrUWS = +0.23rpeople= +0.16; p<0.05

Judged Freq. of High Gain (%)

Biased Beliefs Predict Risk SeekingrUWS = +0.23rpeople= +0.16; p<0.05 rpeople = -0.48; p< 0.05

Judged Freq. of High Gain (%) Judged Freq. of Large Loss (%)

Biased Beliefs Predict Risk SeekingrUWS = +0.23 rUWS = -0.44rpeople= +0.16; p<0.05 rpeople = -0.48; p< 0.05

Judged Freq. of High Gain (%) Judged Freq. of Large Loss (%)

Conclusions

Conclusions1. Utility-weighted sampling provides a unifying explanation

for biases in memory, judgment, and decision making.

for biases in memory, judgment, and decision making.2. Utility-weighted sampling can emerge from reward-

modulated associative learning.

modulated associative learning.3. People overweight extreme events, because it is rational

to focus on the most important eventualities.

modulated associative learning.3. People overweight extreme events, because it is rational

to focus on the most important eventualities.4. Some cognitive biases may serve or reflect the rational

allocation of finite cognitive resources.

Thank you!

This work was supported by grant number N00014-13-1-0341 from the Office of Naval Research to Thomas L. Griffiths, grant number RO1 MH098023 from the National Institutes of Health to Ming Hsu, and a Berkeley Graduate Fellowship to Falk Lieder.We thank Elliot Ludvig, Quentin Huys, and Mike Pacer for their suggestions and feedback.

Poster T28

utility-weighted sampling in decisions from experience · utility-weighted sampling in decisions...

Documents

aliasing and antialiasing - unc charlotteanti-aliasing...

dynamically weighted importance sampling in monte...

weighted temporal utility · econ theory doi...

multiple importance sampling -...

ispe good practice guide: sampling for...

aashtoware brm 5.2 - etouches · multi-criteria decision...

overview of neural network architectures for graph...

weighted geometric set multicover via quasi-uniform sampling...

importance sampling for mc simulation...

bayesian nonparametric weighted sampling inference · 606...

coordinated weighted sampling for estimating aggregates over...

aalborg universitet a new air-fuel wsggm for better utility...

conï¬dence-weighted marginal utility analyses for improved

weighted expected, rank dependent, and, cumulative prospect...

weighted random sampling over data streams · the...

comparing sequential sampling models with standard random...

a class of adaptive importance sampling weighted em...

a class of adaptive importance sampling weighted em ... ·...

data utility : improvements by targeting study design and...

test report - benetton...